support index-based selection by goblirsc · Pull Request #502 · HEP-FCC/FCCAnalyses

goblirsc · 2026-02-16T11:19:18Z

Add a few features to help with index-based object selection and truth-reco matching studies within FCCAnalyses:

Standard MCParticle selection structs now support both element-based and index-based selections
A new selByPredicate base struct is introduced to simplify adding future selection criteria without copy-pasting boilerplate code
Add a get_lists_of_stable_particles_from_decays function extending the existing get_list_of_stable_particles_from_decay to check multiple decaying particles in one call
Fix documentation of selRP_matched_to_list and add an index-based version. Add optional flag to also match metastable particles
Add index_range helper function to obtain range(coll.size()) for any RVec collection.
Add count_valid_indices helper function to count valid entries in a list of (selected) indices.
Add sel_byIndex helper function to move from index list to particle list
Add getVertex_matching_recoParticles function to VertexingUtils - find vertices that contain a user-specified list of recoparticles

The changes in this PR are designed to be transparent to all existing code - should require no adaptation by users.

kjvbrt · 2026-02-16T13:46:32Z

Hi @goblirsc, nice to see these :)

I believe quickest way to fix the tests would be to rebase on the latest master.

goblirsc · 2026-02-16T13:54:38Z

Hi @goblirsc, nice to see these :)

I believe quickest way to fix the tests would be to rebase on the latest master.

Thanks, missed the latest commits by a tiny bit :)
I see another change went in - should I wait and do one more rebase to be fully up to date? Or were the previous PR up to 500 sufficient for the pipeline?

kjvbrt · 2026-02-16T14:11:07Z

PRs up to 500 should be sufficient for the pipeline, the later PRs do not affect analyzers.

goblirsc · 2026-02-16T14:27:02Z

after running clang_format as suggested on the main page, the entire set of classes got reformatted. Are we missing some formatting style settings that need to be configured prior to running this?

Or is this genuine and the entire codebase was violating the formatting rules?

goblirsc · 2026-02-17T13:05:26Z

Dear @kjvbrt, can we retry the clang_format test to see if the format changes are expected or something went wrong with the formatter config?

kjvbrt · 2026-02-17T14:52:54Z

Hi @goblirsc, the test tries to do a selective format, only of the edited lines.

One can do it with the help of git

git clang-format master

You can get the clang-format git sub-command by sourcing Key4hep stack.

This reverts commit 24fc851.

goblirsc · 2026-02-17T16:13:13Z

Hi @goblirsc, the test tries to do a selective format, only of the edited lines.

One can do it with the help of git
git clang-format master
You can get the clang-format git sub-command by sourcing Key4hep stack.

Thank you! The number of format changes looks a lot more reasonable now :)

goblirsc · 2026-02-17T16:25:01Z

Hm, the CI seems to expect a different format to the one I get when I run clang_format locally.
Can I export the artifacts of the pipeline run as a patch to directly apply the changes it requests?

goblirsc · 2026-02-20T10:57:05Z

Dear @kjvbrt,

I now applied the diff printed by the formatter by hand, after git clang-format master applied a different formatting than the CI expected and clang-format -i -style=file /path/to/file.cpp modified the entire files.

To avoid this overhead in the future, is there a recipe for either reproducibly applying the formatter locally with changes consistent to what the CI expects, or for automatically applying the suggested patch from the CI to my branch?

Cheers, Max

goblirsc · 2026-02-26T10:24:25Z

Dear @kjvbrt,

in addition to the above, it seems the pipeline is set to only run when you manually trigger it. Could you please do so?
This PR already lost 10 days due to the formatter, which is a bit of a pity given there are many more cool functional improvements that could be made if we want to.

Cheers,
Max

goblirsc · 2026-03-02T09:59:46Z

next format update, please re-run the CI.

goblirsc · 2026-03-04T09:14:41Z

Dear @kjvbrt,

I think I finally figured out how to run the CI formatter stage locally :)
I added this to the Readme.md, so that other users won't run into the same issue.
Could you please trigger another CI run? Now we should be able to get past the formatter and be able to start review of the changes themselves 🤞

May I make another suggestion? Could the formatter check

either be automatically triggered when a new commit arrives (to remove the delay until a manual trigger) or
be run on the client side as a pre-commit hook to ensure that submitted changes already pass the requirement (with corresponding documentation in the readme)?

Cheers, Max

kjvbrt · 2026-03-04T10:07:21Z

+```
+ git clang-format  --style=file $(git merge-base upstream/master HEAD)
+```
+to only format the lines you changed (otherwise you will reformat the entire file). 


Thanks for figuring this out :)

Maybe (otherwise you will reformat the entire file) is a bit redundant, as the same information is a few lines above.

In any case, at some point we need to run formatter across the whole repository.

Fixed the description

Nice, maybe MR -> PR?

Fixed (caught a long-term gitlab user :) )

kjvbrt · 2026-03-04T10:30:49Z

    bool  operator() (ROOT::VecOps::RVec<edm4hep::MCParticleData> in);
  };

+  /// @brief Helper struct to select entries matching a certain predicate.


Could you also provide an example of usage?

Hi, sure - would you prefer this in the doxygen doc or on the PR?
The replacements for the selectors below this definition can serve as "inline" examples to refer to.

The general pattern is

struct mySelection: selByPredicate{ mySelection(): selByPredicate(some_selection_function){} };

Where some_selection_function is a function returning a "keep" decision for a given input object. Using lambda captures, the function can be configured with constructor arguments - for example, a configurable cut threshold.

This object can then be used to obtain a copy-vector of passed objects from an input object list, or a set of passing indices, or a set of passing element for each of a list of input index vectors.

The main motivation is that we save a lot of common boilerplate code when implementing selection functions (loop over containers, output allocation, copy operations, ...). The pattern also ensures that deep-copies are avoided where possible. We also gain the ability to apply the same selection functor transparently on index-based or copy-based selection logic, avoiding a need to duplicate logic if both are to be supported.

It would actually be more elegant to generalise this to a template consuming an arbitrary input object type, rather than being restricted to the MCParticleData class.

Nice explanation, I think having it in Doxygen would be the best. Would you also add an example how to use this when working with the dataframe in Python? I mean a snipped a user might write.

Added explanation for doxygen, including snippets. Also moved the selByPredicate to Utils, making it available more generally than before.

goblirsc force-pushed the MG_indexBasedSelCP branch from 9be047e to 4591373 Compare February 16, 2026 13:52

goblirsc added 4 commits February 16, 2026 15:26

support index-based selection

fce2d4a

fix a bug

8583260

add one more helper

a1413ca

clang_format

24fc851

goblirsc force-pushed the MG_indexBasedSelCP branch from 900ab16 to 24fc851 Compare February 16, 2026 14:26

goblirsc added 2 commits February 17, 2026 17:11

Revert "clang_format"

547242a

This reverts commit 24fc851.

format fixes

67704a9

the formatter strikes again

598ad75

while (!CI_happy) run(formatter)

4ff8c17

goblirsc added 2 commits March 4, 2026 10:09

yay

1637717

document how to use clang format

7f736d2

kjvbrt reviewed Mar 4, 2026

View reviewed changes

kjvbrt mentioned this pull request Mar 4, 2026

Run C++ and Python formatters across the whole repo #508

Open

goblirsc and others added 3 commits March 4, 2026 11:51

clarify readme

5dcba23

move selByPredicate to Utils, add more doc

609d6e4

clang format

6ff7a38

Conversation

goblirsc commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kjvbrt commented Feb 16, 2026

Uh oh!

goblirsc commented Feb 16, 2026

Uh oh!

kjvbrt commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

goblirsc commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

goblirsc commented Feb 17, 2026

Uh oh!

kjvbrt commented Feb 17, 2026

Uh oh!

goblirsc commented Feb 17, 2026

Uh oh!

goblirsc commented Feb 17, 2026

Uh oh!

goblirsc commented Feb 20, 2026

Uh oh!

goblirsc commented Feb 26, 2026

Uh oh!

goblirsc commented Mar 2, 2026

Uh oh!

goblirsc commented Mar 4, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

goblirsc commented Feb 16, 2026 •

edited

Loading

kjvbrt commented Feb 16, 2026 •

edited

Loading

goblirsc commented Feb 16, 2026 •

edited

Loading