Add PandoraLArRecoNDBranchFiller #76

jback08 · 2024-08-22T14:50:21Z

Created the PandoraLArRecoNDBranchFiller class to store the reconstruction information from Pandora's LArRecoND package (used for the DUNE ND). It requires a ROOT file created by the HierarchyAnalysisAlgorithm, which uses Pandora's Hierarchy Tools.

The filled reco branches are rec.nd.lar.pandora.tracks and .rec.nd.lar.pandora.showers. No distinction is made (yet) between tracks and showers, and so the same 3D-cluster Particle Flow Objects (PFOs) are used for both. Here is the list of reco variables used:

Start position = cluster vertex point or the first hit position if no vertex is available
End position = cluster end position
Energy = charge Q
Length = primary principal axis length
Quality = number of 3D hits.

Any changes needed here can be done mostly in the LArRecoND package, since the CAF just retrieves the output stored by Pandora's hierarchy algorithm.

The MC truth matching uses Pandora's Hierarchy Tools and not TruthMatcher. Pandora requires all MC particles to have a unique ID (even if they originate from different neutrinos), and so the following offsets are applied to the original (ndlar-flow) MC Ids:

nuId = orig_nuId + 10^8
mcId = orig_mcId + nuIndex * 10^6

where nuIndex = 0 to N-1 for an event with N neutrinos, and the orig_mcId number range restarts from zero for each neutrino. This ensures all IDs are unique, for up to 100 neutrino interactions per event, each containing up to 10^6 hits.

These ID offsets are reversed when the Pandora CAF truth information is filled using the best reco-MC match achieved with Hierarchy Tools, and so the mcId's should then have values consistent with the equivalent input ndlar-flow files. The filled truth information corresponds to:

truth.ixn = orig_nuId
truth.part = orig_mcId (best match)
truth.type = primary, secondary or other
truthOverlap = completeness (best match)

H5 input files are first converted to ROOT using h5_to_root_ndlarflow.py before they are used by LArRecoND & Pandora, which in turn creates the hierarchy analysis output file that can be used to make the equivalent CAFs.

For the trigger, the event run numbers are propagated using those originally from the input ndar_flow ROOT files. Also, the start time is currently set using the ts_start variable, but this needs to be updated (in LArRecoND) to use the correct time and units.

Also updated ndcaf_setup.sh and build.sh to use a consistent environment.

Updated ndcaf_setup.sh and build.sh to use a consistent environment.

noeroy · 2024-08-22T16:20:32Z

I think that the truth part of it could be an issue, as the truth branches are shared by all reconstructions.

The particles are uniquely identified with two ids, one that identifies the interaction inside a given spill: vertex_id in the flow files, and one that is uniquely identifying for a given interaction which is the Geant4 trajectory id, I believe it is traj_id in the flow files. Those IDs are shared within MLReco and MINERvA so that truth particles are matched to the same id. Would it be possible to have a match between orig_nuId and vertex_id and between orig_mcId and traj_id?

…raLArRecoND

jback08 · 2024-10-03T15:01:37Z

I think this is ready for review now. We have added the unix_ts variable to set the trigger time, and we are using the same MC Id variables as the other reco methods.

noeroy · 2024-10-04T14:21:12Z

src/reco/PandoraLArRecoNDBranchFiller.cxx

+	const int traj_id = mcId - nuIndex*m_maxMCId;
+
+	caf::TrueParticleID trueID;
+	trueID.ixn = vertex_id;


Tracks and shower truth.ixn are not directly the vertex_id, but the position of the neutrino interaction associated to that vertex_id in the truth interaction stack.

I think you can have access to it by doing:

caf::SRTrueInteraction & srTrueInt = truthMatch->GetTrueInteraction(sr, vertex_id, false); // Gets the true interaction in the stack int srTrueIntIdx = std::distance(sr.mc.nu.begin(), std::find_if(sr.mc.nu.begin(), sr.mc.nu.end(), [&srTrueInt](const caf::SRTrueInteraction& ixn) {return ixn.id == srTrueInt.id;})); // Get the position of the truth interaction

Also I see that particles that are not "primaries" have a mcNuId == 0 so that makes the truth matching tricky for them.

And finally, I'm not sure the vertex_id has the same definition as in the other part of the chain production chain.

For instance, in entry 2 of the Pandora MiniRun6 file 1, you have a particle with mcNuId==100100016, hence vertex_id = 100016 while vertex_id has for definition 1E6 * TaggedRunId + EdepSimEventId so should be >1e6 for the file 1. in that case the equivalent runID in the other files is 10000016.

Furthermore, rock muons vertex_id are expected to be at the 1e15 order, but I don't see any big number in that file.

The mcNuId is truncated in the H5-to-ROOT script to go up to 10^6 here. This affects all vertex_id numbers, including rock muons.

We store eventID and run numbers, where eventID normally goes from 0 to N (with some numbers skipped), but it looks like run is always set to zero. We don't use TaggedRunId. Is eventID = EdepSimEventId? For an event with N neutrinos, what are the typical values of TaggedRunId and EdepSimEventId?

So vertex_id is defined here. That's the one in Flow hence the one used in Pandora if I'm not mistaken.
The way I understand the code you sent me, it seems that it takes the 1st digit of the vertex ID, the last 5 digits and bunch them together. I'm not sure how you can recover the initial definition vertex_id from that.

For current Minirun files, for instance run 15, TaggedRunId varies between 15*10 and 16*10 for the neutrino events and 15*10+1e9 - 16*10 1e9 for rock muons. So basically the run_number *10 annd (run_number +1) *10, and same +1e9 for rock muons. The ratio 10 TaggedRunId per file is not set in stone though.

For EdepSimEventId, for a MiniRun file, it can go up to 30 000 in what I've seen, but I wouldn't say it's an absolute limit.

noeroy · 2024-10-04T14:40:24Z

src/reco/PandoraLArRecoNDBranchFiller.cxx

+	} else {
+	    trueID.type = caf::TrueParticleID::kSecondary;
+	}
+	trueID.part = traj_id;


Tracks and showers truth.part are not directly the traj_id.
Is that a unique ID created inside Pandora or the original Trajectory id from Edepsim/G4?

if it's the latest, then that variable can be retrieved the following way:

caf::SRTrueParticle & srTruePart = isPrimary? truthMatch->GetTrueParticle(sr, srTrueInt, traj_id, true, false) : truthMatch->GetTrueParticle(sr, srTrueInt, traj_id, false, true); //If it's a primary, the particle is already stored in the particle stack so we just want to retrieve it, if it's not we might want to create a new particle if it wasn't created originally. if (truePartID.type == caf::TrueParticleID::kPrimary) truePartID.part = traj_id; //We could be a bit smarter like what's done for the MLNDLArRecoBranchFiller but that works else { truePartID.part = std::distance(srTrueInt.sec.begin(), std::find_if(srTrueInt.sec.begin(), srTrueInt.sec.end(), [traj_id](const caf::SRTrueParticle& part) { return part.G4ID == traj_id; })); // we just filled it so it should be fine }

That way you also make sure that the truth matcher algorithm save the particle of interest if that was not done before.

noeroy · 2024-10-04T14:40:42Z

src/reco/PandoraLArRecoNDBranchFiller.cxx

+	const int traj_id = mcId - nuIndex*m_maxMCId;
+
+	caf::TrueParticleID trueID;
+	trueID.ixn = vertex_id;


Same comment as for tracks

noeroy · 2024-10-04T14:40:58Z

src/reco/PandoraLArRecoNDBranchFiller.cxx

+	} else {
+	    trueID.type = caf::TrueParticleID::kSecondary;
+	}
+	trueID.part = traj_id;


Same Comment as for tracks

jback08 added 2 commits August 20, 2024 17:45

Add PandoraLArRecoND reco branch filler & example pandora.fcl.

54e23b4

Updated ndcaf_setup.sh and build.sh to use a consistent environment.

Set Pandora track length using start and end points.

58cd6b9

noeroy self-assigned this Aug 22, 2024

jback08 added 4 commits August 27, 2024 17:16

Use isShower flag to check if PFOs are showers or tracks.

4bd318d

Add unix_ts trigger time and update MCId variable names

8f9264f

Merge branch 'main' of github.com:DUNE/ND_CAFMaker into feature/Pando…

89c4003

…raLArRecoND

Add RecoFillerType to PandoraLArRecoNDBranchFiller.

611c011

noeroy self-requested a review October 7, 2024 16:17

noeroy reviewed Oct 7, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add PandoraLArRecoNDBranchFiller #76

Add PandoraLArRecoNDBranchFiller #76

jback08 commented Aug 22, 2024

noeroy commented Aug 22, 2024

jback08 commented Oct 3, 2024

noeroy Oct 4, 2024

noeroy Oct 4, 2024

noeroy Oct 4, 2024

jback08 Oct 7, 2024

noeroy Oct 7, 2024

noeroy Oct 4, 2024

noeroy Oct 4, 2024

noeroy Oct 4, 2024

Add PandoraLArRecoNDBranchFiller #76

Are you sure you want to change the base?

Add PandoraLArRecoNDBranchFiller #76

Conversation

jback08 commented Aug 22, 2024

noeroy commented Aug 22, 2024

jback08 commented Oct 3, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment