Glossary¶
- cut-flow
- A series of “cuts” which remove events from the processing.
- data-space
- The current set of variables known to fast-carpenter and passed between stages. Stages can modify the data-space which will affect what subequent stages see. Before any stages have been run, the data-space contains only those variables given in the input datasets.
- processing config
- A YAML-based description of the way the input data should be processed.
- processing stage
- A single step in the processing chain, which can modify the data-space or produce new outputs.
- dataset config
- A YAML-based description of the input files which form the datasets to be processed.
- expression
- A string representing some mathematical manipulation of variables in the data-space.
- dataframe
- A programmatic interface to a table-like representation of data.
In the context of fast-carpenter, “dataframe” will usually refer to the
pandas.DataFrame
implementation. - jagged array
- A generalisation of a multi-dimensional numpy array where the length of each sub-array in the second (and third, and fourth, and so on) dimension can vary. For example, if each event contains a list of particles produced in that event, this would be represented by a jagged array, since there can be different numbers of particles in each event. Typically for fast-carpenter, a jagged array refers to the specific implementation from the awkward-array package