Currently, various plotting and evaluation scripts and notebooks are scattered around the repository. Create a master notebook that calculates all the results from the trainings of the three training scenarios: dm_multiclass, jet_regression, and binary_classification.
Evaluators for each of these training scenarios already exist under enreg/tools/: decay_mode_evaluator.py, regression_evaluator.py, and tagger_evaluator.py. An example of their usage can be found in notebooks/paper_performance_plotting.ipynb.
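As a rough illustration of what the evaluation part of the master notebook could look like (the class names, constructor arguments, and method below are assumptions; the actual interfaces are the ones defined under enreg/tools/ and demonstrated in notebooks/paper_performance_plotting.ipynb):

```python
# Hypothetical sketch only: class/method names are assumed, not the real API.
from enreg.tools.decay_mode_evaluator import DecayModeEvaluator   # assumed class name
from enreg.tools.regression_evaluator import RegressionEvaluator  # assumed class name
from enreg.tools.tagger_evaluator import TaggerEvaluator          # assumed class name

evaluators = {
    "dm_multiclass": DecayModeEvaluator(predictions_path="results/dm_multiclass.parquet"),          # assumed signature
    "jet_regression": RegressionEvaluator(predictions_path="results/jet_regression.parquet"),       # assumed signature
    "binary_classification": TaggerEvaluator(predictions_path="results/binary_classification.parquet"),  # assumed signature
}

for scenario, evaluator in evaluators.items():
    # Assumed convenience method that produces all plots for one scenario.
    evaluator.plot_all(output_dir=f"plots/{scenario}")
```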
The master notebook should be executable with papermill, i.e. its inputs should be declared in a cell tagged as parameters, so that the same notebook can be run programmatically for each training scenario.
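For reference, a minimal papermill invocation could look like the following; the notebook filename, output location, and parameter name are hypothetical placeholders and should match whatever the master notebook actually declares in its parameters cell:

```python
import papermill as pm

# Run one executed copy of the master notebook per training scenario.
for scenario in ["dm_multiclass", "jet_regression", "binary_classification"]:
    pm.execute_notebook(
        "notebooks/master_evaluation.ipynb",            # hypothetical input notebook
        f"results/master_evaluation_{scenario}.ipynb",  # executed copy with outputs
        parameters={"scenario": scenario},              # injected into the parameters cell
    )
```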
Ideally, the decay-mode evaluator (decay_mode_evaluator.py) would also need the capability to combine results from several evaluators in order to make direct comparisons between models. The direct comparison in mind is Fig. 6 from our most recent paper, which is currently produced with the notebooks/DM_CM.ipynb notebook.
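One possible shape for such a combined comparison, assuming each decay-mode evaluator can expose its confusion matrix (the method name is an assumption; notebooks/DM_CM.ipynb remains the reference for how Fig. 6 is actually produced):

```python
import matplotlib.pyplot as plt

def compare_decay_mode_evaluators(evaluators, labels, output_path):
    """Draw the confusion matrices of several models side by side (Fig. 6 style)."""
    fig, axes = plt.subplots(
        1, len(evaluators), figsize=(5 * len(evaluators), 4), squeeze=False
    )
    for ax, evaluator, label in zip(axes[0], evaluators, labels):
        cm = evaluator.confusion_matrix()  # assumed method on DecayModeEvaluator
        ax.imshow(cm, cmap="Blues")
        ax.set_title(label)
        ax.set_xlabel("Predicted decay mode")
        ax.set_ylabel("True decay mode")
    fig.tight_layout()
    fig.savefig(output_path)
```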
As a next step, it would be preferable if the metrics tracked during training could also be plotted within this master notebook for each training scenario; currently these comparisons are done with notebooks/foundation_model_eval.ipynb and notebooks/losses.ipynb.
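A minimal sketch of how those training metrics could be overlaid in the master notebook, assuming each training writes its tracked metrics to a JSON history file (the file layout and keys are hypothetical; notebooks/foundation_model_eval.ipynb and notebooks/losses.ipynb show where the losses actually come from):

```python
import json
import matplotlib.pyplot as plt

def plot_training_losses(history_paths, labels, output_path):
    """Overlay the validation-loss curves of several trainings for one scenario."""
    fig, ax = plt.subplots()
    for path, label in zip(history_paths, labels):
        with open(path) as f:
            history = json.load(f)  # assumed: {"epoch": [...], "val_loss": [...]}
        ax.plot(history["epoch"], history["val_loss"], label=label)
    ax.set_xlabel("Epoch")
    ax.set_ylabel("Validation loss")
    ax.legend()
    fig.savefig(output_path)
```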