
Modify how losses are computed in a multi-replicas hyperopt #2145

Open · wants to merge 6 commits into `master`
Conversation

@Radonirinaunimi (Member) commented Aug 16, 2024

Addresses the following:

  • include the Positivity filter in the hyperopt card
  • select a fraction of the replicas to compute the hyperopt $\chi^2$ losses
  • fix how $\varphi^2$ of the hold out folds are computed during hyperopt (this line)
  • select a fraction of the replica models to evaluate $\varphi^2$
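For illustration, selecting a fraction of the replicas by their $\chi^2$ could be sketched as below. The function name and exact behavior are hypothetical and are not the PR's actual implementation:

```python
import numpy as np


def select_replica_fraction(chi2_per_replica: np.ndarray, fraction: float = 0.8) -> np.ndarray:
    """Keep the best ``fraction`` of replicas (lowest chi2) for the hyperopt loss.

    Hypothetical sketch: the real selection in the PR may differ.
    """
    # Number of replicas to keep; always keep at least one
    n_keep = max(1, int(fraction * chi2_per_replica.size))
    # Sort ascending and keep the lowest-chi2 replicas
    return np.sort(chi2_per_replica)[:n_keep]
```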

@Radonirinaunimi (Member, Author) commented Aug 16, 2024

@Cmurilochem, are you actually including penalties in the losses?

At some point in the draft you mentioned that this is not the case.

@goord (Collaborator) commented Aug 18, 2024

Hi @Radonirinaunimi I'm afraid most of our GPU budget is burnt and I'm not sure the folks at SURF are willing to give us more again...

About the penalties in the hyperloss: I'm pretty sure they are excluded. Also, I received an invite for the slack channel but it won't allow me in!

@Radonirinaunimi (Member, Author) commented Aug 18, 2024

> Hi @Radonirinaunimi I'm afraid most of our GPU budget is burnt and I'm not sure the folks at SURF are willing to give us more again...

Hi @goord, I hope it was not our runs that burnt the GPUs 😬

For the paper, I think it would be realistic to perform only 250 trials. This is because, from the PDF point of view, we are doing a proof of concept, and we use a restricted parameter space anyway. Do you think even this would not be possible? At some point, @Cmurilochem was planning to run 3 more sets of 250.

> About the penalties in the hyperloss: I'm pretty sure they are excluded.

Ok, good! As it should be. I will modify the card here.

> Also, I received an invite for the slack channel but it won't allow me in!

Hmhm, what is the message that you received? Maybe @juanrojochacon knows how to solve this?

@Radonirinaunimi Radonirinaunimi changed the title Filter positivity datapoints Filter positivity datapoints in hyperopt Aug 18, 2024
@goord (Collaborator) commented Aug 18, 2024

Well we can at least run a 5-day job, there is enough budget for that. In the meantime we can explore our options for more compute (Leonardo or new pilot project on Snellius?). @Cmurilochem maybe you can find the time to start a job?

Regarding Slack: I tried again and now it works.

@juanrojochacon commented
Yes @goord @Cmurilochem, Slack has been a mess in the last few weeks but it is sorted out now; we are back on our Pro plan, so all communication can proceed there as usual. Thanks!

@Cmurilochem (Collaborator) commented
> About the penalties in the hyperloss: I'm pretty sure they are excluded.

Hi @Radonirinaunimi and @goord. Yes, I excluded penalties in all runs. So, if this is the problem, nothing to be worried about.

Also, @goord is right. We have a limited budget and I suspect that we have about a 3-4 day job left. Tomorrow I am back home and will submit it again. But we currently have more than 250 trials for sure.

@Radonirinaunimi (Member, Author) commented
> Also, @goord is right. We have a limited budget and I suspect that we have about a 3-4 day job left. Tomorrow I am back home and will submit it again.

Perfect, thanks! Please use this branch for the runs.

> But we currently have more than 250 trials for sure.

I fear that we cannot use those, unfortunately. But we should always make backups of them, just in case.

@Cmurilochem (Collaborator) commented
Hi @Radonirinaunimi and @goord. I just submitted the new hyperopt from this branch; we currently have just ~3.5 days of budget. I am currently on holiday with family, but will find some time to give you feedback on the progress of the calculation.

@Radonirinaunimi (Member, Author) commented
Unrelated: it looks like the polarized theories have been updated (cc @giacomomagni, @scarlehoff)? The C-factors are no longer present, which is why the tests are failing.

@giacomomagni (Contributor) commented Sep 13, 2024

> Unrelated: it looks like the polarized theories have been updated (cc, @giacomomagni, @scarlehoff)? Now the C-factors are no longer present. This is the reason why the tests are failing.

Maybe I forgot them during my last update... I'll check it.

EDIT: Something went wrong when removing the eko.tar; it should be okay now.

```diff
@@ -44,7 +44,7 @@
 log = logging.getLogger(__name__)


-def _average_best(fold_losses: np.ndarray, proportion: float = 0.9, axis: int = 0) -> float:
+def _average_best(fold_losses: np.ndarray, proportion: float = 0.05, axis: int = 0) -> float:
```
Review comment (Contributor):

Are you sure we want to have a default value for `proportion`?

```python
# If a proportion is allowed as a keyword argument, use 80% and 10%
# as a proxy of
# "80% of the replicas should be good, but only a small % has to cover the folds"
# The values of 80% and 10% are completely empirical and should be investigated further
```
Review comment (Contributor):

Along similar lines, maybe we can pass the values from the runcard?

Review comment (Member):

TBH, it would be a good idea. It was added there quickly for the sake of the meeting, but it would be good to have it as an input parameter.
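If the proportions were exposed through the runcard, reading them from the parsed runcard dictionary could look like the sketch below. All key names and the default fallbacks are hypothetical, chosen only to mirror the 80%/10% values discussed in the review:

```python
def read_loss_proportions(runcard: dict) -> tuple:
    """Read the replica-selection proportions from a parsed runcard dict.

    Hypothetical keys; falls back to the empirical 80%/10% defaults
    mentioned in the code comment under review.
    """
    # Sub-section holding hyperopt-loss settings (key name is an assumption)
    loss_cfg = runcard.get("hyperopt_loss", {})
    replica_proportion = loss_cfg.get("replica_proportion", 0.8)
    fold_proportion = loss_cfg.get("fold_proportion", 0.1)
    return replica_proportion, fold_proportion
```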

@Radonirinaunimi Radonirinaunimi changed the title Filter positivity datapoints in hyperopt Modify how losses are computed in a multi-replicas hyperopt Sep 30, 2024