Use MBAR error as uncertainty with a single protocol repeat in RBFE #883

frannerin · 2024-07-04T13:47:43Z

When only a single repeat of the RBFE ProtocolUnit is used in a project (which is the approach in e.g., FEP+), the MBAR error estimate should be used as the uncertainty of the free energy estimate. If not, the uncertainty is 0 (applying np.std() to a list of one number returns 0).

Checklist

Added a news entry

Developers certificate of origin

I certify that this contribution is covered by the MIT License here and the Developer Certificate of Origin at https://developercertificate.org/.

pep8speaks · 2024-07-04T13:47:49Z

Hello @frannerin! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file openfecli/commands/gather.py:

Line 182:45: E261 at least two spaces before inline comment
Line 185:48: E261 at least two spaces before inline comment
Line 195:50: E261 at least two spaces before inline comment
Line 196:62: W291 trailing whitespace
Line 222:48: E261 at least two spaces before inline comment
Line 268:48: E261 at least two spaces before inline comment
Line 340:5: E266 too many leading '#' for block comment
Line 351:1: W293 blank line contains whitespace
Line 363:9: E266 too many leading '#' for block comment
Line 366:1: W293 blank line contains whitespace
Line 380:80: E501 line too long (86 > 79 characters)
Line 381:1: W293 blank line contains whitespace
Line 391:46: E261 at least two spaces before inline comment

Comment last updated at 2024-07-05 09:34:35 UTC

IAlibay · 2024-07-04T13:54:39Z

@frannerin - I think this requires more discussion. I don't believe this is the right way to do this, as you end up with a case where the return can mean two vastly different things statistically.

The programmatic approach to getting the mbar errors is via get_individual_estimates, although I recognise that it isn't the most intuitive thing nor does it play very well with the CLI. What is your use case?

frannerin · 2024-07-04T14:15:34Z

I'm running the rbfe tutorial and wanted to do everything, up to using the "gather" cli command and plotting with cinnabar, but making only 1 protocol repeat and getting !=0 errors in the plots. After, I will try my own system and ligand set but that's a different beast

IAlibay · 2024-07-04T14:19:32Z

Thanks, I think the solution here might be to fix the CLI gather command to fetch mbar errors instead if there is insufficient data to do a standard deviation (rather than changing the behaviour of the results object itself).

frannerin · 2024-07-04T16:37:35Z

So for gather the column should still be named "uncertainty (kcal/mol)" even if MBAR error est. is used (and e.g. print a warning)? I'm thinking in case there are transformations with different number of protocol repeats that are gathered together, for example if gather is run before all calculations finished, or some other weird extreme cases (or put a check/assert so that all transformations must have the same number of protocolunits)

IAlibay · 2024-07-04T21:16:19Z

So for gather the column should still be named "uncertainty (kcal/mol)" even if MBAR error est. is used (and e.g. print a warning)? I'm thinking in case there are transformations with different number of protocol repeats that are gathered together, for example if gather is run before all calculations finished, or some other weird extreme cases (or put a check/assert so that all transformations must have the same number of protocolunits)

That's a good question - and where it gets really messy.

This definitely would require a longer chat (would you be interested in joining a call about it at some point?).

A take (and not guaranteed to be the right one - definitely needs more thinking through!):

Gather should populate a "stdev (kcal/mol)" and "mbar error (kcal/mol)" column, where the former is just the standard deviation, and the latter is the propagated MBAR error (I think that would be something like sqrt(err1^2 + err2^2.. + errN^2)/ N)?
If all the samples have a standard deviation, then use the standard deviation when passing things to cinnabar.
If not, raise a UserWarning and use the MBAR errors.

Note: we're in a 1.x release, so renaming "uncertainty" might be a tad bit problematic - that might need some consideration (nothing stops us from starting a 2.0 branch though!).

This reverts commit 8ec28a7.

This reverts commit 61eec4d.

This reverts commit d2cf741.

…orrected raw report

IAlibay · 2024-07-05T10:51:47Z

@frannerin may I please suggest that we have that chat before you spend a lot of time working on this? I would hate to see you spend a lot of efforts on a solution just for us to turn around and claim we want something else.

frannerin · 2024-07-05T11:59:47Z

Hi @IAlibay yes I'm open for a chat, but I'm no expert I have to warn you, and I don't think I could contribute much. The last pushed commits where honestly for me and forgot they would show up here hehe 🐋 What you propose in your previous comment seems reasonable to me

hannahbaumann

Thank you @frannerin , this looks good to me! I think only things left would be to update the tests with the new expected outcomes as well as clean up some of the code that's commented out if that is not required?

frannerin · 2024-08-19T13:38:03Z

Hi @hannahbaumann, first of all sorry for the delay in replying.

@IAlibay raised some very reasonable concerns about this PR in #883 (comment). I would be happy to give a go to the implementation of whatever final decision is taken in terms of column names, error propagation, etc, but I don't think there is one yet? I have to say, I'm already using this code for my own system and it's going ok.

On the other hand, my other PR #884 (imo) does solve a bug that does not have as many conceptual implications as this one, and actually the fix that's in that PR is present already in this one, could someone take a look at that one?

I would also be happy to merge the updated main into any or both of these branches to see if everything is still working ok (and cleaning comments etc)

Francho Nerin Fonz and others added 2 commits July 4, 2024 11:40

RBFE protocol changes to use MBAR error for single PU repeat

d2cf741

added comments

61eec4d

fixed pep8 compliant line lengths in touched lines

8ec28a7

IAlibay and others added 5 commits July 4, 2024 22:16

Merge branch 'main' into Single-protocol-repeat_MBAR-uncertainty

ada00fe

Revert changes

c2aefd4

This reverts commit 8ec28a7.

Revert "added comments"

2dc5c03

This reverts commit 61eec4d.

Revert "RBFE protocol changes to use MBAR error for single PU repeat"

13e8acb

This reverts commit d2cf741.

Use of MBAR estim. errors in gather for single protocol repeats and c…

3323df5

…orrected raw report

jameseastwood assigned hannahbaumann and IAlibay Aug 13, 2024

hannahbaumann requested changes Aug 15, 2024

View reviewed changes

atravitz added this to the 1.2 milestone Oct 21, 2024

atravitz marked this pull request as draft November 5, 2024 15:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use MBAR error as uncertainty with a single protocol repeat in RBFE #883

Use MBAR error as uncertainty with a single protocol repeat in RBFE #883

frannerin commented Jul 4, 2024

pep8speaks commented Jul 4, 2024 •

edited

Loading

IAlibay commented Jul 4, 2024

frannerin commented Jul 4, 2024

IAlibay commented Jul 4, 2024

frannerin commented Jul 4, 2024

IAlibay commented Jul 4, 2024 •

edited

Loading

IAlibay commented Jul 5, 2024

frannerin commented Jul 5, 2024

hannahbaumann left a comment

frannerin commented Aug 19, 2024

Use MBAR error as uncertainty with a single protocol repeat in RBFE #883

Are you sure you want to change the base?

Use MBAR error as uncertainty with a single protocol repeat in RBFE #883

Conversation

frannerin commented Jul 4, 2024

Developers certificate of origin

pep8speaks commented Jul 4, 2024 • edited Loading

Comment last updated at 2024-07-05 09:34:35 UTC

IAlibay commented Jul 4, 2024

frannerin commented Jul 4, 2024

IAlibay commented Jul 4, 2024

frannerin commented Jul 4, 2024

IAlibay commented Jul 4, 2024 • edited Loading

IAlibay commented Jul 5, 2024

frannerin commented Jul 5, 2024

hannahbaumann left a comment

Choose a reason for hiding this comment

frannerin commented Aug 19, 2024

pep8speaks commented Jul 4, 2024 •

edited

Loading

IAlibay commented Jul 4, 2024 •

edited

Loading