Add comments, suggestions and clean up to matchMinima.py, look for #M #1

magee256 · 2017-07-11T07:09:05Z

Here's my feedback. I gave myself like an hour and a half so it might not be too thorough but I think I suggested some pretty specific improvements. A lot of it is just style based though.

vtlim · 2017-07-12T06:11:44Z

matchMinima.py

@@ -19,6 +19,8 @@



+#M Do what OE does and define your own molecule objects, they could even 
+#M extend the OE chem definitions to save you work. 


what do you mean by this?

OE seems to have a molecule object. If you wanted to you could define your own personal molecule object that inherits from the OE molecule. That would mean your new object would have all the methods and attributes an OE molecule object has, plus whatever you wanted to add. Declare it like class MyMolecule(OEMolecule):

vtlim · 2017-07-12T06:13:15Z

matchMinima.py

    allIndices = [] # for M mols, N reference minima of each mol, P matching indices for each ref minimia
    elists = [] # 2D list: K mols per file x J numFiles
    tlists = [] # 2D list: K mols per file x J numFiles
    refNumConfs = [] # number of conformers for each mol in reference file
    molNames = [] # name of each molecule. for plotting.

-    for i, sdfQuery in enumerate(sdfList):
-        qthry = thryList[i]
+    for sdfQuery, qthry in zip(sdfList,thryList):


vtlim · 2017-07-12T06:14:43Z

matchMinima.py

@@ -432,13 +395,15 @@ def getRatioTimes(allMolTimes, zeroes):

    """

+#M Unrelated: You're compring times spent in each minima? What for?


i'm taking into acct how long it takes to optimize each conformer of a molecule. then getting an average per molecule per level of theory. that's what's being compared, the diff levels of theory

Ok so you're interested in the time it takes for optimization as well as the energy differences between levels of theory.

vtlim · 2017-07-12T06:15:08Z

matchMinima.py

    relByFile = []
    sdByFile = []
    for i, molist in enumerate(allMolTimes):
        molTimes = []
        molStds = []
        for j, filelist in enumerate(molist):
            rels = np.asarray(filelist)/np.asarray(molist[0])
+            #M I trust this is safe?


what do you mean by safe?

Does eliminating nan's like that bias your results? I assume some conformers don't finish optimization because of a timeout.

vtlim · 2017-07-12T06:15:29Z

matchMinima.py

-#    if len(trimE) != len(zeroes):
-#        print len(trimE), zeroes
-#        sys.exit("Error in determining reference confs for molecules.")
+#M Delete unused code unless you expect to add it back in soon. Reduces clutter


it's a work in progress

vtlim · 2017-07-12T06:17:22Z

matchMinima.py

@@ -580,14 +515,20 @@ def reorganizeSublists(theArray,allMolIndices):
    minimaE = []
    for i, molIndices in enumerate(allMolIndices):
        molE = [] # all conf energies from ith mol in all files
+        #M Try this:
+        #M      flipped = np.array(theArray[i]).T
+        #M      molE.append([ nan if (x == None or x == -2) else theArray[i][j][k] for ... ])


vtlim · 2017-07-12T06:18:18Z

matchMinima.py

+#M I take it there's no better way to do this?
+#M You should take a look at: https://docs.python.org/2/library/unittest.html
+#M I haven't used it myself but I really should. Also look into using assert
+#M statements


I actually need to take this out, this was from an older version

thanks for the other two references

vtlim · 2017-07-12T06:23:38Z

this was helpful, thanks. broader concepts like defining a class would be useful but probably not worth spending the time on for this particular script. will keep in mind for other applications.

magee256 · 2017-07-12T17:50:44Z

Yeah at this point it might take a while to rewrite the script to use classes. Classes do tend to make code shorter and more organized so it's good to think about where they could be used when starting a project.

merge in master

Add comments, suggestions and clean up to matchMinima.py, look for #M

a140796

vtlim reviewed Jul 12, 2017

View reviewed changes

vtlim pushed a commit that referenced this pull request Nov 29, 2017

Merge pull request #1 from vtlim/master

66d4943

merge in master

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add comments, suggestions and clean up to matchMinima.py, look for #M #1

Add comments, suggestions and clean up to matchMinima.py, look for #M #1

magee256 commented Jul 11, 2017

vtlim Jul 12, 2017

magee256 Jul 12, 2017 •

edited

Loading

vtlim Jul 12, 2017

vtlim Jul 12, 2017

magee256 Jul 12, 2017

vtlim Jul 12, 2017

magee256 Jul 12, 2017

vtlim Jul 12, 2017

vtlim Jul 12, 2017

vtlim Jul 12, 2017

vtlim Jul 12, 2017

vtlim commented Jul 12, 2017

magee256 commented Jul 12, 2017

		@@ -19,6 +19,8 @@



		#M Do what OE does and define your own molecule objects, they could even
		#M extend the OE chem definitions to save you work.

		@@ -432,13 +395,15 @@ def getRatioTimes(allMolTimes, zeroes):

		"""

		#M Unrelated: You're compring times spent in each minima? What for?

Add comments, suggestions and clean up to matchMinima.py, look for #M #1

Are you sure you want to change the base?

Add comments, suggestions and clean up to matchMinima.py, look for #M #1

Conversation

magee256 commented Jul 11, 2017

Choose a reason for hiding this comment

magee256 Jul 12, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vtlim commented Jul 12, 2017

magee256 commented Jul 12, 2017

magee256 Jul 12, 2017 •

edited

Loading