You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Public data repositories such as GEO require md5 sums to be included in the metasheet containing the experimental details. This can be quite a painful process to carry out down the line but it's something that can be automated during the pipeline. I'd imagine that the md5 sums would be generated for the raw fastqs, processed bam, peaks and bigwig files. Would also have to parse the Picard insert size metrics file to get the insert mean and std dev. These could all be collected at the end of the pipeline and placed in a tsv file that could then be copy and pasted appropriately into the metasheet.
The text was updated successfully, but these errors were encountered:
Public data repositories such as GEO require md5 sums to be included in the metasheet containing the experimental details. This can be quite a painful process to carry out down the line but it's something that can be automated during the pipeline. I'd imagine that the md5 sums would be generated for the raw fastqs, processed bam, peaks and bigwig files. Would also have to parse the Picard insert size metrics file to get the insert mean and std dev. These could all be collected at the end of the pipeline and placed in a tsv file that could then be copy and pasted appropriately into the metasheet.
The text was updated successfully, but these errors were encountered: