Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DSL2: Genotyping on multiple snp sets in one run? #1077

Open
TCLamnidis opened this issue Jun 21, 2024 · 1 comment
Open

DSL2: Genotyping on multiple snp sets in one run? #1077

TCLamnidis opened this issue Jun 21, 2024 · 1 comment
Labels
DSL2 enhancement New feature or request
Milestone

Comments

@TCLamnidis
Copy link
Collaborator

It might be nice to be able to genotype on multiple SNP sets in a single run. I'm specifically thinking of pileupcaller here, not sure how it would apply to other genotypers, but:

Currently, the reference sheet takes one pileupcaller_{bed,snp} per reference. That means that if one wanted to genotype on two sets of positions, they would need to run the entire pipeline twice, or duplicate a row in the reference sheet just for that additional genotyping. Now, since the latter option will not fly with the ref-sheet validation, one would have to "fake" an entire new reference, thus duplicating all the processing, just for the extra genotypes.

Solution:
Maybe we can turn the pipleupcaller_bed/snp columns into a list column, e.g. multiple files separated by ;, that would then get split into separate channel elements with the same meta, and thus only duplicate the genotyping step?

@TCLamnidis TCLamnidis added enhancement New feature or request DSL2 labels Jun 21, 2024
@TCLamnidis TCLamnidis added this to the 3.0 Augsburg milestone Jun 21, 2024
@TCLamnidis
Copy link
Collaborator Author

Something like:

x="potato.bed;banana.bed;tomato.bed"

y=Channel.of(x)
  .flatMap{
      x -> 
      def y=x.split(';')
      y
  }
  .view()

potato.bed
banana.bed
tomato.bed

These can then be separately input into genotyping and produce their own genotypes, or get catted to produce one superset?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
DSL2 enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant