-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
in the output *.metrics.tsv file, many metrics are 0/-nan #37
Comments
Would it be possible to provide the entire metrics file? This can be very helpful for diagnosing the issue. If not, could you at least show the count of Edit: I sincerely apologize for the late reply |
Thanks for reply. Here are the contents of the entire metrics file: |
This seems to indicate that your data was a single-ended library. If that's the case, you need to run rnaseqc using the |
Actually, my data is from pair-end library. |
Then something really bad happened during alignment. Rna-seqc is reporting 60 million unpaired reads out of your library of about 70 million reads. However, it doesn't include alternative alignments in the count of unpaired reads, and I'd hazard a guess that those are unpaired too |
All the 8 samples I analyzed by RNASEQC show 60000000 unpaired reads, which is strange. I paste here total reads number. 1.metrics.tsv:Total Reads 70438247 |
That is certainly strange. Would it be possible to share some of the fastqs? Even just a subset of reads would help us understand what is going on. |
yes, I can share some fqs, how can I share with you? |
Depending on access control requirements, you could upload them here if the data is safe to be shared 100% publicly, or you could upload them to a cloud storage platform of your choice and grant read access to [email protected] and I will take a look |
I am having the same problem, I am using paired end data and getting below results. the only difference in my gtf/bam file is that I have chr prefix both in gtf & bam will that be a problem? |
No, in fact we require that the chromosome names in the gtf and bam match. Are you also observing a significant number of unpaired reads? |
Not really. I don't have very high number of unpaired reads. |
Hello, is there an update on this issue? I'm having the same problem, I'm using paired end reads and I also have a chr prefix in my gtf and bam. I'm getting the below results: Genes Detected 0 |
Hi, are you able to share example files that reproduce the issue? |
Thank you for your quick reply. No unfortunately I cannot share the data, I was hoping a resolution to one of the above issues would help me debug. Do you know of any possible causes that I can look into? |
If it helps, I was able to run an older version of rnaseqc. I attached some of the results here. |
These are the fields in my gtf, I see that they differ slightly when compared to the example. Is gene_status a required gtf field? chr1 HAVANA exon 13221 14403 . + . gene_id "ENSG00000223972.5_4"; transcript_id "ENSG00000223972.5_4"; gene_type "transcribed_unprocessed_pseudogene"; gene_name "DDX11L1"; transcript_type "transcribed_unprocessed_pseudogene"; transcript_name "DDX11L1"; level 2; hgnc_id "HGNC:37102"; havana_gene "OTTHUMG00000000961.2_4"; remap_status "full_contig"; remap_num_mappings 1; remap_target_status "overlap"; exon_id "ENSG00000223972.5_4_4; exon_number 4"; |
Hello hello, an update, I tried a couple of things but I am able to get gene counts when I use the legacy mode flag. |
Greetings, thank you for your assistance earlier. I was able to get rnaseqc to run and detect genes in legacy mode, however I believe I'm missing some metrics in my output, such as End 1 Bases/ End 2 Bases etc. They're reported as 0, but this seems improbable. Do you know what may be the cause of this? Mapping Rate 0.959287 |
Can you provide the command used to generate these results? |
I am having the same issues as above -- I believe that I can share data, though I'll need to discuss with collaborators. I would be interested to know if there has been any progress on the issue? Edit: I initially thought this had something to do with the annotation file I am using, though nothing sticks out to me yet. However, I ran into another issue with this data set earlier that was caused by a bug in version 2.5.1a of STAR, which is the version that was used to align this data. I haven't yet run rnaseqc on the data after re-aligning it with a recent version of STAR -- I wonder if others were a) using STAR and b) if it was an older version? |
Hi @agraubert and @francois-a, I had the same issue with STAR-aligned Illumina Stranded RNA-seq BAM, and I realized it was because I did not change the default As "lower bound for exon coverage counting" by default is set to 255, nothing appears to be "high quality" since MAPQ is defined to be [0, 2^8-1] in SAM spec. Do you think it makes sense to lower this default to something more reasonable? |
I think that 255 is still a sensible default. STAR uses 255 for uniquely mapped reads and GMAP defaults to 255 unless it has reason to lower the quality. In my experience, a bam without any 255 quality reads is reason for concern. You can use the |
Thanks @agraubert for clarifying !! I don't have access to the original STAR command but I suspect it used |
No problem! And thanks for making the mapq observation in the first place. I suspect that might be the same source of the problem for most of the people in this thread |
Hello, I keep getting 0/Na for End 1 End 2 mismatch and mapping rates columns, after reading this forum I suspect it has to do with STAR. Would --alignEndsType EndToEnd be a problem? Thanks in advance -------- parameters used Sample *.starAligned.sortedByCoord.out.bam |
I am using RNASeQC 2.3.5 to evaluate some RNA-seq data. However, in the output *.metrics.tsv file, many metrics are 0/-nan. I paste some lines in my outputs files here. Here is the command I use:
rnaseqc.v2.3.5 gencode.v33.primary_assembly.genes.gtf *bam outdir
I have tried several samples and all appear this problem. Could you please help to solve the problem?
Genes Detected 0
Estimated Library Complexity 0
Genes used in 3' bias 0
Mean 3' bias 0
Median 3' bias 0
3' bias Std 0
3' bias MAD_Std 0
3' Bias, 25th Percentile 0
3' Bias, 75th Percentile 0
Median of Avg Transcript Coverage 0
Median of Transcript Coverage Std 0
Median of Transcript Coverage CV 0
Median Exon CV 0
Exon CV MAD 0
The text was updated successfully, but these errors were encountered: