MayoRNAseq biospecimen file does not have unique specimenIDs for all specimens #18

Open
3 tasks
avanlinden opened this issue Dec 6, 2021 · 0 comments
Labels
curation: issue related to curation or cleaning of AD portal data

Comments

@avanlinden
Collaborator

Several assays use the individualID as the specimenID, which produces duplicate specimenIDs in the biospecimen file. Each row does represent a unique sample (appending the assay name to the specimenID yields one unique specimenID per row), but the specimenIDs themselves don't reflect that.

To bring this in line with our metadata standards, this should be fixed. The fix will involve:

  • assigning new unique specimenIDs to each row in the metadata
  • changing specimenID annotations on files to match the new unique IDs
  • making sure the unique specimenIDs are compatible with existing multispecimen files (these probably just use the individualID anyway)
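
One possible way to approach the three tasks above, as a rough sketch rather than the actual curation code: append the assay name to any specimenID that occurs more than once, keep a mapping from old to new IDs for re-annotating files, and leave individualID untouched so multispecimen files keyed on it are unaffected. The `assay` column name, the `_` separator, the helper name `assign_unique_specimen_ids`, and the example rows are all assumptions for illustration and are not taken from the MayoRNAseq metadata.

```python
from collections import Counter


def assign_unique_specimen_ids(rows, id_key="specimenID", assay_key="assay", sep="_"):
    """Return (new_rows, mapping); duplicated specimenIDs get the assay name appended.

    Hypothetical helper: column names and separator are assumptions.
    """
    counts = Counter(row[id_key] for row in rows)
    new_rows = []
    mapping = {}  # (old specimenID, assay) -> new specimenID, for updating file annotations
    for row in rows:
        old_id = row[id_key]
        # Only rename IDs that are actually duplicated, to limit annotation changes.
        new_id = f"{old_id}{sep}{row[assay_key]}" if counts[old_id] > 1 else old_id
        mapping[(old_id, row[assay_key])] = new_id
        new_rows.append({**row, id_key: new_id})  # copy, so the input is not mutated

    # Fail loudly if appending the assay name was not enough to make IDs unique.
    new_ids = [row[id_key] for row in new_rows]
    if len(set(new_ids)) != len(new_ids):
        raise ValueError("specimenIDs are still not unique after appending the assay name")
    return new_rows, mapping


# Made-up example rows; the IDs and assay values are illustrative only.
example_rows = [
    {"individualID": "1001", "specimenID": "1001", "assay": "rnaSeq"},
    {"individualID": "1001", "specimenID": "1001", "assay": "snpArray"},
    {"individualID": "1002", "specimenID": "1002_TCX", "assay": "rnaSeq"},
]
fixed_rows, id_mapping = assign_unique_specimen_ids(example_rows)
```

Renaming only the duplicated IDs is a design choice to limit how many file annotations need to change; appending the assay name to every row would also work if a uniform ID format is preferred.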
@avanlinden added the curation label (issue related to curation or cleaning of AD portal data) on Dec 6, 2021