Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

normalize missing values #26

Open
cmungall opened this issue Sep 29, 2020 · 2 comments
Open

normalize missing values #26

cmungall opened this issue Sep 29, 2020 · 2 comments

Comments

@cmungall
Copy link
Collaborator

e.g. Not applicable, missing data, etc

@wdduncan
Copy link
Collaborator

Do we to replace with NaNs?

@cmungall
Copy link
Collaborator Author

No, this is pandas-specific. Let's start be looking at GSC guidelines:

https://gensc.org/mixs/

If a value is missing, please consider using the INSDC missing value vocabulary
Not applicable		information is inappropriate to report, can indicate that the standard itself fails to model or represent the information appropriately
Missing	Not collected	information of an expected format was not given because it has not been collected
Not provided	information of an expected format was not given, a value may be given at the later stage
Restricted access	information exists but can not be released openly because of privacy concerns

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants