Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

explore TPOT package for more automated transform, feature and model selection #20

Open
realmarcin opened this issue Sep 25, 2020 · 0 comments

Comments

@realmarcin
Copy link
Collaborator

... this is for once we have a usable numeric matrix for analysis/ML.

The TPOT package is actually an amazing feat I think, wish they had made it sooner!
https://github.com/EpistasisLab/tpot

The notebook I committed:
https://github.com/realmarcin/biosample-analysis/blob/master/notebooks/first_analysis_notebook.ipynb

is a pipeline prototype going all the way from the original TSV (or some subset), to a numeric matrix passed to DecisionTree and viz. Lots more can be added, but this pipeline could already be plugged into TPOT.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant