Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How do we help newcomers / overview of dataframe libraries #4

Open
maartenbreddels opened this issue Sep 2, 2019 · 1 comment
Open

Comments

@maartenbreddels
Copy link

https://pyviz.org gives an overview of the pyviz landscape, should we have something for the dataframes as well?
How would one choose one library over the other, what are the strong and weak points of each library.

@xhochy xhochy mentioned this issue Sep 3, 2019
@xhochy
Copy link
Member

xhochy commented Sep 3, 2019

There are several choices one can make:

  • Does data fit into RAM?
  • If not: memory-map to disk or distributed execution?
  • Lazy or eager execution?
  • Single vs multi-core
  • CPU or GPU or something else?
  • Inplace edits allowed or fully immutable?
  • Numeric data, geographic data, string data?

Different answers will lead to different libraries or add-ons. I guess we don't have a working implementation for all answers but for a surprisingly large combination.

This though only gets you to the point to chose which DataFrame library, not where/how to store your data which is more #5.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants