Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Config files #77

Open
Jean-Baptiste-Camps opened this issue Dec 10, 2024 · 0 comments
Open

Config files #77

Jean-Baptiste-Camps opened this issue Dec 10, 2024 · 0 comments

Comments

@Jean-Baptiste-Camps
Copy link
Member

Implement the idea of having config files, giving all the options for corpus creation, which could facilitate having several sets of features. It could be some sort of yaml, like:

sampling:
  sample_size: 1000
  sample_random: True
  max_samples: 1000
features:
  feature:
    type: words
    n: 1
    feat_list: functionwords.txt
  feature:
    type: affixes
    k: 500
  feature:
    type: pos
    n: 3
    k: 500

etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant