Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Git Proxy Plugin: Detection of AI/ML usage (incl. weights, models etc.) #788

Open
shabbirflow opened this issue Nov 12, 2024 · 1 comment

Comments

@shabbirflow
Copy link

shabbirflow commented Nov 12, 2024

ABOUT

This plugin aims to detect AI/ML usage in codebase based on checks such as :

  • Identifying files with extensions commonly associated with model weights files like .h5, .pb, .pt, etc.
  • File extensions for large datasets like .csv, .xlsx
  • Scans for code that uses or requires AI/ML libraries such as Tensorflow, Pytorch, Keras, etc.
  • Files containing configuration keys & information ( such as epochs, learning_rate ) for artificial intelligence models configuration in JSON / YAML files.
  • Files containing common AI/ML functions such as tokenize, train_model, predict, evaluate, transform.

A user can customize the detection criteria by specifying which parameters to check based on requirements.

@shabbirflow
Copy link
Author

Hey @JamieSlome, I am working on this and in progress of coming up with an implementation for the checks mentioned above.
Please let me know if you’d like any adjustments to this approach or any additional checks you’d recommend including.

@shabbirflow shabbirflow changed the title Git Proxy Plugin: Detect the usage of AI/ML usage (incl. weights, models etc.) Git Proxy Plugin: Detect the usage of AI/ML (incl. weights, models etc.) Nov 12, 2024
@shabbirflow shabbirflow changed the title Git Proxy Plugin: Detect the usage of AI/ML (incl. weights, models etc.) Git Proxy Plugin: Detection of AI/ML usage (incl. weights, models etc.) Nov 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant