-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(download): allow fetching model files with multiple partitions #3258
feat(download): allow fetching model files with multiple partitions #3258
Conversation
0d52e83
to
3ce0c48
Compare
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #3258 +/- ##
==========================================
- Coverage 63.05% 62.93% -0.12%
==========================================
Files 186 187 +1
Lines 20268 20506 +238
==========================================
+ Hits 12779 12906 +127
- Misses 7489 7600 +111
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
a2be452
to
d03e1bf
Compare
let model_path = self.get_model_path(name); | ||
// get_model_path returns the entrypoint of the model, | ||
// will look for the file with the prefix "00001-of-" | ||
pub fn get_model_entry_path(&self, name: &str) -> Option<PathBuf> { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does this function have to be public? please also check newly added pub
function in this struct.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is also used in llama-cpp-server
https://github.com/TabbyML/tabby/pull/3258/files#diff-7e6c7f751bfa100a469d7b7e1b68d567287a88df6425e2c8155333a229ecce59R282
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise LGTM
this PR is only for reviewing the changes onto #3216, I have also raise a PR for #3216: umialpha#1
I have fork the registry and add Qwen2.5-Coder, which has a 3 files model, could be test by: