Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(download): allow fetching model files with multiple partitions #3258

Merged
merged 17 commits into from
Oct 21, 2024

Conversation

zwpaper
Copy link
Member

@zwpaper zwpaper commented Oct 10, 2024

this PR is only for reviewing the changes onto #3216, I have also raise a PR for #3216: umialpha#1

I have fork the registry and add Qwen2.5-Coder, which has a 3 files model, could be test by:

tabby serve --model zwpaper/Qwen2.5-Coder-7B-Instruct --device metal --chat-model zwpaper/Qwen2.5-Coder-7B-Instruct

@zwpaper zwpaper force-pushed the roylee/download-multiple-model-files branch from 0d52e83 to 3ce0c48 Compare October 11, 2024 16:31
Copy link

codecov bot commented Oct 15, 2024

Codecov Report

Attention: Patch coverage is 52.05479% with 105 lines in your changes missing coverage. Please review.

Project coverage is 62.93%. Comparing base (28ecaa8) to head (cd2b35e).
Report is 35 commits behind head on main.

Files with missing lines Patch % Lines
crates/tabby-download/src/lib.rs 42.14% 70 Missing ⚠️
crates/aim-downloader/src/bar.rs 40.81% 29 Missing ⚠️
crates/llama-cpp-server/src/lib.rs 0.00% 4 Missing ⚠️
crates/tabby-common/src/registry.rs 95.55% 2 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #3258      +/-   ##
==========================================
- Coverage   63.05%   62.93%   -0.12%     
==========================================
  Files         186      187       +1     
  Lines       20268    20506     +238     
==========================================
+ Hits        12779    12906     +127     
- Misses       7489     7600     +111     
Flag Coverage Δ
?

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@wsxiaoys wsxiaoys changed the title feat: download multiple model files feat(download): allow fetching model files with multiple partitions Oct 16, 2024
crates/tabby-common/src/registry.rs Outdated Show resolved Hide resolved
crates/tabby-common/src/registry.rs Outdated Show resolved Hide resolved
crates/tabby-common/src/registry.rs Outdated Show resolved Hide resolved
crates/tabby-download/Cargo.toml Show resolved Hide resolved
crates/tabby-download/src/lib.rs Show resolved Hide resolved
crates/tabby-download/src/lib.rs Outdated Show resolved Hide resolved
crates/aim-downloader/src/bar.rs Outdated Show resolved Hide resolved
crates/tabby-download/src/lib.rs Outdated Show resolved Hide resolved
@zwpaper zwpaper force-pushed the roylee/download-multiple-model-files branch from a2be452 to d03e1bf Compare October 18, 2024 16:26
let model_path = self.get_model_path(name);
// get_model_path returns the entrypoint of the model,
// will look for the file with the prefix "00001-of-"
pub fn get_model_entry_path(&self, name: &str) -> Option<PathBuf> {
Copy link
Member

@wsxiaoys wsxiaoys Oct 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does this function have to be public? please also check newly added pub function in this struct.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member

@wsxiaoys wsxiaoys left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Otherwise LGTM

@wsxiaoys wsxiaoys merged commit 779f785 into TabbyML:main Oct 21, 2024
3 of 5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants