fix: set processor_repo to vision weights repo in Kimi model cards#2054
Open
Drifter4242 wants to merge 2 commits into
Open
fix: set processor_repo to vision weights repo in Kimi model cards#2054Drifter4242 wants to merge 2 commits into
Drifter4242 wants to merge 2 commits into
Conversation
The processor_repo fields for Kimi K2.5 and K2.6 pointed to the full LLM repos (moonshotai/Kimi-K2.5, moonshotai/Kimi-K2.6), which are not downloaded locally (595+ GB). The image processor must be loaded from the vision weights repo, which IS downloaded alongside the vision weights. - mlx-community/Kimi-K2.5: processor_repo -> Drifter4242/Kimi-K2.5-vision (also fixes weights_repo: davehind/ -> Drifter4242/ after account transfer) - mlx-community/Kimi-K2.6-mlx-DQ3_K_M-q8: processor_repo -> exolabs/Kimi-K2.6-vision - moonshotai/Kimi-K2.6: processor_repo -> exolabs/Kimi-K2.6-vision Without this fix vision loading fails immediately with a FileNotFoundError when trying to load the tokenizer/processor from the non-existent local path for the full model.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Just some small changes to the vision model cards. Two things: if the model isn't loaded then specifying it in the processor_repo can cause it to fail (I must have had the models downloaded before so it wasn't a problem). Also I've renamed my hf depot (old links should still work though), so I updated the links.
The rest is written by Sonnet (reviewed by me):
The processor_repo fields for Kimi K2.5 and K2.6 pointed to the full LLM repos (moonshotai/Kimi-K2.5, moonshotai/Kimi-K2.6), which are not downloaded locally (595+ GB). The image processor must be loaded from the vision weights repo, which IS downloaded alongside the vision weights.
(also fixes weights_repo: davehind/ -> Drifter4242/ after account transfer)
Without this fix vision loading fails immediately with a FileNotFoundError when trying to load the tokenizer/processor from the non-existent local path for the full model.
Motivation
The
processor_repofields for three Kimi vision model cards pointed to thefull LLM repos (
moonshotai/Kimi-K2.5,moonshotai/Kimi-K2.6). These are500–660 GB models that are not downloaded locally. When exo loads the vision
pipeline, it calls
build_model_path(ModelId(processor_repo))to construct alocal filesystem path — if that path doesn't exist, loading fails.
The image processor must be loaded from the vision weights repo, which is
downloaded automatically alongside the vision weights and is always present.
A secondary fix:
mlx-community/Kimi-K2.5hadweights_repo = "davehind/Kimi-K2.5-vision",a now-stale HuggingFace account that was transferred to
Drifter4242.Changes
mlx-community/Kimi-K2.5weights_repodavehind/Kimi-K2.5-visionDrifter4242/Kimi-K2.5-visionmlx-community/Kimi-K2.5processor_repomoonshotai/Kimi-K2.5Drifter4242/Kimi-K2.5-visionmlx-community/Kimi-K2.6-mlx-DQ3_K_M-q8processor_repomoonshotai/Kimi-K2.6exolabs/Kimi-K2.6-visionmoonshotai/Kimi-K2.6processor_repomoonshotai/Kimi-K2.6exolabs/Kimi-K2.6-visionWhy It Works
vision.pyresolvesprocessor_repoviabuild_model_path(ModelId(processor_repo)),which maps a HuggingFace repo ID to a local cache path. The full LLM repos are
never fully downloaded by vision users, so their local paths do not exist.
The vision-specific repos (
Drifter4242/Kimi-K2.5-vision,exolabs/Kimi-K2.6-vision)are small (~1–2 GB) and are downloaded as part of the normal vision weight fetch,
so their local paths are always present by the time the processor is needed.
Test Plan
Manual Testing
Hardware: 2× Mac Studio M3 Ultra 512 GB, Thunderbolt 5 direct bridge,
MlxJacclRDMA tensor-parallel (moonshotai/Kimi-K2.6, 595 GB INT4, 61 layers).moonshotai/Kimi-K2.6+exolabs/Kimi-K2.6-visionweights.Automated Testing
No code changes — model card data files only. All existing tests pass:
pytest src -m "not slow" --import-mode=importlib