mamba2: remove hardcoded 2x expansion factor and invalid d_inner % d_state check by limloop · Pull Request #23082 · ggml-org/llama.cpp

limloop · 2026-05-15T01:48:28Z

This PR removes two unnecessary restrictions in Mamba2 that prevent loading models with custom architectures.

Changes:

- Remove hardcoded 2x expansion factor (GGML_ASSERT(2 * n_embd == d_inner))
  - The assert assumed all Mamba2 models have expand=2
  - expand is not stored in GGUF, only d_inner is
  - Removing it allows models with any expansion factor (1, 2, 3, etc.)
- Remove invalid d_inner % d_state check
  - In Mamba2, d_inner and d_state are unrelated parameters
  - This assert has no architectural justification for Mamba2

Testing:

✅ Default Mamba2 (expand=2) — loads and runs correctly
✅ Custom model (expand=1, d_inner=512, d_model=512) — loads and generates coherent output

Backward compatibility: Models with expand=2 work identically to before.
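To make the effect of the two removed checks concrete, here is a minimal Python sketch that mirrors them as plain boolean functions. It is an illustration only, not the llama.cpp code: the function names are hypothetical, and the second check is assumed to have required d_inner to be a multiple of d_state, which the description implies but does not spell out.

```python
def legacy_expand_check(n_embd: int, d_inner: int) -> bool:
    """Mirror of the removed assert: passes only when expand == 2."""
    return 2 * n_embd == d_inner


def legacy_state_check(d_inner: int, d_state: int) -> bool:
    """Assumed form of the removed d_inner % d_state check."""
    return d_inner % d_state == 0


def implied_expand(n_embd: int, d_inner: int) -> float:
    """Expansion factor implied by the stored sizes (expand itself is not in GGUF)."""
    return d_inner / n_embd


if __name__ == "__main__":
    # Default-style model: expand=2, so d_inner = 2 * 512 = 1024.
    print(legacy_expand_check(n_embd=512, d_inner=1024))
    # Custom model from the Testing section: expand=1, d_inner = d_model = 512.
    print(legacy_expand_check(n_embd=512, d_inner=512))
    print(implied_expand(n_embd=512, d_inner=512))
    # Hypothetical sizes chosen so that d_inner is not a multiple of d_state.
    print(legacy_state_check(d_inner=192, d_state=128))
```

With the old expansion check in place, the expand=1 configuration is rejected even though its sizes are internally consistent, which is the failure this PR addresses.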

Related discussion: #21346

ggml-gh-bot · 2026-05-15T01:52:35Z

Hi @limloop, thanks for your contribution!

Per our contribution guidelines, the automated PR checker found the following issue(s) that need your attention:

AI-generated content: This project does not accept PRs, descriptions or commit messages that are fully or predominantly AI-generated. If you have used AI to assist you in writing code, please make sure to disclose that explicitly.

Please note that maintainers reserve the right to make final decisions on PRs. If you believe there is a mistake, please comment below.

CISC

Do you have links to models with differing expand?

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

limloop · 2026-05-15T16:21:32Z

@CISC, here's a model with non-standard expand:

limloop/whiff-mamba2-50M-v0.1
https://huggingface.co/limloop/whiff-mamba2-50M-v0.1

Config values:

expand = 1
d_model = 512
d_inner = 512 (computed as expand * d_model)

With current llama.cpp (the hardcoded 2 * n_embd == d_inner assert), this model fails to load.

With my changes (this PR), it loads and generates coherent text.

CISC · 2026-05-15T17:12:21Z

Thanks, rebase and adjust accordingly to refactor please (moved to conversion/mamba.py).

limloop · 2026-05-15T18:56:34Z

@CISC updated, ready for review

CISC

Sorry for the long delay, I had hoped @compilade would take a look.

limloop added 2 commits May 15, 2026 04:04

mamba2: remove hardcoded 2x expansion factor, support any expand value

6597bcb

mamba2: remove invalid d_inner %% d_state check (unrelated parameters)

59b0a73

limloop requested a review from CISC as a code owner May 15, 2026 01:48

github-actions Bot added the model (Model specific) and python (python script changes) labels May 15, 2026

CISC reviewed May 15, 2026

Comment thread convert_hf_to_gguf.py Outdated

CISC requested a review from compilade May 15, 2026 12:03

Update convert_hf_to_gguf.py: make expand optional with default 2

4c49edb

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

limloop added 2 commits May 15, 2026 21:39

Merge branch 'master' into fix/mamba2-any-expand

f799381

mamba2: apply expand fix to refactored conversion/mamba.py

44c5f63

Merge branch 'ggml-org:master' into fix/mamba2-any-expand

4827a29

github-actions Bot added the conversion label Jun 25, 2026

also check for mamba_expand

ea081b5
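The commit messages "make expand optional with default 2" and "also check for mamba_expand" suggest a lookup along the following lines. This is a hedged sketch, not the code in conversion/mamba.py: the function name `resolve_d_inner`, the plain-dict config, and the `d_model` key are assumptions made for illustration; only the `expand` and `mamba_expand` names and the default of 2 come from the commits above.

```python
from typing import Any, Mapping


def resolve_d_inner(hparams: Mapping[str, Any], default_expand: int = 2) -> int:
    """Derive d_inner from a config dict, treating expand as optional.

    Hypothetical helper: tries "expand", then "mamba_expand", and falls
    back to default_expand when neither key is present.
    """
    d_model = hparams["d_model"]  # assumed key name for the model width
    expand = default_expand
    for key in ("expand", "mamba_expand"):
        value = hparams.get(key)
        if value is not None:
            expand = value
            break
    return int(expand * d_model)


if __name__ == "__main__":
    # Config values quoted earlier in this thread: expand = 1, d_model = 512.
    print(resolve_d_inner({"d_model": 512, "expand": 1}))
    # No expansion key at all: falls back to the default of 2.
    print(resolve_d_inner({"d_model": 512}))
    # Hypothetical config that uses the alternative key name.
    print(resolve_d_inner({"d_model": 512, "mamba_expand": 3}))
```

Keeping 2 as the fallback means configs that never stored an expansion factor resolve to the same d_inner as before.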

CISC approved these changes Jun 25, 2026

CISC added the merge ready label (indicating that a maintainer considers the changes final and ready to merge) Jun 25, 2026

ggerganov merged commit 960d628 into ggml-org:master Jun 26, 2026
21 of 27 checks passed

ggerganov merged 7 commits into ggml-org:master from limloop:fix/mamba2-any-expand

3 participants
