Gemma4 31b rope fix and ci by Gasoonjia · Pull Request #19627 · pytorch/executorch

Gasoonjia · 2026-05-18T06:47:18Z

Summary

Currently materialize_runtime_buffers in model.py was zeroing out ALL meta buffers, including each layer's inv_freq (RoPE frequencies). The follow-up attn.inv_freq.to(device) was a no-op on already-zero tensors. So RoPE produced cos=1, sin=0 for every position → model had NO positional information → introduce the period-N echo cycle pattern.

This PR fix the issue by recomputing inv_freq per-layer with real values (using the layer's head_dim, partial_rotary, rope_theta, is_sliding flag) in materialize_runtime_buffers.

Test plan

Add e2e ci for gemma4-31b model and check its output.

pytorch-bot · 2026-05-18T06:47:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19627

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 2 Active SEVs

There are 2 currently active SEVs. If your PR is affected, please view them below:

❌ 1 New Failure, 3 Unclassified Failures

As of commit 86ba97b with merge base 54f1f28 ():

NEW FAILURE - The following job has failed:

trunk / test-models-macos-coreml (resnet50) / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1

UNCLASSIFIED FAILURES - DrCI could not classify the following jobs because the workflow did not run on the merge base. The failures may be pre-existing on trunk or introduced by this PR:

Build Windows Wheels / pytorch/executorch / build-wheel-py3_10-cpu (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
Process completed with exit code 1.
Build Windows Wheels / pytorch/executorch / upload / upload-wheel-py3_10-cpu (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
Unable to download artifact(s): Artifact not found for name: pytorch_executorch__3.10_cpu_x64
Test CUDA Builds / export-model-cuda-artifact (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…mma4_31b CI - model.py: strip explanatory comment from materialize_runtime_buffers RoPE inv_freq block (keep hand-rolled formula as-is). - inference.py: revert all hf_validator + quant_compile_validator additions (--use-hf-api / --compare / --compare-quant / --prompts-file flags and their helpers); keep --bf16 HF checkpoint load path and existing prequantized / gguf flows. - .github/workflows/cuda.yml: add SocialLocalMobile/gemma-4-31B-it-HQQ-INT4 matrix entry (prequant tile-packed only) to export-model-cuda-artifact and test-model-cuda-e2e; pin to linux.aws.a100 like qwen3_5_moe. - .ci/scripts/export_model_artifact.sh: add gemma4_31b export branch mirroring qwen3_5_moe pattern. - .ci/scripts/test_model_e2e.sh: add gemma4_31b runner args + tokenizer handling.

Gasoonjia requested review from GregoryComer and lucylq as code owners May 18, 2026 06:47

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 18, 2026

Gasoonjia and others added 2 commits May 17, 2026 23:55

lint

7cffb3d

Gasoonjia force-pushed the gemma4-31b-rope-fix-and-ci branch from d0214b5 to 7cffb3d Compare May 18, 2026 06:57

lint

86ba97b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gemma4 31b rope fix and ci#19627

Gemma4 31b rope fix and ci#19627
Gasoonjia wants to merge 3 commits into
gemma4-chat-templatefrom
gemma4-31b-rope-fix-and-ci

Gasoonjia commented May 18, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented May 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Gasoonjia commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

pytorch-bot Bot commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19627

❗ 2 Active SEVs

❌ 1 New Failure, 3 Unclassified Failures

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Gasoonjia commented May 18, 2026 •

edited

Loading

pytorch-bot Bot commented May 18, 2026 •

edited

Loading