feat: add onerec 3b performance optimization and support old model. by DragonFive · Pull Request #1286 · jd-opensource/xllm

DragonFive · 2026-04-15T09:00:36Z

Summary

align OneRec 3B NPU runtime with xllm_rec
keep legacy and 3B scaling paths gated in the NPU runtime
update xllm_atb_layers submodule to merged master commit 96d3deb2

Notes

branch keeps two commits intentionally: one for 3B runtime alignment, one for legacy/3B gating and submodule linkage
xllm_atb_layers changes are already merged into git_code master

gemini-code-assist

Code Review

This pull request enhances the NPU OneRec block layer by introducing ACLNN-based attention linear support, a decoder prefill-only execution mode, and improved handling for fused FFN and MoE expert weights. It also adds a compatibility path for specific checkpoint prefixes and updates the MoE attention mask logic. Review feedback highlights a critical memory safety issue where a local tensor is referenced via a dangling pointer in the variant pack. Furthermore, multiple style guide violations were noted, specifically regarding the use of auto for simple types, missing vector reserve() calls, and the need for parameter annotations on constant arguments.

DragonFive · 2026-04-16T04:02:53Z

/gemini review

gemini-code-assist

Code Review

This pull request updates the OneRec NPU implementation to support ACLNN-based attention linear operations, introduces a prefill-only execution path for decoders, and enhances weight loading and merging for MoE layers, including support for fused expert weights and dynamic quantization. Feedback highlights a critical thread-safety concern regarding the storage of request-specific data as member variables in the layer implementation, which could lead to data corruption in concurrent environments. Additionally, several style guide violations were noted, specifically the improper use of 'auto' for simple types and the use of plain 'int' instead of fixed-width integers.

…d-opensource#1286)

bugfix: align onerec 3b npu runtime with xllm_rec.

bdb079c

DragonFive requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners April 15, 2026 09:00

bugfix: gate onerec legacy and 3b scaling paths.

b11c793

DragonFive force-pushed the feat/onerec-rec-upstreamize-range2 branch from dcbb93e to b11c793 Compare April 15, 2026 09:01

gemini-code-assist Bot reviewed Apr 15, 2026

View reviewed changes

DragonFive changed the title ~~bugfix: align onerec 3b and legacy npu runtime with xllm_rec.~~ feat: add onerec 3b performance optimization and support old model. Apr 15, 2026

bugfix: address onerec review feedback.

5594aa2

gemini-code-assist Bot reviewed Apr 16, 2026

View reviewed changes

bugfix: address remaining onerec review feedback.

fd31036

XuZhang99 approved these changes Apr 17, 2026

View reviewed changes

msmilezz approved these changes Apr 20, 2026

View reviewed changes

DragonFive merged commit 0f5c74f into jd-opensource:main Apr 20, 2026
24 of 31 checks passed

maojunx99 pushed a commit to maojunx99/xllm that referenced this pull request Apr 21, 2026

feat: add onerec 3b performance optimization and support old model. (j…

f2b67cc

…d-opensource#1286)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add onerec 3b performance optimization and support old model.#1286

feat: add onerec 3b performance optimization and support old model.#1286
DragonFive merged 4 commits intojd-opensource:mainfrom
DragonFive:feat/onerec-rec-upstreamize-range2

DragonFive commented Apr 15, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DragonFive commented Apr 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

DragonFive commented Apr 15, 2026

Summary

Notes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DragonFive commented Apr 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants