bugfix: fix the accuracy error of NPU xattention. by LMX-xin · Pull Request #1303 · jd-opensource/xllm

LMX-xin · 2026-04-17T07:31:39Z

No description provided.

gemini-code-assist

Code Review

This pull request updates the xllm_ops subproject and refactors the NPU beam search implementation in rec_worker_impl.cpp to manually initialize beam tensors during the first round. A logic error was identified where out_token_index and out_beam_count_prefix_sums are incorrectly zeroed, which breaks functionality for batch sizes greater than one. The reviewer provided a code block to correctly calculate base indices for proper KV cache selection and output attribution.
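The reviewer's code block is not reproduced on this page. As a rough illustration of the concern, here is a minimal sketch of per-request base indices for the first beam-search round. It assumes a flattened `[batch_size * beam_width]` row layout. The struct, the function name `init_first_round`, and the exact meaning given to `out_token_index` and `out_beam_count_prefix_sums` are hypothetical and may not match xllm's real tensors or the reviewer's suggestion.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical container. The field names mirror the tensors named in the
// review, but the layout below is an assumption made for illustration only.
struct FirstRoundBeamIndices {
  std::vector<int32_t> out_token_index;             // batch_size * beam_width entries
  std::vector<int32_t> out_beam_count_prefix_sums;  // batch_size + 1 entries
};

// Assumed semantics: in the first round, every beam of request b points at
// that request's first row (b * beam_width), and the prefix sums mark where
// each request's beams start in the flattened layout.
FirstRoundBeamIndices init_first_round(int32_t batch_size, int32_t beam_width) {
  assert(batch_size >= 0 && beam_width > 0);
  FirstRoundBeamIndices r;
  r.out_token_index.resize(static_cast<std::size_t>(batch_size) * beam_width);
  r.out_beam_count_prefix_sums.resize(static_cast<std::size_t>(batch_size) + 1);
  for (int32_t b = 0; b < batch_size; ++b) {
    const int32_t base = b * beam_width;  // first row owned by request b
    for (int32_t k = 0; k < beam_width; ++k) {
      // Filling this with 0 instead would send every request to request 0's row.
      r.out_token_index[base + k] = base;
    }
    r.out_beam_count_prefix_sums[b] = base;
  }
  r.out_beam_count_prefix_sums[batch_size] = batch_size * beam_width;
  return r;
}
```

Under these assumptions, an all-zero fill equals the computed values only when `batch_size` is 1, which is consistent with the review's note that the bug affects batch sizes greater than one.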

LMX-xin requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners April 17, 2026 07:31

LMX-xin requested a review from DragonFive April 17, 2026 07:31

XuZhang99 previously approved these changes Apr 17, 2026

gemini-code-assist Bot reviewed Apr 17, 2026

Comment thread xllm/core/runtime/rec_worker_impl.cpp

LMX-xin dismissed XuZhang99’s stale review via 4e5a6de April 17, 2026 07:58

LMX-xin force-pushed the feat/xllm_npu_xattention branch from 9e444ef to 4e5a6de Compare April 17, 2026 07:58

bugfix: fix the accuracy error of NPU xattention.

512ef71

LMX-xin force-pushed the feat/xllm_npu_xattention branch from 4e5a6de to 512ef71 Compare April 20, 2026 03:28

DragonFive approved these changes Apr 20, 2026

walsonyang approved these changes Apr 20, 2026

LMX-xin merged commit 724c8ed into jd-opensource:main Apr 20, 2026
15 of 29 checks passed

maojunx99 pushed a commit to maojunx99/xllm that referenced this pull request Apr 21, 2026

bugfix: fix the accuracy error of NPU xattention. (jd-opensource#1303)

7ca5fc2
