Skip to content

Pull requests: jd-opensource/xllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: support hetero mlu mooncake push pd.
#1304 opened Apr 17, 2026 by phantomlei3 Collaborator Loading…
bugfix: fix the accuracy error of NPU xattention.
#1303 opened Apr 17, 2026 by LMX-xin Collaborator Loading…
bugfix: clarify linear state cache allocation failure message.
#1302 opened Apr 17, 2026 by Kang-Meng Collaborator Loading…
perf: use fused gdn gating for qwen3.5 prefill.
#1301 opened Apr 16, 2026 by yingxudeng Collaborator Loading…
bugfix: add calculation of mm_embedding size in shm manager.
#1298 opened Apr 16, 2026 by shan-chen-feng Collaborator Loading…
feat: reuse pre-planned ExecCfg in sparse MoE prep_in.
#1297 opened Apr 16, 2026 by yq33victor Collaborator Loading…
perf: add llm decode metadata update fast path.
#1294 opened Apr 16, 2026 by RobbieLeung Collaborator Loading…
refactor: refactor the allocation of kvcache.
#1293 opened Apr 16, 2026 by XuZhang99 Collaborator Loading…
feat: add num_return_sequences support for rec beam search.
#1289 opened Apr 16, 2026 by DragonFive Collaborator Loading…
Glm5 cp benchmark data readme
#1287 opened Apr 15, 2026 by ltdo111 Contributor Loading…
feat: add onerec 3b performance optimization and support old model.
#1286 opened Apr 15, 2026 by DragonFive Collaborator Loading…
feat: support REC extended item info parsing and response output.
#1282 opened Apr 15, 2026 by DragonFive Collaborator Loading…
feat: support Risk-Aligned Cache under Classifier-Free Guidance.
#1273 opened Apr 14, 2026 by yiming-l21 Collaborator Loading…
feat: support /v1/audio/speech api request.
#1272 opened Apr 14, 2026 by wxh571001500 Contributor Loading…
bugfix: add ILU process group ctor for explicit local rank. (#1240)
#1271 opened Apr 13, 2026 by liutongxuan Collaborator Loading…
feat: add startup profile run for mlu llm engine.
#1266 opened Apr 13, 2026 by phantomlei3 Collaborator Loading…
bugfix: fix mtp prefix cache prefill starvation.
#1264 opened Apr 13, 2026 by phantomlei3 Collaborator Loading…
bugfix: unify device_index logging conversion with helper overloads
#1263 opened Apr 12, 2026 by kuma-loong Contributor Loading…
bugfix: fix DeepSeek V3 crash and V3.2 prefix-cache OOM.
#1258 opened Apr 10, 2026 by DongheJin Collaborator Loading…
bugfix: fix CFG negative prompt judgment & simplify variable names fo…
#1256 opened Apr 10, 2026 by yiming-l21 Collaborator Loading…
perf: Qwen Image Optimize.
#1242 opened Apr 9, 2026 by shan-chen-feng Collaborator Loading…
ProTip! Updated in the last three days: updated:>2026-04-14.