Pull requests: jd-opensource/xllm
Pull requests list
feat: support hetero mlu mooncake push pd.
#1304
opened Apr 17, 2026 by
phantomlei3
Collaborator
bugfix: fix the accuracy error of NPU xattention.
#1303
opened Apr 17, 2026 by
LMX-xin
Collaborator
bugfix: clarify linear state cache allocation failure message.
#1302
opened Apr 17, 2026 by
Kang-Meng
Collaborator
perf: use fused gdn gating for qwen3.5 prefill.
#1301
opened Apr 16, 2026 by
yingxudeng
Collaborator
bugfix: add calculation of mm_embedding size in shm manager.
#1298
opened Apr 16, 2026 by
shan-chen-feng
Collaborator
feat: reuse pre-planned ExecCfg in sparse MoE prep_in.
#1297
opened Apr 16, 2026 by
yq33victor
Collaborator
feat: add conv1d_fn op for Qwen3.5 linear attention on NPU.
#1295
opened Apr 16, 2026 by
maojunx99
perf: add llm decode metadata update fast path.
#1294
opened Apr 16, 2026 by
RobbieLeung
Collaborator
refactor: refactor the allocation of kvcache.
#1293
opened Apr 16, 2026 by
XuZhang99
Collaborator
feat: update conv1d_update op for Qwen3-Next/Qwen3.5.
#1291
opened Apr 16, 2026 by
maojunx99
feat: add num_return_sequences support for rec beam search.
#1289
opened Apr 16, 2026 by
DragonFive
Collaborator
feat: add onerec 3b performance optimization and support old model.
#1286
opened Apr 15, 2026 by
DragonFive
Collaborator
feat: support REC extended item info parsing and response output.
#1282
opened Apr 15, 2026 by
DragonFive
Collaborator
feat: support Risk-Aligned Cache under Classifier-Free Guidance.
#1273
opened Apr 14, 2026 by
yiming-l21
Collaborator
feat: support /v1/audio/speech api request.
#1272
opened Apr 14, 2026 by
wxh571001500
Contributor
bugfix: add ILU process group ctor for explicit local rank. (#1240)
#1271
opened Apr 13, 2026 by
liutongxuan
Collaborator
feat: add startup profile run for mlu llm engine.
#1266
opened Apr 13, 2026 by
phantomlei3
Collaborator
bugfix: fix mtp prefix cache prefill starvation.
#1264
opened Apr 13, 2026 by
phantomlei3
Collaborator
bugfix: unify device_index logging conversion with helper overloads
#1263
opened Apr 12, 2026 by
kuma-loong
Contributor
bugfix: fix DeepSeek V3 crash and V3.2 prefix-cache OOM.
#1258
opened Apr 10, 2026 by
DongheJin
Collaborator
bugfix: fix CFG negative prompt judgment & simplify variable names fo…
#1256
opened Apr 10, 2026 by
yiming-l21
Collaborator
bugfix: remove spurious backslash breaking output redirection in launch scripts.
#1248
opened Apr 10, 2026 by
kuishou68