Pull requests: sgl-project/sglang
Pull requests list
[opt] replace extend_prefix_lens with extend_prefix_lens_cpu to avoid cudaStreamSync.
#7563
opened Jun 26, 2025 by
zzt93
2 of 6 tasks
Fix incomplete tool call capture issue in streaming response of DeepSeek-V3 when enable MTP
#7562
opened Jun 26, 2025 by
xianzhiT
1 of 6 tasks
[CI] Add CI Testing for Prefill-Decode Disaggregation with Router
#7540
opened Jun 25, 2025 by
key4ng
1 of 6 tasks
[AMD] Add unit-test-sgl-kernel-amd to AMD CI
#7539
opened Jun 25, 2025 by
hubertlu-tw
2 of 6 tasks
P/D load balancer forwards profiling requests to instances
#7525
opened Jun 25, 2025 by
gronsti-amd
1 of 6 tasks
[CPU] add c++ kernel to bind CPU cores and memory node
#7524
opened Jun 25, 2025 by
chunyuan-w
[CPU] remove process_group from inputs of shm_allreduce and shm_allgather
cpu
cpu backend performance optimization
intel
sgl-kernel
#7486
opened Jun 24, 2025 by
chunyuan-w
[AMD] Remove vllm's scaled_fp8_quant and moe_sum when SGLANG_USE_AITER=1
high priority
#7484
opened Jun 23, 2025 by
hubertlu-tw
3 of 6 tasks
[Feature] dynamic server payload size limit
#7475
opened Jun 23, 2025 by
khan-yin
4 of 6 tasks
fix(bench_serving): handle None tokenizer.bos_token when apply_chat_template==True
#7466
opened Jun 23, 2025 by
renne444
1 of 6 tasks
[BugFix] Destroy nccl Comm to fix cuda memory leak of destroy_model_parallel
#7465
opened Jun 23, 2025 by
wcsjtu
2 of 6 tasks
Support non-contiguous query input for extend/decode attention
cpu
cpu backend performance optimization
intel
sgl-kernel
#7462
opened Jun 23, 2025 by
yanbing-j
6 tasks
[benchmark] print final benchmark args in json format
#7455
opened Jun 23, 2025 by
staugust
1 of 6 tasks
Fix for fp8 quantization failure of qwen 2.5 VL 7B model.
high priority
#7448
opened Jun 22, 2025 by
PanJason
2 of 6 tasks
Support dynamic LoRA loading / unloading in engine/server API
ready-to-merge
The PR is ready to merge after the CI is green.
#7446
opened Jun 22, 2025 by
lifuhuang
2 of 6 tasks
OPTForCasualLM Support (facebook/opt Series)
new-model
#7440
opened Jun 22, 2025 by
b8zhong