Integrate turbomind into sgl-kernel #2999

bjmsong · 2025-01-20T06:10:20Z

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling.

zhyncs · 2025-01-20T06:12:03Z

sgl-kernel/MANIFEST.in

@@ -0,0 +1,2 @@
+include src/sgl-kernel/turbomind/lib/_turbomind_ext.cpython-312-x86_64-linux-gnu.so


Why is the Python version specified here?

because pybind11 is used in building

We can’t hardcode the Python version

.gitmodules

zhyncs · 2025-01-20T06:13:03Z

sgl-kernel/src/sgl-kernel/turbomind/__init__.py

@@ -0,0 +1 @@
+import sgl_kernel.turbomind.lib._turbomind_ext as _turbomind_ext


Please run make format

sgl-kernel/src/sgl-kernel/__init__.py

… into turbomind_integrate

zhyncs · 2025-01-20T06:17:04Z

BTW Can we only use the C++ and CUDA code from turbomind's third-party library? We can handle the pybind11 interface in the sgl-kernel, eliminating the need for turbomind runtime requirements.

zhyncs · 2025-01-20T06:44:07Z

BTW we should ensure the PR Test (sgl-kernel) is successful.

… into turbomind_integrate

zhyncs · 2025-01-26T05:27:42Z

sgl-kernel/build.sh

@@ -22,6 +22,9 @@ docker run --rm \
    export SGL_KERNEL_ENABLE_SM90A=${ENABLE_SM90A} && \
    mkdir -p /usr/lib/x86_64-linux-gnu/ && \
    ln -s /usr/local/cuda-${CUDA_VERSION}/targets/x86_64-linux/lib/stubs/libcuda.so /usr/lib/x86_64-linux-gnu/libcuda.so && \
-    cd /sgl-kernel && \
+    cd /sgl-kernel/3rdparty/turbomind && \


We should handle the build process internally within setup.py, not this way.

zhyncs · 2025-01-26T05:27:50Z

sgl-kernel/setup.py

+extra_link_args = [
+    "-Wl,-rpath,$ORIGIN/../../torch/lib",
+    "-L/usr/lib/x86_64-linux-gnu",
+    f"{str(root)}/3rdparty/turbomind/build/lib/libgemm2.a",


same as above

… into turbomind_integrate

init

aba6ed9

bjmsong requested review from zhyncs, ispobock, HandH1998, BBuf, yizhang2077 and merrymercy as code owners January 20, 2025 06:10

Merge branch 'main' into turbomind_integrate

56fb7cf

zhyncs requested changes Jan 20, 2025

View reviewed changes

mdattack added 2 commits January 20, 2025 14:14

format

9cbc71d

Merge branch 'turbomind_integrate' of https://github.com/bjmsong/sglang…

38fea7b

… into turbomind_integrate

zhyncs self-assigned this Jan 20, 2025

add test

1348fb5

bjmsong requested a review from Ying1123 as a code owner January 20, 2025 09:43

mdattack and others added 14 commits January 21, 2025 17:32

reslove conflict

0aa82f2

init shell script

151fba3

resolve conflict

d1e055d

reslove conflict

b545e1e

minor

e2b4929

build with CUDAExtension

ea3ece8

minor

e6d8606

Merge branch 'main' into turbomind_integrate

5d95885

minor

3dbbb67

Merge branch 'turbomind_integrate' of https://github.com/bjmsong/sglang…

8f146a0

… into turbomind_integrate

change build.sh

d4e0678

minor

99713f1

reslove confilict

8e2e96f

Merge branch 'main' into turbomind_integrate

2be66a8

zhyncs requested changes Jan 26, 2025

View reviewed changes

mdattack added 2 commits January 26, 2025 19:17

build turbomind in sgl-kernel && bind using torch

0ea0cea

Merge branch 'turbomind_integrate' of https://github.com/bjmsong/sglang…

1e75372

… into turbomind_integrate

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate turbomind into sgl-kernel #2999

Integrate turbomind into sgl-kernel #2999

bjmsong commented Jan 20, 2025 •

edited

Loading

zhyncs Jan 20, 2025

bjmsong Jan 20, 2025

zhyncs Jan 20, 2025 •

edited

Loading

zhyncs Jan 20, 2025

zhyncs commented Jan 20, 2025

zhyncs commented Jan 20, 2025

zhyncs Jan 26, 2025

zhyncs Jan 26, 2025

		@@ -0,0 +1,2 @@
		include src/sgl-kernel/turbomind/lib/_turbomind_ext.cpython-312-x86_64-linux-gnu.so

		@@ -0,0 +1 @@
		import sgl_kernel.turbomind.lib._turbomind_ext as _turbomind_ext

Integrate turbomind into sgl-kernel #2999

Are you sure you want to change the base?

Integrate turbomind into sgl-kernel #2999

Conversation

bjmsong commented Jan 20, 2025 • edited Loading

Checklist

zhyncs Jan 20, 2025

Choose a reason for hiding this comment

bjmsong Jan 20, 2025

Choose a reason for hiding this comment

zhyncs Jan 20, 2025 • edited Loading

Choose a reason for hiding this comment

zhyncs Jan 20, 2025

Choose a reason for hiding this comment

zhyncs commented Jan 20, 2025

zhyncs commented Jan 20, 2025

zhyncs Jan 26, 2025

Choose a reason for hiding this comment

zhyncs Jan 26, 2025

Choose a reason for hiding this comment

bjmsong commented Jan 20, 2025 •

edited

Loading

zhyncs Jan 20, 2025 •

edited

Loading