Commit Graph

10 Commits

Author SHA1 Message Date
ffb4de7642 The command is as follows:
docker run -dit --name gpt-120b \
-e NVIDIA_VISIBLE_DEVICES=0,1  \
--shm-size 64g \
--network=host -v ./:/data/models \
--restart=always \
lmsysorg/sglang:dev python3 -m sglang.launch_server --model /data/models --host 0.0.0.0 --port 40002 --trust-remote-code --tp 2 --served-model-name gpt-120b --tool-call-parser gpt-oss
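Replace the model ID with your own. If you want more detail on model deployment and evaluation, search WeChat for AI云枢, which has in-depth walkthroughs plus deployment and evaluation tutorials for other models.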
2025-08-15 01:14:46 +00:00
6d460b1f00 Configure <|call|> as an EOS token (#109)
- Configure `<|call|>` as an EOS token (a44a7f14988426860a0a8145cf2bdfda2d7cadac)


Co-authored-by: Pedro Cuenca <pcuenq@users.noreply.huggingface.co>
2025-08-14 11:47:38 +00:00
110e39cb08 Update README.md
Clarify MXFP4 quantization callout
2025-08-13 13:12:36 +00:00
00330d7e17 Update chat_template.jinja (#94)
- Update chat_template.jinja (6965815ae896176f6b820fd508d15ebbca89a903)

Co-authored-by: Quentin Gallouédec <qgallouedec@users.noreply.huggingface.co>
2025-08-09 08:16:19 +08:00
c5fe96c09a Updated chat template (#82)
- Updated chat template (03fd454aae9be5799e9531726db7f6d0673675cb)

Co-authored-by: Matthew Carrigan <Rocketknight1@users.noreply.huggingface.co>
2025-08-08 06:14:10 +08:00
d62b81784f Update the model name in USAGE_POLICY (#52)
- Update USAGE_POLICY (ccef4ca7b48a5797b0b436e77f5cd0c643449942)

Co-authored-by: B. <Enes@users.noreply.huggingface.co>
2025-08-07 06:46:22 +08:00
bfbcca03ca Upload folder using ModelScope SDK 2025-08-05 18:38:51 +00:00
4a757648f3 Upload folder using ModelScope SDK 2025-08-05 18:38:50 +00:00
0422a104e2 System update meta information 2025-08-05 18:30:22 +00:00
b435140cbb System init .gitattributes 2025-08-05 18:30:21 +00:00