From f3e6023a327cdb95f819b950036c5f8edc241c90 Mon Sep 17 00:00:00 2001 From: Cherrytest Date: Tue, 14 Oct 2025 12:35:54 +0000 Subject: [PATCH] Update README.md --- README.md | 27 +++++++++++++++++++++++++-- 1 file changed, 25 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index e32a4b9..e003aac 100644 --- a/README.md +++ b/README.md @@ -58,10 +58,10 @@ This is the weight repository for Qwen3-VL-8B-Thinking. **Multimodal performance** -![](https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3-VL/table_thinking_vl_8b.jpg) +![](https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-VL/qwen3vl_4b_8b_vl_thinking.jpg) **Pure text performance** -![](https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3-VL/table_thinking_text_8b.jpg) +![](https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3-VL/qwen3vl_4b_8b_text_thinking.jpg) ## Quickstart @@ -128,6 +128,29 @@ output_text = processor.batch_decode( print(output_text) ``` +### Generation Hyperparameters +#### VL +```bash +export greedy='false' +export top_p=0.95 +export top_k=20 +export repetition_penalty=1.0 +export presence_penalty=0.0 +export temperature=1.0 +export out_seq_length=40960 +``` + +#### Text +```bash +export greedy='false' +export top_p=0.95 +export top_k=20 +export repetition_penalty=1.0 +export presence_penalty=1.5 +export temperature=1.0 +export out_seq_length=32768 (for aime, lcb, and gpqa, it is recommended to set to 81920) +``` + ## Citation