diff --git a/README.md b/README.md index e32a4b9..e003aac 100644 --- a/README.md +++ b/README.md @@ -58,10 +58,10 @@ This is the weight repository for Qwen3-VL-8B-Thinking. **Multimodal performance** -![](https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3-VL/table_thinking_vl_8b.jpg) +![](https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-VL/qwen3vl_4b_8b_vl_thinking.jpg) **Pure text performance** -![](https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3-VL/table_thinking_text_8b.jpg) +![](https://qianwen-res.oss-accelerate.aliyuncs.com/Qwen3-VL/qwen3vl_4b_8b_text_thinking.jpg) ## Quickstart @@ -128,6 +128,29 @@ output_text = processor.batch_decode( print(output_text) ``` +### Generation Hyperparameters +#### VL +```bash +export greedy='false' +export top_p=0.95 +export top_k=20 +export repetition_penalty=1.0 +export presence_penalty=0.0 +export temperature=1.0 +export out_seq_length=40960 +``` + +#### Text +```bash +export greedy='false' +export top_p=0.95 +export top_k=20 +export repetition_penalty=1.0 +export presence_penalty=1.5 +export temperature=1.0 +export out_seq_length=32768 (for aime, lcb, and gpqa, it is recommended to set to 81920) +``` + ## Citation