From 345f52a843dc984063db606e8d0efe48b8b62d5f Mon Sep 17 00:00:00 2001 From: ai-modelscope Date: Sat, 28 Jun 2025 00:16:28 +0800 Subject: [PATCH] Update README.md --- README.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/README.md b/README.md index 0856493..1390c38 100644 --- a/README.md +++ b/README.md @@ -97,6 +97,14 @@ Comparison with 30B-70B open-source models: | MMLongBench-DOC (Acc) | 42.1 | - | 38.8 | - | +Text results, comparison with 30B-level non-thinking VLMs: + +| Benchmark (Metric) | Kimi-VL-A3B-Thinking-2506 | Qwen2.5-VL-32B | Gemma3-27B-IT | +|----------------------------|---------------------------|---------------|---------------| +| MMLU | **82.0** | 78.4 | 76.9 | +| MMLU-Pro | 68.5 | **68.8** | 67.5 | +| MATH | **91.8** | 82.2 | 89.0 | +| GPQA-Diamond | 42.3 | **46.0** | **46.0** | ## 3. Usage