diff --git a/README.md b/README.md index d59bf3a..d5d9715 100644 --- a/README.md +++ b/README.md @@ -28,7 +28,7 @@ library_name: transformers  | 🤖️ ModelScope  | - 📔 Technical Report + 📔 Technical Report  |
@@ -203,7 +203,7 @@ Example script ```py from transformers import AutoModel, AutoModelForCausalLM, AutoTokenizer -model_id = "XiaomiMiMo/MiMo-7B-Base" +model_id = "XiaomiMiMo/MiMo-7B-RL" model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True) tokenizer = AutoTokenizer.from_pretrained(model_id) inputs = tokenizer(["Today is"], return_tensors='pt') @@ -221,16 +221,18 @@ print(tokenizer.decode(output.tolist()[0])) ## V. Citation ```bibtex -@misc{xiaomi2025mimo, - title={MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining}, +@misc{coreteam2025mimounlockingreasoningpotential, + title={MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining}, author={{Xiaomi LLM-Core Team}}, year={2025}, + eprint={2505.07608}, + archivePrefix={arXiv}, primaryClass={cs.CL}, - url={https://github.com/XiaomiMiMo/MiMo}, + url={https://arxiv.org/abs/2505.07608}, } ``` ## VI. Contact -Please contact us at [mimo@xiaomi.com](mailto:mimo@xiaomi.com) or open an issue if you have any questions. \ No newline at end of file +Please contact us at [mimo@xiaomi.com](mailto:mimo@xiaomi.com) or open an issue if you have any questions. diff --git a/config.json b/config.json index 6a1be49..b7ccfc0 100644 --- a/config.json +++ b/config.json @@ -2,6 +2,9 @@ "architectures": [ "MiMoForCausalLM" ], + "tokenizer_class":[ + "Qwen2Tokenizer" + ], "auto_map": { "AutoConfig": "configuration_mimo.MiMoConfig", "AutoModel": "modeling_mimo.MiMoModel",