mirror of
https://www.modelscope.cn/IndexTeam/IndexTTS-2.git
synced 2026-04-02 11:42:53 +08:00
07b8cbc64d216fbf44b9a9af264d29f51289f5e6
👉🏻 IndexTTS2 👈🏻
IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech
Contact
QQ Group: 553460296 (No.1), 1048202584 (No.2), 764630270 (No.3)
Discord: https://discord.gg/uT32E7KDmy
Email: indexspeech@bilibili.com
Everyone is welcome to join the discussion!
📣 Updates
2025/09/08 🔥🔥🔥 We release IndexTTS-2.
- The first autoregressive TTS model with precise synthesis duration control, supporting both controllable and uncontrollable modes.
- The model achieves highly expressive emotional speech synthesis, with emotion control available through multiple input modalities.
2025/05/14 🔥🔥 We release IndexTTS-1.5, which significantly improves the model's stability and its English-language performance.
2025/03/25 🔥 We release the IndexTTS-1.0 model parameters and inference code.
2025/02/12 🔥 We submitted our paper to arXiv and released our demos and test sets.
Acknowledge
📚 Citation
🌟 If you find our work helpful, please leave us a star and cite our paper.
IndexTTS2
@article{zhou2025indextts2,
title={IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech},
author={Siyi Zhou and Yiquan Zhou and Yi He and Xun Zhou and Jinchao Wang and Wei Deng and Jingchen Shu},
journal={arXiv preprint arXiv:2506.21619},
year={2025}
}
IndexTTS
@article{deng2025indextts,
title={IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System},
author={Wei Deng and Siyi Zhou and Jingchen Shu and Jinchao Wang and Lu Wang},
journal={arXiv preprint arXiv:2502.05512},
year={2025},
doi={10.48550/arXiv.2502.05512},
url={https://arxiv.org/abs/2502.05512}
}