IndexTTS-2

ai-models/IndexTTS-2

mirror of https://www.modelscope.cn/IndexTeam/IndexTTS-2.git synced 2026-07-16 05:32:56 +08:00

indextts 07b8cbc64d System delete file

2025-09-08 09:37:55 +00:00

| Name | Last commit message | Last updated |
|---|---|---|
| qwen0.6bemo4-merge | Upload to IndexTeam/IndexTTS-2 on ModelScope hub (batch 1/1) | 2025-09-07 15:01:41 +00:00 |
| .gitattributes | fix attributes | 2025-09-08 16:40:50 +08:00 |
| bpe.model | Upload bpe.model to ModelScope hub | 2025-09-07 14:48:45 +00:00 |
| config.yaml | Upload config.yaml to ModelScope hub | 2025-09-07 14:49:53 +00:00 |
| configuration.json | System init configuration.json | 2025-09-07 14:45:07 +00:00 |
| feat1.pt | Upload feat1.pt to ModelScope hub | 2025-09-07 14:50:25 +00:00 |
| feat2.pt | Upload feat2.pt to ModelScope hub | 2025-09-07 14:50:39 +00:00 |
| gpt.pth | Upload gpt.pth to ModelScope hub | 2025-09-07 14:57:30 +00:00 |
| README.md | Update README.md | 2025-09-08 08:47:32 +00:00 |
| s2mel.pth | Upload s2mel.pth to ModelScope hub | 2025-09-08 08:33:35 +00:00 |
| wav2vec2bert_stats.pt | Upload wav2vec2bert_stats.pt to ModelScope hub | 2025-09-07 14:48:28 +00:00 |
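
After cloning or downloading the repository, it can be useful to confirm that every entry from the listing above is present locally. The following is a minimal sketch using only the Python standard library. The `missing_entries` helper and the `checkpoints` directory name are hypothetical and are not part of IndexTTS-2; only the file names come from the listing.

```python
from pathlib import Path

# File and directory names taken from the repository listing above.
EXPECTED_ENTRIES = [
    "bpe.model",
    "config.yaml",
    "configuration.json",
    "feat1.pt",
    "feat2.pt",
    "gpt.pth",
    "s2mel.pth",
    "wav2vec2bert_stats.pt",
    "qwen0.6bemo4-merge",  # listed as a directory
]


def missing_entries(model_dir, expected=EXPECTED_ENTRIES):
    """Return the expected names that do not exist under model_dir."""
    root = Path(model_dir)
    return [name for name in expected if not (root / name).exists()]


if __name__ == "__main__":
    # "checkpoints" is an assumed local path; replace it with your own.
    missing = missing_entries("checkpoints")
    print("missing:", ", ".join(missing) if missing else "none")
```

This only checks that the entries exist. It does not verify file sizes or checksums, so a partially downloaded weight file would still pass.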

README.md

👉🏻 IndexTTS2 👈🏻

IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech

Contact

QQ Group: 553460296 (No.1), 1048202584 (No.2), 764630270 (No.3)
Discord: https://discord.gg/uT32E7KDmy
Email: indexspeech@bilibili.com
Everyone is welcome to join the discussion!

📣 Updates

2025/09/08 🔥🔥🔥 We released IndexTTS-2.
- It is the first autoregressive TTS model with precise control over synthesis duration, supporting both controllable and uncontrollable modes.
- It achieves highly expressive emotional speech synthesis, with emotion control available through multiple input modalities.
2025/05/14 🔥🔥 We released IndexTTS-1.5, which significantly improves the model's stability and its English-language performance.
2025/03/25 🔥 We released the IndexTTS-1.0 model parameters and inference code.
2025/02/12 🔥 We submitted our paper to arXiv and released our demos and test sets.

Acknowledgments

📚 Citation

🌟 If you find our work helpful, please leave us a star and cite our paper.

IndexTTS2

@article{zhou2025indextts2,
  title={IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech},
  author={Siyi Zhou and Yiquan Zhou and Yi He and Xun Zhou and Jinchao Wang and Wei Deng and Jingchen Shu},
  journal={arXiv preprint arXiv:2506.21619},
  year={2025}
}

IndexTTS

@article{deng2025indextts,
  title={IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System},
  author={Wei Deng and Siyi Zhou and Jingchen Shu and Jinchao Wang and Lu Wang},
  journal={arXiv preprint arXiv:2502.05512},
  year={2025},
  doi={10.48550/arXiv.2502.05512},
  url={https://arxiv.org/abs/2502.05512}
}
