Change the repository type filter
All
Repositories list
12 repositories
Fun-ASR
PublicFun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.SenseVoice
PublicMultilingual Voice Understanding ModelFunResearch
PublicThis repository is maintained by the Speech Team at Alibaba’s Tongyi Lab, serving as an open-source platform for our cutting-edge research in speech, audio, NLP technologies. We believe in accelerating scientific progress through transparent collaboration, and invite the global research community to explore, reproduce, and build upon our work.ThinkSound
Public[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.OmniAudio
PublicFunMusic
PublicA fundamental toolkit designed for music, song, and audio generationFunAudioLLM-APP
Public