Stars
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
IMO-2025 solution from the Huawei Xiaoyi AI team
This project implements some of the state-of-the-art practices in speaker verification as of 2020 and attains state-of-the-art performance on the VoxCeleb1 test sets.
Source code for the paper "Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers"
A PyTorch Implementation of End-to-End Models for Speech-to-Text
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.