Ph.D., NLP researcher, data scientist, and entrepreneur.
Author of prof. Torchenstein's Pytorch course.
Stars
Conv AI
6 repositories
A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech models. Supports OpenAI, Groq, Elevanlabs, CartesiaAI, and Deepg…
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
A flexible and efficient training framework for large-scale alignment tasks
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
A powerful framework for building realtime voice AI agents 🤖🎙️📹