Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ngtiendong's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Vietnam
  • 03:56 (UTC +07:00)
  • Codestin Search App in/ntdong

Block or report ngtiendong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AI Prediction api of the MusicLang package

Python 291 18 Updated Mar 25, 2024

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 7,020 638 Updated Oct 21, 2025

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 1,799 202 Updated Oct 5, 2025
Python 831 44 Updated Sep 15, 2025

An open-source RAG-based tool for chatting with your documents.

Python 24,590 2,021 Updated Jul 4, 2025

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 65,539 5,163 Updated Oct 30, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,210 657 Updated Oct 29, 2025
Python 1,116 133 Updated Oct 27, 2025

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

Python 263 58 Updated Sep 18, 2025

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…

455 32 Updated Sep 28, 2022

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 332 13 Updated Aug 15, 2025

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Python 82 11 Updated Oct 9, 2025

Production-ready platform for agentic workflow development.

TypeScript 117,681 18,180 Updated Oct 30, 2025

A curated list of awesome LLM agents frameworks.

Python 1,145 116 Updated Oct 26, 2025

The Open Source Code of UniAudio

Python 579 38 Updated Jul 22, 2024

A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.

398 26 Updated Nov 3, 2023

A curated list of resources dedicated to the safety of Large Vision-Language Models. This repository aligns with our survey titled A Survey of Safety on Large Vision-Language Models: Attacks, Defen…

157 12 Updated Oct 8, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,960 378 Updated Oct 30, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,621 300 Updated Oct 20, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 18,262 2,118 Updated Sep 24, 2025
Jupyter Notebook 98 11 Updated Dec 23, 2024

This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding.

Python 28 3 Updated Nov 14, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,268 7,418 Updated Oct 30, 2025

Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents

TypeScript 4,716 387 Updated Jul 28, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,576 1,069 Updated Oct 30, 2025

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 103 6 Updated Oct 25, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,771 933 Updated Oct 30, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,726 1,231 Updated Oct 27, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,161 1,660 Updated Sep 24, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,407 730 Updated Sep 22, 2025
Next