ZhengkunTian


[TMLR 2025] Efficient Diffusion Models: A Survey

123 5 Updated Jun 11, 2025

MiMo-VL

574 27 Updated Aug 21, 2025

Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model

38 4 Updated Jul 23, 2025

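Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments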

Python 38,443 8,749 Updated Oct 27, 2025

Multi-Character Story Generation with Dialogue Rendering

3 Updated Aug 4, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

951 53 Updated Aug 17, 2025

China Unicom's Yuanjing Wanwu Agent Platform is an enterprise-grade, multi-tenant AI agent development platform. It helps users build applications such as intelligent agents, workflows, and RAG, an…

Go 2,003 37 Updated Oct 31, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.

Python 74,353 9,693 Updated Nov 1, 2025

😎 Awesome lists about all kinds of interesting topics

410,985 32,056 Updated Oct 28, 2025

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,146 1,056 Updated Aug 5, 2024

Official Repository of "LLM × DATA" Survey Paper

525 52 Updated Oct 28, 2025

PAFTS: A Library That Preprocesses Audio for TTS.

Python 23 5 Updated Nov 15, 2024

A framework for building native applications using React

C++ 124,375 24,915 Updated Oct 31, 2025

A latent text-to-image diffusion model

Jupyter Notebook 71,719 10,516 Updated Jun 18, 2024

Latest Advances on System-2 Reasoning

Python 1,262 72 Updated Jun 8, 2025

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,285 461 Updated Oct 5, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,671 971 Updated Oct 30, 2025

A project for tri-modal LLM benchmarking and instruction tuning.

Python 48 7 Updated Mar 27, 2025

How to use wandb?

Python 681 55 Updated Sep 5, 2023

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 19,356 1,833 Updated Oct 28, 2025

A collection of AI for Science paper explainers (continuously updated); papers, datasets, and tutorials available for download at hyper.ai

2,530 396 Updated Mar 22, 2025

A visualization tool for deeper understanding and easier debugging of RLHF training.

Python 261 9 Updated Feb 20, 2025

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 336 49 Updated Oct 6, 2025

Tech Enthusiast Weekly, published every Friday

78,098 3,683 Updated Oct 31, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 959 35 Updated Oct 22, 2025

Collection of AWESOME vision-language models for vision tasks

2,985 221 Updated Oct 14, 2025

Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms

Python 4,801 567 Updated Oct 22, 2025

A curated collection of digital human resources

1,011 119 Updated Jan 8, 2025

RocketMQ enterprise-grade one-stop service platform

Java 2,224 283 Updated Oct 28, 2025

A collection of benchmarks and datasets for evaluating LLMs.

522 30 Updated Jul 13, 2024