Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View ChenJian7578's full-sized avatar

Block or report ChenJian7578

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,460 338 Updated Feb 3, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,340 1,606 Updated Jan 30, 2026

【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models

Python 2,303 142 Updated Jul 15, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,229 1,816 Updated Feb 26, 2025

这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。

Python 750 100 Updated Feb 18, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 85,690 12,981 Updated Feb 19, 2026

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Python 1 Updated Jun 6, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,391 5,115 Updated Feb 18, 2026

cgan(条件对抗生成网络)

Python 2 Updated Aug 30, 2022

这是一个yolo3-pytorch的源码,可以用于训练自己的模型。

Python 2,113 579 Updated Jan 26, 2024

功能: 使用阿里云智能语音服务中的录音文件识别 API,实现将视频、音频文件转写出 srt 字幕

Python 132 28 Updated Feb 2, 2022