Thanks to visit codestin.com
Credit goes to github.com

ConvAndConv

Follow

ConvAndConv

Follow

1 follower · 1 following

Lists (2)

Sort

🔮 Future ideas

project

准备的项目

Stars

datawhalechina / llms-from-scratch-cn

仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理

Jupyter Notebook 3,960 541 Updated Aug 15, 2024

yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,778 447 Updated Aug 5, 2025

liguodongiot / llm-action

本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）

HTML 23,164 2,691 Updated Dec 30, 2025

holmescao / TOPICTrack

[IEEE TIP] TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes

Python 458 47 Updated Mar 15, 2025

vukasin-stanojevic / BoostTrack

Python 216 31 Updated Jun 3, 2025

hustvl / SparseTrack

Official PyTorch implementation of SparseTrack

Python 163 14 Updated Mar 6, 2025

NJU-PCALab / STTrack

[AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking

Python 116 4 Updated May 18, 2025

PaddlePaddle / PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video ta…

Python 1,679 387 Updated Feb 12, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 70,902 9,837 Updated Feb 16, 2026

CV-Magician / MMM-CLIP

Multi-label-image-classification with Multi-method CLIP

Python 26 2 Updated Apr 11, 2024

QData / C-Tran

General Multi-label Image Classification with Transformers

Python 281 44 Updated Nov 2, 2024

TommyZihao / vlm_arm

机械臂+大模型+多模态=人机协作具身智能体

Jupyter Notebook 1,108 190 Updated Jun 23, 2025

yjxiong / temporal-segment-networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Python 1,577 474 Updated Oct 27, 2020

alibaba-mmai-research / TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Python 241 33 Updated Aug 23, 2023

mit-han-lab / temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python 2,181 423 Updated Jul 11, 2024

OpenGVLab / UniFormerV2

[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Python 339 20 Updated Apr 2, 2024

Sense-X / UniFormer

[ICLR2022] official implementation of UniFormer

Python 896 115 Updated Mar 29, 2024

HHTseng / video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

Jupyter Notebook 971 219 Updated Dec 7, 2020

facebookresearch / pytorchvideo

A deep learning library for video understanding research.

Python 3,543 432 Updated Jan 12, 2026

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,288 1,293 Updated Feb 18, 2026

jax-ml / jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,900 3,425 Updated Feb 19, 2026

google / flax

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 7,077 788 Updated Feb 17, 2026

PaddlePaddle / awesome-DeepLearning

深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI

Jupyter Notebook 3,572 857 Updated Jul 25, 2024

ConvAndConv / AI-learning-Path

The experience of learning AI

1 Updated May 30, 2024

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,225 1,175 Updated Mar 14, 2025

dragon9001 / MMG-page

home page of Multi-Modal Learning.

CSS 15 3 Updated Sep 28, 2018

lonePatient / awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models，高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 5,516 511 Updated Feb 16, 2026

LLM-Red-Team / step-free-api

🚀 阶跃星辰跃问YueWen Step 多模态大模型逆向API【特长：超强多模态】，支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话，零配置部署，多路token支持，自动清理会话痕迹，仅供测试，如需商用请前往官方开放平台。

TypeScript 249 100 Updated Dec 16, 2024

zai-org / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,725 448 Updated May 29, 2024

hukaixuan19970627 / yolov5_obb

yolov5 + csl_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）基于yolov5的旋转目标检测

Python 1,936 429 Updated Oct 13, 2023