Lists (2)
Sort Name ascending (A-Z)
Stars
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
中文nlp解决方案(大模型、数据、模型、训练、推理)
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
[IEEE TIP] TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes
Official PyTorch implementation of SparseTrack
[AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video ta…
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Multi-label-image-classification with Multi-method CLIP
General Multi-label Image Classification with Transformers
Code & Models for Temporal Segment Networks (TSN) in ECCV 2016
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
[ICLR2022] official implementation of UniFormer
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
A deep learning library for video understanding research.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Flax is a neural network library for JAX that is designed for flexibility.
深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
🚀 阶跃星辰跃问YueWen Step 多模态大模型逆向API【特长:超强多模态】,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。
a state-of-the-art-level open visual language model | 多模态预训练模型
yolov5 + csl_label.(Oriented Object Detection)(Rotation Detection)(Rotated BBox)基于yolov5的旋转目标检测