Stars
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
A comprehensive tool for processing and analyzing video footage, producing detailed insights into gameplay and player performance enhancing game understanding and performance evaluation.
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Learning Convolutional Neural Networks with Interactive Visualization.
We write your reusable computer vision tools. 💜
Alex Krizhevsky's original code from Google Code
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Doctor Dignity is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
Official ECCV 2022 repository for SUPR: A Sparse Unified Part-Based Human Representation
Self-Supervised Learning of 3D Human Pose using Multi-view Geometry (CVPR2019)
Real-Time end-to-end 2D-to-3D Video Conversion, based on deep learning.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[CVPR2024, Highlight] Official code for DragDiffusion
BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability
yiyu / nanoGPT
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
HumanML3D: A large and diverse 3d human motion-language dataset.
PyTorch code and models for the DINOv2 self-supervised learning method.
Integrate cutting-edge LLM technology quickly and easily into your apps
Painter & SegGPT Series: Vision Foundation Models from BAAI
🐍 Geometric Computer Vision Library for Spatial AI
MediaPipe(Python版)を用いて手の姿勢推定を行い、検出したキーポイントを用いて、簡易なMLPでハンドサインとフィンガージェスチャーを認識するサンプルプログラムです。(Estimate hand pose using MediaPipe(Python version). This is a sample program that recognizes hand signs and…
A collaboration friendly studio for NeRFs
yiyu / omni3d
Forked from facebookresearch/omni3dCode release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"