- Hangzhou, China
-
23:42
(UTC +08:00) - https://vra.github.io/about
- https://www.zhihu.com/people/yunfeng-87
Highlights
Lists (31)
Sort Name ascending (A-Z)
action-recognition
Application-of-AI
audio
Awesome
Body
C++
CG
่ฎก็ฎๆบๅพๅฝขๅญฆ็ไธ่ฅฟComputer Vision
ไผ ็ป่ง่ง็ฎๆณ๏ผ่ทDLๆ ๅ ณ็CV็ฎๆณDataset
deep learning
Detection
ๆฃๆตไปปๅก็ธๅ ณ๏ผๅ ๆฌYOLO, ๆฃๆตๆกๆถ็ญdoc
e-book
Face
Face Detection, Face Alignment, Face 3DGAN
Hand
Large-Language-Models and AIGC
ๅคง่ฏญ่จๆจกๅ, AIGCmac
machine-learning
mcp
misc
nerf
Python
PythonๅบPytorch
Pytorch็ธๅ ณๅบRust
segmentation
shape3d
tts
VIM
wasm
Web
- All languages
- Assembly
- Awk
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Cuda
- Cython
- Dockerfile
- Elixir
- Emacs Lisp
- Go
- HTML
- Haskell
- JSON
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Lua
- MATLAB
- MDX
- Markdown
- Mojo
- Nim
- Objective-C
- PHP
- PureBasic
- Python
- R
- Roff
- Ruby
- Rust
- SCSS
- Sass
- Scala
- Shell
- Svelte
- Swift
- TeX
- Terra
- TypeScript
- Vim Script
- Vue
- Zig
Starred repositories
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
A command-line interface for running Supertonic TTS models using MNN.
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how tโฆ
Scaling Spatial Intelligence with Multimodal Foundation Models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
A task runner that works well with poetry or uv.
A lightweight, single-header C++11 Jinja2 template engine for LLM chat templates.
๐ Solve Rubik's Cube in 20 moves using Xiaomi AI Glasses. ็จๅฐ็ฑณ AI ็ผ้ๅจ 20 ๆญฅๅ ่ฟๅ้ญๆนใ
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
The official Soundwave repository
A framework for efficient model inference with omni-modality models
SkyRL: A Modular Full-stack RL Library for LLMs
๐ Token-Oriented Object Notation (TOON) โ Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
The definitive Web UI for local AI, with powerful features and easy setup.
GraphQA: Natural Language Graph Analysis Framework - Ask questions about any graph in natural language
Awesome Literature Graph Learning Challenges
A dataset of complex questions on semi-structured Wikipedia tables
Code for the ICSC 2025 paper "Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering"
[Paper][EMNLP 2025] SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
General technology for enabling AI capabilities w/ LLMs and MLLMs
[ACL 2024] TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Sematic Tasks
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B