-
ICT, CAS
- Beijing, China
-
20:19
(UTC +08:00) - https://ycmin95.github.io/
- https://scholar.google.com/citations?user=qc2906sAAAAJ&hl=zh-CN
Stars
Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.
☑️ A simple and extensible shell script for managing your todo.txt file.
An all-in-one enhancement suite for Google Gemini - timeline navigation, folder management, prompt library, and chat export in one powerful extension.
🚀 An awesome list of curated Nano Banana pro prompts and examples. Your go-to resource for mastering prompt engineering and exploring the creative potential of the Nano banana pro(Nano banana 2) AI…
V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs
[WACV'25 Oral] Precise Integral in NeRFs: Overcoming the Approximation Errors of Numerical Quadrature
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs
Enjoy the magic of Diffusion models!
PreLAR: World Model Pre-training with Learnable Action Representation, ECCV 2024
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
Jodi: Unification of Visual Generation and Understanding via Joint Modeling
Breaking Boundary Between Pre-training and Fine-tuning with Hybrid Prompting for Knowledge-Based VQA
Codes for the WACV 2023 paper: "Semantic Guided Latent Parts Embedding for Few-Shot Learning"
official codes for our WACV 2024 paper (Interpretable Object Recognition by Semantic Prototype Analysis)
This repository contains the reference source code for the paper ["Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing"](https://openreview.net/forum?id=pEKJl5sflp) in…
This repository is the official implementation of the paper "Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation Difficulty via Attributes" in Neural Information Processing S…
Famous Vision Language Models and Their Architectures
[TPAMI 2025] Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation