Thanks to visit codestin.com
Credit goes to github.com

zengwang430521

Follow

zengwang430521

Follow

29 followers · 2 following

Achievements

Achievements

Stars

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,721 1,209 Updated Feb 25, 2026

open-compass / MMBench-GUI

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, includi…

Python 100 6 Updated Sep 8, 2025

EPFL-VILAB / fm-vision-evals

Jupyter Notebook 72 8 Updated Jul 20, 2025

JoeLeelyf / OVO-Bench

[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Python 121 5 Updated Jul 24, 2025

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,523 8,223 Updated Feb 24, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,371 1,613 Updated Jan 30, 2026

LLaVA-VL / LLaVA-NeXT

Python 4,571 447 Updated Sep 14, 2025

rachelcao277 / LabelImage

一款在线图像标注工具（矩形、多边形、持续更新中……），可用于深度学习实例分割模型训练（Mask R-CNN）等。

JavaScript 496 95 Updated Sep 26, 2023

ZrrSkywalker / Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,650 111 Updated Jul 22, 2024

google-research-datasets / screen_annotation

The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and describe the UI elements present on the screen: their type, loca…

84 10 Updated Mar 7, 2024

google-research-datasets / screen2words

The dataset includes screen summaries that describes Android app screenshot's functionalities. It is used for training and evaluation of the screen2words models (our paper accepted by UIST'21 will …

63 Updated Jul 27, 2021

Paitesanshi / LLM-Agent-Survey

2,882 154 Updated Feb 20, 2025

boozallen / MOTIF

Python 164 21 Updated Oct 27, 2022

aburns4 / MoTIF

Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments

Jupyter Notebook 61 3 Updated Aug 19, 2024

Jamie725 / Multimodal-Object-Detection-via-Probabilistic-Ensembling

Python 168 21 Updated Jan 31, 2025

InternLM / InternLM

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,158 507 Updated Oct 30, 2025

thunlp / ToolLearningPapers

917 44 Updated Jul 24, 2024

OpenBMB / BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,786 252 Updated Dec 5, 2023

tangqiaoyu / ToolAlpaca

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

Python 884 37 Updated Oct 26, 2024

xlang-ai / xlang-paper-reading

Paper collection on building and evaluating language model agents via executable language grounding

365 13 Updated Apr 29, 2024

OpenBMB / ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,531 474 Updated May 21, 2025

LlamaFamily / Llama-Chinese

Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用

Python 14,744 1,306 Updated Apr 6, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,356 4,528 Updated Feb 23, 2026

meta-llama / llama

Inference code for Llama models

Python 59,166 9,823 Updated Jan 26, 2025

CLUEbenchmark / SuperCLUE-Llama2-Chinese

Llama2开源模型中文版-全方位测评，基于SuperCLUE的OPEN基准 | Llama2 Chinese evaluation with SuperCLUE

127 8 Updated Aug 2, 2023

openxrlab / xrmocap

OpenXRLab Multi-view Motion Capture Toolbox and Benchmark

Python 400 47 Updated Jul 1, 2025

AlvinYH / Faster-VoxelPose

[ECCV 2022] Official implementation of Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection

Python 188 21 Updated Mar 23, 2024

microsoft / voxelpose-pytorch

Official implementation of "VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment"

Python 538 93 Updated Jul 24, 2023

seanbell / opensurfaces

Crowdsourcing pipeline and website for OpenSurfaces [SIG '13] and Intrinsic Images in the Wild [SIG '14]

Python 157 41 Updated May 4, 2020

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 161,086 30,030 Updated Dec 18, 2025