CerberusX
  • SouthEast University
  • Nanjing

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,143 1,658 Updated Sep 24, 2025

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,254 130 Updated May 30, 2025

Python 1 Updated Mar 27, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,042 44 Updated May 31, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,848 2,649 Updated Aug 12, 2024

MMICL, a state-of-the-art VLM with in-context learning ability, from ICL, PKU

Python 356 14 Updated Dec 18, 2023

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,270 6,116 Updated Sep 18, 2024

Monash FIT5171 Assignment 1 Test Plan and Unit/Integration Testing on Airline Reservation System

Java 1 Updated Apr 30, 2024

An Android project for FIT5046

Java 2 1 Updated Apr 30, 2023

An open-source framework for training large multimodal models.

Python 4,030 316 Updated Aug 31, 2024

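Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…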

Python 69,431 8,381 Updated Sep 20, 2025

Sign Language Transformers (CVPR'20)

Python 1 Updated Feb 3, 2023

Sign Language Transformers (CVPR'20)

Python 280 109 Updated Jul 25, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,278 3,419 Updated Oct 27, 2025

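Automatic login for the Southeast University (SEU) information portal and automatic daily SEU health check-in, with a GPA calculator as a bonus feature. One-click deployment via GitHub Actions for automatic check-in.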

Python 108 533 Updated Nov 11, 2022