View Marxlp's full-sized avatar


[IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

Jupyter Notebook 85 6 Updated Jul 3, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,452 542 Updated May 18, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,272 208 Updated Oct 27, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,699 928 Updated Oct 27, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,591 2,239 Updated Feb 1, 2025

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

555 27 Updated May 2, 2024

Codebase for Automated Creation of Digital Cousins for Robust Policy Learning

Python 230 20 Updated Mar 31, 2025

Markdown to PDF conversion tool

Python 374 46 Updated Mar 6, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 61,524 9,150 Updated Oct 27, 2025

The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review".

253 26 Updated Dec 23, 2023

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,399 200 Updated May 7, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,225 1,340 Updated Oct 1, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 25,183 3,135 Updated Oct 27, 2025
Python 1 Updated Apr 23, 2024

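RAGOnMedicalKG: combines LLM-based RAG with a knowledge graph (KG) to deliver demo-level question answering, intended to illustrate the basic approach.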

Python 323 37 Updated Mar 31, 2024

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python 817 116 Updated Dec 23, 2024

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,950 378 Updated Oct 27, 2025

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Python 4,377 349 Updated Oct 13, 2025

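A tutorial and implementation of a disease-centered medical knowledge graph and a QA system built on it. Covers knowledge graph construction and KG-based automatic question answering: a reasonably large disease-centered knowledge graph for the medical domain, used to provide automatic question answering and analysis services.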

Python 7,001 2,245 Updated Aug 8, 2024

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,641 217 Updated Oct 20, 2025

Simple flow library 🖥️🖱️

JavaScript 5,510 848 Updated Oct 19, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,689 619 Updated Feb 21, 2025

Medical NLP competitions, datasets, large models, and papers

2,359 425 Updated Dec 6, 2024

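A collection of public resources for Chinese medical NLP: terminologies / corpora / word embeddings / pretrained models / knowledge graphs / named entity recognition / QA / information extraction / models / papers / etc.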

2,476 388 Updated Jan 17, 2024

VS Code Extension Manager

TypeScript 932 233 Updated Sep 29, 2025
TypeScript 5 Updated Nov 10, 2023

Open-source coding assistant for Visual Studio Code. Connect to LLMs from OpenAI or Google.

TypeScript 18 2 Updated Aug 14, 2023

Simple samples for TensorRT programming

Python 1,644 351 Updated May 27, 2025