Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View yxu0611's full-sized avatar

Block or report yxu0611

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Face Forgery Video Detection via Temporal Forgery Cue Unraveling

Python 4 Updated Oct 14, 2025

🔥 [ICLR 2025] FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models

Python 306 30 Updated Oct 12, 2025

分享一些好用的 Dify DSL 工作流程,自用、学习两相宜。 Sharing some Dify workflows.

9,443 949 Updated Jun 11, 2025

Production-ready platform for agentic workflow development.

TypeScript 117,236 18,107 Updated Oct 25, 2025

本项目是基于dify开源项目实现的dsl工作流脚本合集

Python 2,816 568 Updated Oct 21, 2025
Jupyter Notebook 34 4 Updated Apr 7, 2022

Windows inside a Docker container.

Shell 47,932 3,570 Updated Oct 24, 2025
Jupyter Notebook 21 4 Updated Jul 31, 2025

Official inference repo for FLUX.1 models

Python 24,536 1,801 Updated Jul 31, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 65,362 6,792 Updated Oct 16, 2025

Official code for Forensics Adapter (CVPR'25).

Python 67 5 Updated Oct 8, 2025

A lightweight LMM-based Document Parsing Model

Python 6,117 421 Updated Oct 25, 2025
Python 31 2 Updated Jun 4, 2025
Python 58 12 Updated Aug 12, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,597 612 Updated Oct 22, 2025

超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

C++ 12,220 2,295 Updated Aug 14, 2023

多功能多引擎OCR文字识别、翻译、朗读、语音合成、日漫游戏机翻汉化、验证码识别、图床上传、以图搜图、扫码工具

1,883 129 Updated Oct 8, 2025

本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/

Jupyter Notebook 10,500 1,101 Updated Oct 2, 2025

大模型基础: 一文了解大模型基础知识

6,063 506 Updated Feb 24, 2025

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 3,918 396 Updated Aug 30, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,408 2,553 Updated Oct 9, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,570 505 Updated Aug 25, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,752 7,348 Updated Oct 24, 2025

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,309 163 Updated Oct 22, 2025

PDF解析工具:GOT的vLLM加速实现,MinerU做布局识别裁剪、GOT做表格公式解析,实现RAG中的pdf解析

Python 64 5 Updated Nov 7, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 60,980 10,765 Updated Oct 25, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,971 695 Updated Feb 10, 2025

⚡️ Fast, ultra-accurate text extraction from any image or PDF—including challenging ones—with structured markdown output powered by vision models.

TypeScript 34 4 Updated Jan 9, 2025

[CVPR2023] Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution

Python 174 21 Updated Sep 5, 2025
Next