Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View rex-yue-wu's full-sized avatar

Block or report rex-yue-wu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,372 735 Updated Aug 13, 2024

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 55,796 5,599 Updated Nov 14, 2025

Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

Python 126 10 Updated Nov 6, 2024

[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,941 341 Updated Jul 29, 2025

Turn any face into a video game character, pixel art, claymation, 3D or toy

Python 1,356 206 Updated Apr 9, 2024

ControlNet collections for Flux1-dev model, Trained by TheMisto.ai Team

Python 354 16 Updated Sep 5, 2025

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,446 195 Updated May 14, 2025

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

Python 39,824 3,942 Updated Nov 12, 2025

Official repository for CVPR2022 publication, ViM: Out-Of-Distribution with Virtual-logit Matching

Python 91 13 Updated Sep 18, 2023

[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"

270 4 Updated Oct 3, 2023

iCartoonFace dataset, and baseline approaches, the project is supported by iQIYI

300 18 Updated Jun 25, 2021

Official Pytorch Implementation for "Splicing ViT Features for Semantic Appearance Transfer" presenting "Splice" (CVPR 2022 Oral)

Jupyter Notebook 387 33 Updated Nov 21, 2023

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,799 306 Updated Mar 7, 2025

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Python 461 44 Updated Mar 14, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 10,512 807 Updated Dec 4, 2024

Mapping of Imagenet and Wikidata for Knowledge Graphs Enabled Computer Vision

8 1 Updated Apr 22, 2022

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,727 71 Updated Nov 9, 2025

🔥 A cross-platform build utility based on Lua

Lua 11,475 883 Updated Nov 14, 2025

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Python 319 14 Updated Jun 3, 2024

CLIP-like model evaluation

Python 786 98 Updated Nov 7, 2025

Keras implementation of the Yahoo Open-NSFW model

Python 462 62 Updated Nov 2, 2025

Refine high-quality datasets and visual AI models

Python 10,029 680 Updated Nov 14, 2025
Python 103 6 Updated Jan 26, 2024

Train transformer language models with reinforcement learning.

Python 16,294 2,291 Updated Nov 14, 2025

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,216 6,237 Updated Jul 26, 2024

闻达:一个LLM调用平台。目标为针对特定环境的高效内容生成,同时考虑个人和中小企业的计算资源局限性,以及知识安全和私密性问题

JavaScript 6,230 808 Updated Jan 23, 2025

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,691 1,839 Updated Jun 27, 2024

Real-time face swap for PC streaming or video calls

Python 30,076 981 Updated Nov 8, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,978 331 Updated Jun 12, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,135 973 Updated Nov 14, 2025
Next