walkvlm for blind walking

Python 5 1 Updated Sep 1, 2025

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python 607 63 Updated Nov 26, 2025

Universal and Transferable Attacks on Aligned Language Models

Python 4,401 585 Updated Aug 2, 2024

[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval

Jupyter Notebook 238 9 Updated Nov 6, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 620 23 Updated Mar 18, 2025

Xposed virtual camera, available for Android 9.0+

Kotlin 225 81 Updated Nov 12, 2024

Building DeepSeek R1 from Scratch

Jupyter Notebook 731 118 Updated Mar 21, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,423 1,688 Updated Sep 24, 2025

Generative Models by Stability AI

Python 26,723 3,009 Updated Dec 16, 2025

Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers

HTML 11,393 1,152 Updated Apr 30, 2025

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Python 1,990 223 Updated Mar 21, 2024

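text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.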

Python 4,920 423 Updated Nov 30, 2025

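Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…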

Python 69,843 8,401 Updated Sep 20, 2025

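A small tool for looking up journal rankings, covering the upgraded Chinese Academy of Sciences (CAS) journal partition table (2025, 2023, 2022), the international journal early-warning list (2025, 2024, 2023, 2021, 2020), JCR (2023, 2022, 2021, 2020), the CCF recommended list of international conferences and journals (2022), and the tiered catalogue of high-quality science and technology journals in computing (2022).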

C++ 327 34 Updated Jul 22, 2025

A lightweight, reliable UI component library for mini programs

JavaScript 18,311 3,494 Updated Dec 2, 2025

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 956 93 Updated Jun 22, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 28,134 2,815 Updated Apr 30, 2025

[AAAI 2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull

Python 180 22 Updated Aug 13, 2023

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,147 272 Updated Jan 10, 2025

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,112 94 Updated Jan 23, 2024

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,897 188 Updated Oct 30, 2025
Python 320 16 Updated Jul 16, 2024

MagicEdit: High-Fidelity Temporally Coherent Video Editing

1,807 103 Updated Aug 29, 2023

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,866 382 Updated Apr 7, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,138 6,620 Updated Dec 19, 2025

LLaMA: Open and Efficient Foundation Language Models

Python 2,800 306 Updated Nov 8, 2023

[ECCV2022, TPAMI2023] FAST-VQA, and its extended version FasterVQA.

Jupyter Notebook 329 34 Updated Aug 12, 2024

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 692 111 Updated Dec 14, 2023

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Python 950 85 Updated Nov 11, 2023

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,301 327 Updated Dec 15, 2025