littlenyima.github.io
Stars
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
[CVPR 2026] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
A curated list of papers on reinforcement learning for video generation
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
Pusa: Thousands Timesteps Video Diffusion Model
[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Wan: Open and Advanced Large-Scale Video Generative Models
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
[AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
💫 Ngrok FRP Alternative • ⚡ Fast • 🪶 Lightweight • 0️⃣ Dependency • 🔌 Pluggable • 😈 TLS interception • 🔒 DNS-over-HTTPS • 🔥 Poor Man's VPN • ⏪ Reverse & ⏩ Forward • 👮🏿 "Proxy Server" framework • 🌐 …
Understand Human Behavior to Align True Needs
[IJCV 2024] InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions
OpenUI lets you describe UI using your imagination, then see it rendered live.
High-quality PNGs for logos I made for fun
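Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.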
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
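A practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…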
Ficus is software for editing and managing markdown documents, developed by the gg=G team.
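Travellings (开往) is a friend-link relay project that aims to direct traffic to little-known independent sites through web redirects. Whenever a user visits a page that has joined the project, clicking the "Travellings" button on that page randomly redirects them to another participating page.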
A markup-based typesetting system that is powerful and easy to learn.
Master the command line, in one page
Stable Diffusion web UI