- Taizhou
Stars
🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
A diffusion-based style transfer system that injects multi-token CLIP style embeddings into UNet attention layers for controllable artistic style generation. Includes a custom StyleAttnProcessor, i…
A Swift-based, offline password manager built on KeePassKit, featuring multi-layer encryption, biometric authentication, password generation, and a fully localized English–Chinese interface. 一款基于 S…
A lightweight browser-to-NAS pipeline for capturing and downloading web videos. It integrates a Chrome Extension with a NAS-hosted Docker backend (FastAPI, workers, FFmpeg) to automatically detect,…
[TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchmarks, and opensource projects) on large language model-based …
SAG - SQL驱动的RAG引擎 · 查询时自动构建知识图谱 | SQL-Driven RAG Engine · Automatically Build Knowledge Graph During Querying
Fully elastic, MongoDB API compatible distributed JSON document database with compute-storage separation and robust ACID transactions.
持续收集更新全网最全最有趣的Telegram机器人🤖大全,各类工具箱干货,相信总有你需要的一款机器人~ Telegram 中文机器人 / 群组频道导航(Chinese Telegram bots, groups & channels collection)
Query-aware Token Selector (QTSplus), a lightweight yet powerful visual token selection module that serves as an information gate between the vision encoder and LLMs.
High Performance Redis-API Compatible Distributed Database with Persistency, Scalability, Full ACID Transactions, and Tiered S3 Storage Cost Efficiency
The Intelligent GUI Agent for Mobile Phones
Fulling is an AI-powered Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui, and PostgreSQL.
INFTY Engine: An Optimization Toolkit to Support Continual AI
本项目是一个基于 Golang Gin 框架 开发的 B2C 电商平台,采用 MVC(Model-View-Controller)架构 进行模块化设计,能够扩展为实现前后端分离,支持后台商品管理、用户系统、订单交易、支付集成、数据分析等功能,系统地展示了现代Web应用的全貌。该项目描绘了一个功能完整、技术选型现代的全栈电商项目。它从前端交互到后端管理,从业务逻…
AipexBase is an AI-native BaaS platform. You only need to develop the frontend with vibe coding tools, and leave the backend to AipexBase!
This is the official implementation for the paper "SNR-aware low-light image enhancement" in CVPR2022
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
职星学院企业培训系统是一套基于点播、直播、考试、培训、面授等功能完善的在线教育系统,开源版是基于商业版精简实现的一个企业员工培训系统,致力于打造一个各行业都适用的在线培训系统、企业培训平台、员工培训系统、企业内部培训系统。
Codebase of our paper in CVPR 2025: "Neural Hierarchial Decomposition for Single Image Plant Modeling."
PageEyes Agent 是一个轻量级 UI Agent,通过自然语言指令驱动,无需编写脚本既可实现Web、Android平台的UI自动化任务。
[NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
A head-only, lightweight, fast, thread safe, valgrind-like memory monitor, which output perf-like report.
PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.
Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy
Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…
Open source AI terminal and SSH Client for EC2, Database and Kubernetes.
Agent-ready RPA suite with out-of-the-box automation tools. Built for individuals and enterprises.
Enterprise-grade, commercial-friendly agentic workflow platform for building next-generation SuperAgents.