-
18:39
(UTC) - https://buymeacoffee.com/ymrohit
- in/ym-rohit
Lists (1)
Sort Name ascending (A-Z)
Stars
A phone number can reveal whether a device is active, in standby or offline (and more). This PoC demonstrates how delivery receipts + RTT timing leak sensitive device-activity patterns. (WhatsApp /…
Z-Image workflow with predefined styles for high-quality image generation and a user-friendly experience. Includes pre-configured versions for GGUF and SAFETENSORS checkpoint formats.
Fully automatic censorship removal for language models
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Generate long Sora 2 videos that exceed OpenAI's native 12-second limit
Turn your screen into a Window into a virtual world
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.
DigitalPlat FreeDomain: Free Domain For Everyone
A LLM trained only on data from certain time periods to reduce modern bias
Drag and drop page builder library written in vanilla javascript without dependencies or build tools.
Build Real-Time Knowledge Graphs for AI Agents
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
State-of-the-art TTS model under 25MB 😻
Qwen-Image text to image lora trainer
CLI tool for configuring and monitoring Claude Code
Collection of specialized AI subagents for Claude Code for personal use (full-stack development).
A lightweight CLI tool to easily configure and initialize MCPs for Claude Code.
A comprehensive collection of development workflow commands for Claude Code
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
ymichael / open-codex
Forked from openai/codexLightweight coding agent that runs in your terminal
Open-source, vision-first browser agent
Accessing Apple Intelligence and ChatGPT desktop through OpenAI / Ollama API
akashjss / sesame-csm
Forked from SesameAILabs/csmA Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.