Starred repositories
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
A playbook for systematically maximizing the performance of deep learning models.
Google Drive Public File Downloader when Curl/Wget Fails
Stable Diffusion with Core ML on Apple Silicon
Obsidian plugin that gives you the power to generate dynamic MOCs in your folder notes. Enables folders to show up in the graph view and removes the need for messy tags!
🔍 Search browser tabs from Chrome, Arc, Brave, Safari, etc..
Alfred workflow with dozens of features for controlling your Obsidian vault.
Chrome/Safari/Firefox extension for clipping arXiv articles to Notion.
Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay
A fast CSV command line toolkit written in Rust.
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
Fetch Academic Research Papers from different sources
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Copy/paste detector for programming source code.
Official Repository for our ECCV2020 paper: Imbalanced Continual Learning with Partitioning Reservoir Sampling
Modin: Scale your Pandas workflows by changing a single line of code
Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022
A simple and efficient tool to parallelize Pandas operations on all available CPUs
High performance model preprocessing library on PyTorch
Examples of how to create colorful, annotated equations in Latex using Tikz.