- Brooklyn, NY
- https://linktr.ee/saifrahmed
Highlights
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Streaming music player that finds free music for you
Press shortcut → speak → get text. Free and open source. More local-first apps soon ❤️
MCP server for querying Apple Health data with natural language and SQL
Desktop Extensions: One-click local MCP server installation in desktop apps
Control 3D models using hand gestures and voice commands in real-time. Threejs / mediapipe computer vision
🎓 Path to a free self-taught education in Computer Science!
real time face swap and one-click video deepfake with only a single image
A Model Context Protocol (MCP) server library that gives LLMs access to information about a candidate.
A package to work with SEC data. Incorporates datamule endpoints.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
This repository contains the Hugging Face Agents Course.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Solving the first 100 Project Euler problems using 100 different programming languages!
A curated directory of free and friendly communities to showcase your side projects!
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and beyond
Web based game to learn chords and common chord progressions
App programmed in Swift/SwiftUI for using Libre blood glucose sensors.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Production-ready platform for agentic workflow development.
Send Morse code via ⏮️ ⏸️ ⏯️
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]