InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more ā
Top 23 Python Research Projects
-
qlib
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
After researching different AI models in Qlib (a quantitative finance platform), here's what I learned:
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
-
gpt-researcher
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
For demonstration purposes, I'll be using the gpt-researcher tool. Github link: https://github.com/assafelovic/gpt-researcher
-
RD-Agent
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. šhttps://aka.ms/RD-Agent-Tech-Report
-
-
-
mlfinlab
MlFinLab helps portfolio managers and traders who want to leverage the power of machine learning by providing reproducible, interpretable, and easy to use tools.
-
InfluxDB
InfluxDB ā Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
-
-
habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
-
diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Project mention: AI video you can watch and interact with, in real-time | news.ycombinator.com | 2025-05-31Odyssey Systems is six months behind way more impressive demos:
- Open Source Diamond WM that you can run on consumer hardware [1]
- Google's Genie 2 (way better than this) [2]
- Oasis [3]
[1] https://diamond-wm.github.io/
[2] https://deepmind.google/discover/blog/genie-2-a-large-scale-...
[3] https://oasis.decart.ai/welcome
-
-
-
-
tldw_server
tl/dw (Too Long, Didn't Watch): Your Personal Research Multi-Tool - a naive attempt at 'A Young Lady's Illustrated Primer' (Open Source NotebookLM)
Project mention: Show HN: Morphik ā Open-source RAG that understands PDF images, runs locally | news.ycombinator.com | 2025-04-22Hey yes, Iām building exactly that.
https://github.com/rmusser01/tldw
I first built a POC in gradio and am now rebuilding it as a FastAPI app. The media processing endpoints work but Iām still tweaking media ingestion to allow for syncing to clients(idea is to allow for client-first design).
-
PyGame-Learning-Environment
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
-
-
Mava
š¦ A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
-
-
Project mention: Promptic ā the "requests" of LLM app development | news.ycombinator.com | 2024-11-26
Thanks for the kind words! I'm a fan of magentic :) One of the projects I've built with promptic is https://pdf-to-podcast.com
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Research discussion
Python Research related posts
-
Improving KAN with CDF normalization to quantiles
-
Show HN: AI Peer Reviewer ā Multiagent System for Scientific Manuscript Analysis
-
GitHub ā ByteDance/UI-Tars
-
Datahawk ā Text data browser for NLP, LLM researchers and developers
-
Papers for Software Engineers
-
A curated list of papers for Software Engineers
-
VectorChord: Store 400k Vectors for $1 in PostgreSQL
-
A note from our sponsor - InfluxDB
www.influxdata.com | 15 Nov 2025
Index
What are some of the best open-source Research projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | qlib | 33,724 |
| 2 | khoj | 31,564 |
| 3 | gpt-researcher | 24,101 |
| 4 | RD-Agent | 9,439 |
| 5 | UI-TARS | 8,197 |
| 6 | software-papers | 6,185 |
| 7 | mlfinlab | 4,373 |
| 8 | acme | 3,842 |
| 9 | scenic | 3,707 |
| 10 | catalyst | 3,363 |
| 11 | lingvo | 2,854 |
| 12 | habitat-lab | 2,669 |
| 13 | diamond | 1,892 |
| 14 | Papers-in-100-Lines-of-Code | 1,666 |
| 15 | music_source_separation | 1,365 |
| 16 | SALMONN | 1,352 |
| 17 | yacs | 1,325 |
| 18 | tldw_server | 1,121 |
| 19 | PyGame-Learning-Environment | 1,052 |
| 20 | dreamerv2 | 965 |
| 21 | Mava | 853 |
| 22 | iris | 842 |
| 23 | pdf-to-podcast | 806 |