Stars
This is the unofficial implementation of HOPE based model as shown in the google deepmind research paper.
TradingAgents: Multi-Agents LLM Financial Trading Framework
Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Drone dataset to guide enemy drones (with some tools)
[IEEE TGRS 2021] Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。 AI 分析简报直推手机,也支持接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测。支持 Docker 一键部署,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。⭐
Uses artificial intelligence to mix together songs from an inputted playlist.
Official codes of the first place for AICity 2022 Track 1
Solution of The AI City Challenge 2022 Track 1
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
Everything you need to know to build your own RAG application
This repo powers my experiment where ChatGPT manages a real-money micro-cap stock portfolio.
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
Tools to support converting a Python project into a standalone native application.
Resources for Multiple Object Tracking (MOT)
🚀 PR Agent - The Original Open-Source PR Reviewer, This repo is not the Qodo free tier! Try the free version on our website.
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.