Stars
🚀 JavaScript diagramming library that uses SVG and HTML for rendering.
Fast and Lightweight Observability Data Collector
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
Java tutorial for DingTalk Open Platform 钉钉开放平台的 Java 教程
Python SDK for DingTalk Stream Mode API, Compared with the webhook mode, it is easier to access the DingTalk chatbot
Label Studio is a multi-type data labeling and annotation tool with standardized output format
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, xDC replica…
🐬DeepChat - A smart assistant that connects powerful AI to your personal world
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
Hopsworks - Data-Intensive AI platform with a Feature Store
Feathr – A scalable, unified data and AI engineering platform for enterprise
Automated Machine Learning with scikit-learn
🔎 Open source distributed and RESTful search engine.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
Apache InLong - a one-stop, full-scenario integration framework for massive data
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
🔥 人人可用的开源 BI 工具,数据可视化神器。An open-source BI tool alternative to Tableau.