🍽️ Analyze business success through machine learning with a comprehensive pipeline, using diverse data to drive insights in the restaurant industry.
-
Updated
Jan 10, 2026 - Jupyter Notebook
🍽️ Analyze business success through machine learning with a comprehensive pipeline, using diverse data to drive insights in the restaurant industry.
🧠 Monitor web app performance and detect anomalies with AI-driven insights, capturing key events and generating detailed incident reports.
🛒 Track Walmart product prices effortlessly with this bot. Get instant alerts on price changes to score the best deals.
⚙️ Monitor and automatically restart Databricks apps to ensure they stay operational with ease. Deploy quickly with our straightforward setup guide.
🗣️ Record your voice and convert it to text on Linux with Hyprflow, a simple open-source tool that enhances your workflow effortlessly.
🩺 Diagnose and treat missing values in machine learning datasets with tools to quantify, visualize, and impute, all while evaluating impact on model performance.
📦 Split buffers and streams into smaller chunks for smooth HTTP uploads and accurate progress tracking.
🤖 Set up Terry, your AI agent, to monitor and troubleshoot your homelab effortlessly while ensuring human oversight throughout the process.
🚀 Simplify BigQuery queries with jsh, offering a clean and efficient way to manage and manipulate your data seamlessly.
❄️ Streamline your data workflows with Snowflake IQD, a tool for efficient data integration and analysis in Snowflake environments.
❄️ Simplify data management with snowflake-mh9, a tool that streamlines interactions with Snowflake for efficient querying and analytics.
🛡️ Detect and prevent fraud in real-time with an ensemble system, achieving high accuracy and significant ROI for enhanced security solutions.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
A robust (🐢) and fast (🐇) MLOps tool for managing data and pipelines in Rust (🦀)
The Feldera Incremental Computation Engine
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
An orchestration platform for the development, production, and observation of data assets.
Dataform is a framework for managing SQL based data operations in BigQuery
Add a description, image, and links to the data-pipelines topic page so that developers can more easily learn about it.
To associate your repository with the data-pipelines topic, visit your repo's landing page and select "manage topics."