Pathairush
🍃
Enjoy what you have!


Robyn is an experimental, AI/ML-powered and open sourced Marketing Mix Modeling (MMM) package from Meta Marketing Science. Our mission is to democratise modeling knowledge, inspire the industry thr…

Jupyter Notebook 1,364 413 Updated Jul 1, 2025

Uplift modeling and causal inference with machine learning algorithms

Python 5,630 839 Updated Sep 26, 2025

advertools - online marketing productivity and analysis tools

Python 1,294 233 Updated Sep 23, 2025

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, P…

Python 432 83 Updated Nov 5, 2025

pyspark methods to enhance developer productivity 📣 👯 🎉

Python 675 99 Updated Mar 6, 2025

PySpark test helper methods with beautiful error messages

Python 723 75 Updated Sep 17, 2025

Database Markup Language (DBML), designed to define and document database structures

JavaScript 3,409 214 Updated Nov 6, 2025

Template for a data science project

Python 741 215 Updated Aug 14, 2025

Free MLOps course from DataTalks.Club

Jupyter Notebook 13,580 2,719 Updated Oct 15, 2025

Documentation that simply works

Python 25,062 3,946 Updated Nov 5, 2025

Project documentation with Markdown.

Python 21,245 2,555 Updated Oct 20, 2025

🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.

Python 453 127 Updated Apr 22, 2025

Examples for the blog post on pytest-mock

Python 80 18 Updated Apr 13, 2022

Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline

Python 153 127 Updated Aug 14, 2024

Dataset extracted from the Jira ITS of four popular open source ecosystems i.e., the Apache Software Foundation, Spring, JBoss and CodeHaus communities.

Shell 33 14 Updated Mar 13, 2023

An ultra-simplified explanation to design patterns

47,008 5,462 Updated Dec 2, 2024

Testing framework for Databricks notebooks

Python 309 44 Updated Apr 20, 2024

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 47,033 19,247 Updated Nov 6, 2025

Generate and Visualize Data Lineage from query history

Python 326 45 Updated Aug 4, 2023

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 19,960 4,892 Updated Nov 6, 2025

An open source python library for automated feature engineering

Python 7,559 907 Updated Nov 3, 2025

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 11,800 2,179 Updated Nov 5, 2025

Examples surrounding Databricks.

Jupyter Notebook 60 25 Updated Jul 4, 2024

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Python 858 36 Updated Jul 3, 2023

A collection of learning resources for curious software engineers

Python 49,712 3,919 Updated Oct 27, 2025

📙 Awesome Data Catalogs and Observability Platforms.

933 67 Updated Aug 14, 2025

WIP: Roadmap to becoming a machine learning engineer in 2020

2,196 256 Updated Sep 16, 2021

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,381 1,945 Updated Nov 5, 2025

Panel: The powerful data exploration & web app framework for Python

Python 5,503 567 Updated Nov 6, 2025

Always know what to expect from your data.

Python 10,898 1,643 Updated Nov 6, 2025