Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View LiChangNY's full-sized avatar

Block or report LiChangNY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Google Cloud Storage emulator & testing library.

Go 1,299 256 Updated Jan 16, 2026

Python XML Schema Bindings

Python 130 73 Updated Apr 29, 2023

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala 2,271 401 Updated Sep 29, 2023

A List of Recommender Systems and Resources

4,798 708 Updated Dec 3, 2025

DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector

Java 152 197 Updated Mar 4, 2024

A shell script to set up a macOS laptop for web and mobile development.

Shell 8,551 1,897 Updated Jan 16, 2026

Track changes to your rails models

Ruby 6,966 909 Updated Oct 24, 2025

Do some browser detection with Ruby. Includes ActionController integration.

Ruby 2,487 365 Updated Jun 10, 2025

Python module installed with setup.py

Python 338 79 Updated Jun 29, 2022

Google BigQuery connector for pandas

Python 488 126 Updated Jan 5, 2026

Samples for the DoubleClick for Advertisers Reporting and Trafficking API

C# 109 172 Updated Sep 29, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 43,901 16,303 Updated Jan 19, 2026

Repository with examples and smoke tests for the GCP Airflow operators and hooks

Python 152 40 Updated Jan 15, 2017

Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform…

Python 87 35 Updated Feb 11, 2014

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 8,449 4,482 Updated Jan 19, 2026

DonorsChoose.org Data Science Team Opensource Code

Jupyter Notebook 78 24 Updated Dec 8, 2022

Pentaho Data Integration ( ETL ) a.k.a Kettle

Java 8,294 3,575 Updated Jan 19, 2026

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,077 2,461 Updated Jan 19, 2026

Adds static typing to JavaScript to improve developer productivity and code quality.

OCaml 22,212 1,886 Updated Jan 17, 2026

Streaming MapReduce with Scalding and Storm

Scala 2,131 263 Updated Jan 19, 2022

Ansible playbook to deploy distributed technologies

Python 67 43 Updated Nov 20, 2017

A short guide for transitioning from Python to Scala

65 28 Updated Jan 5, 2016

Repo to migrate old wiki to, esp for devs and code examples

184 58 Updated Oct 18, 2016

Web UI for PrestoDB.

Java 2,752 449 Updated May 20, 2021

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 70,137 16,529 Updated Jan 19, 2026

Docker image for Airbnb's Superset

Dockerfile 987 417 Updated Dec 22, 2025

Content for Udacity's Machine Learning curriculum

Jupyter Notebook 4,006 6,289 Updated Feb 24, 2022

An extension of GeoJSON that encodes topology! 🌐

JavaScript 4,854 680 Updated Sep 20, 2024