Stars
Securities and Exchange Commission (SEC) EDGAR database, which contains regulatory filings from publicly traded US corporations.
Rosbag-Databricks demonstrates how to read and process rosbag files (http://wiki.ros.org/rosbag) in Databricks.
Collection of Machine Learning Examples for Azure Databricks
Basic and advanced MLflow examples for many ML flavors
Import and analyze Chicago public taxi and ride-hailing data
Databricks Platform - Architecture, Security, Automation and much more!!
Sample base images for Databricks Container Services
A Python library for reading and writing PDF, powered by QPDF
Works with ecobee sensors to switch fans between auto and on to balance the environment temperature.
Reading digital XBRL/iXBRL account documents - for sharing
Power BI Embedded with Custom Controls PoC
Azure Quickstart Templates
A game theoretic approach to explain the output of any machine learning model.
Open-source JavaScript Pivot Table (aka Pivot Grid, Pivot Chart, Cross-Tab) implementation with drag'n'drop.
Automatically knit R Markdown documents, build them with Jekyll, and serve the website with servr locally
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Apache Spark - A unified analytics engine for large-scale data processing
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…
A global, black box optimization engine for real world metric optimization.
Python implementations of the Boruta all-relevant feature selection method.
Base classes to use when writing tests with Spark
This is meant to be a Data Science resource for capturing the latest technology, expertise, and evolving techniques in Data Science.
Resumes generated using GitHub information
A rapid on-ramp primer for programmers who want to learn Python for doing data science research and development.