Stars
simple overview of python, numpy, scipy, matplotlib functions that are useful for scientific work
A curated list of awesome Machine Learning frameworks, libraries and software.
🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
Deserialization support for Scala case classes, including proper handling of default values.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
python implementation of the parquet columnar file format.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
The HTML Presentation Framework
OpenRefine is a free, open source power tool for working with messy data and improving it
*Experimental* GraphChi-DB graph database with computational capabilities