Data Analysis packages

Showing projects tagged as Data Analysis

  • Dask

    9.2 9.4 L2 Python
    Parallel computing with task scheduling
  • marimo

    9.2 10.0 Python
    A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
  • Panel

    8.0 9.7 Python
    Panel: The powerful data exploration & web app framework for Python
  • AWS Data Wrangler

    7.6 8.8 Python
    pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
  • Sacred

    7.5 3.1 Python
    Sacred is a tool, developed at IDSIA, to help you configure, organize, log and reproduce experiments.
  • Interactive Parallel Computing with IPython

    7.3 8.1 L3 Jupyter Notebook
    IPython Parallel: Interactive Parallel Computing in Python
  • Clairvoyant

    7.0 2.1 L3 Python
    Software designed to identify and monitor social/historical cues for short-term stock movement
  • TextDistance

    6.9 4.1 Python
    📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional external libs usage.
  • karateclub

    6.2 6.7 Python
    Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
  • jellyfish

    5.9 5.8 Jupyter Notebook
    🪼 A Python library for approximate and phonetic matching of strings.
  • Cubes

    5.8 0.0 L3 Python
    [NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
  • Optimus

    5.5 0.0 Python
    Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
  • Streamz

    5.0 4.0 Python
    Real-time stream processing for Python
  • bcolz

    4.7 0.0 C
    DISCONTINUED. A columnar data container that can be compressed.
  • fastparquet

    4.5 5.8 Python
    Python implementation of the Parquet columnar file format.
  • pdpipe

    3.9 7.2 Jupyter Notebook
    Easy pipelines for pandas DataFrames.
  • Bubbles

    3.7 0.0 L5 Python
    [NOT MAINTAINED] Bubbles – Python ETL framework
  • Zef

    1.9 1.8 Python
    Toolkit for graph-relational data across space and time
  • Google Analytics Extractor

    1.4 3.2 Python
    Tool for extracting Google Analytics data suitable for migrating to other platforms/databases
  • convtools

    1.3 7.0 Python
    convtools is a specialized Python library for dynamic, declarative data transformations with automatic code generation
  • pyxll-utils

    1.0 0.0 Python
    DISCONTINUED. Utility code for use with PyXLL, the Python Excel Add-In.