Organizations

@Qbeast-io

Starred repositories

Open, Multi-modal Catalog for Data & AI

Java 3,229 557 Updated Dec 18, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,599 5,509 Updated Dec 21, 2025

A simple macOS application that will prevent iTunes or Apple Music from launching.

Swift 5,258 88 Updated Aug 8, 2024

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 1,137 196 Updated Dec 20, 2025

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 803 159 Updated Nov 6, 2025

Visual-Flow main repository

454 5 Updated Mar 11, 2025

The Open-Source toolkit to build your own reliable and secure Industrial IoT platform.

Go 338 58 Updated Dec 19, 2025

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,667 198 Updated Dec 19, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 8,263 1,564 Updated Dec 22, 2025

This repository has moved into https://github.com/dbt-labs/dbt-adapters

Python 444 239 Updated Jul 16, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,047 2,454 Updated Dec 22, 2025

MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.

Go 59,355 6,808 Updated Dec 3, 2025

QuestDB is a high performance, open-source, time-series database

Java 16,480 1,521 Updated Dec 22, 2025

Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.

Java 284 80 Updated Nov 26, 2025

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 27,753 1,943 Updated Dec 21, 2025

Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified

Python 36 8 Updated Apr 19, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,125 31,504 Updated Dec 22, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,317 3,420 Updated Dec 22, 2025

The missing star history graph of GitHub repos - https://star-history.com

TypeScript 8,199 308 Updated Dec 18, 2025

This repository started out as a learning in public project for myself and has now become a structured learning map for many in the community. We have 3 years under our belt covering all things Dev…

Shell 29,183 6,703 Updated Jun 4, 2025

A Scala API for Apache Beam and Google Cloud Dataflow.

Scala 2,615 526 Updated Dec 16, 2025

A GitHub API client to extract events and actions, and load them into a database

Jupyter Notebook 28 12 Updated Oct 22, 2021

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 20,305 4,973 Updated Dec 20, 2025

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python 4,710 972 Updated Dec 18, 2025

An sbt plugin for publishing Scala/Java projects to Maven Central.

Scala 342 64 Updated Dec 12, 2025

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Scala 235 24 Updated Jan 24, 2025

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …

JavaScript 803 54 Updated Aug 10, 2022

A simple Spark-powered ETL framework that just works 🍺

Scala 181 33 Updated Oct 2, 2025

wik is used to get information about anything on the shell using Wikipedia.

Python 632 20 Updated May 25, 2024