Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View hj5's full-sized avatar

Block or report hj5

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
40 stars written in Scala
Clear filter

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,882 29,076 Updated Feb 25, 2026

Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3

Scala 14,440 3,096 Updated Feb 23, 2026

CMAK is a tool for managing Apache Kafka clusters

Scala 11,950 2,495 Updated Aug 2, 2023

A fault tolerant, protocol-agnostic RPC system

Scala 8,873 1,445 Updated Feb 2, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,603 2,007 Updated Feb 25, 2026

酷玩 Spark: Spark 源代码解析、Spark 类库等

Scala 3,482 1,393 Updated May 18, 2022

REST job server for Apache Spark

Scala 2,842 979 Updated Jul 8, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,305 985 Updated Feb 25, 2026

Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.

Scala 1,845 546 Updated May 29, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,516 578 Updated Feb 25, 2026

command line options parsing for Scala

Scala 1,445 161 Updated Sep 6, 2025

A simple-build-tool (sbt) plugin/processor for creating IntelliJ IDEA project files

Scala 1,065 147 Updated Dec 27, 2017

A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io.

Scala 1,059 379 Updated Feb 10, 2026

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,007 312 Updated Oct 5, 2022

A connector for Spark that allows reading and writing to/from Redis cluster

Scala 947 367 Updated Oct 22, 2024

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 946 618 Updated Feb 24, 2026

Non-blocking, Reactive Redis driver for Scala (with Sentinel support)

Scala 785 141 Updated May 7, 2024

This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language

Scala 566 557 Updated Mar 20, 2024

Quick up and running using Scala for Apache Kafka

Scala 327 132 Updated Jul 2, 2017

Connect Spark to HBase for reading and writing data with ease

Scala 295 106 Updated Dec 19, 2017

SparkOnHBase

Scala 278 175 Updated Mar 30, 2021

A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).

Scala 152 54 Updated Apr 21, 2023

Spark SQL index for Parquet tables

Scala 134 35 Updated May 6, 2021

A Redis client written with Akka's IO package

Scala 108 24 Updated Jun 9, 2022

Spark Clickhouse Connector

Scala 71 8 Updated Aug 7, 2020

spark to yandex clickhouse connector

Scala 69 40 Updated Sep 4, 2019

A library based on delta for Spark and MLSQL

Scala 60 23 Updated Dec 24, 2020

SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题

Scala 48 10 Updated Jan 4, 2019

A playground for Spark jobs.

Scala 43 40 Updated Dec 8, 2018
Next