Thanks to visit codestin.com
Credit goes to github.com

Skip to content
forked from apache/iggy

Iggy is the persistent message streaming platform written in Rust, supporting QUIC, TCP and HTTP transport protocols, capable of processing millions of messages per second.

License

Notifications You must be signed in to change notification settings

lukaszzborek/iggy

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Apache Iggy (Incubating)

crates.io crates.io coverage dependency x discord-badge


iggy


Iggy is a persistent message streaming platform written in Rust, supporting QUIC, TCP (custom binary specification) and HTTP (regular REST API) transport protocols, capable of processing millions of messages per second at ultra-low latency.

Iggy provides exceptionally high throughput and performance while utilizing minimal computing resources.

This is not yet another extension running on top of existing infrastructure, such as Kafka or SQL database.

Iggy is a persistent message streaming log built from the ground up using low-level I/O for speed and efficiency.

The name is an abbreviation for the Italian Greyhound - small yet extremely fast dogs, the best in their class. See the lovely Fabio & Cookie ❤️


Features

  • Highly performant, persistent append-only log for message streaming
  • Very high throughput for both writes and reads
  • Low latency and predictable resource usage thanks to the Rust compiled language (no GC)
  • User authentication and authorization with granular permissions and Personal Access Tokens (PAT)
  • Support for multiple streams, topics and partitions
  • Support for multiple transport protocols (QUIC, TCP, HTTP)
  • Fully operational RESTful API which can be optionally enabled
  • Available client SDK in multiple languages
  • Works directly with binary data, avoiding enforced schema and serialization/deserialization overhead
  • Custom zero-copy (de)serialization, which greatly improves the performance and reduces memory usage.
  • Configurable server features (e.g. caching, segment size, data flush interval, transport protocols etc.)
  • Server-side storage of consumer offsets
  • Multiple ways of polling the messages:
    • By offset (using the indexes)
    • By timestamp (using the time indexes)
    • First/Last N messages
    • Next N messages for the specific consumer
  • Possibility of auto committing the offset (e.g. to achieve at-most-once delivery)
  • Consumer groups providing the message ordering and horizontal scaling across the connected clients
  • Message expiry with auto deletion based on the configurable retention policy
  • Additional features such as server side message deduplication
  • Multi-tenant support via abstraction of streams which group topics
  • TLS support for all transport protocols (TCP, QUIC, HTTPS)
  • Connectors - sinks, sources and data transformations based on the custom Rust plugins
  • Model Context Protocol - provide context to LLM with MCP server
  • Optional server-side as well as client-side data encryption using AES-256-GCM
  • Optional metadata support in the form of message headers
  • Optional data backups and archiving to disk or S3 compatible cloud storage (e.g. AWS S3)
  • Support for OpenTelemetry logs & traces + Prometheus metrics
  • Built-in CLI to manage the streaming server installable via cargo install iggy-cli
  • Built-in benchmarking app to test the performance
  • Single binary deployment (no external dependencies)
  • Running as a single node (clustering based on Viewstamped Replication will be implemented in the near future)

server

files structure


Architecture

This is the high-level architecture of the Iggy message streaming server, where extremely high performance and ultra low and stable tail latencies are the primary goals. The server is designed to handle high throughput and very low latency (sub-millisecond tail latencies), making it suitable for real-time applications. For more details, please refer to the documentation.

server


Version

The official releases follow the regular semver (0.5.0) or have latest tag applied (apache/iggy:latest).

We do also publish edge/dev/nightly releases (e.g. 0.5.0-edge.1 or apache/iggy:edge), for both, SDKs and the Docker images, which are typically compatible with the latest changes, but are not guaranteed to be stable, and as the name states, are not recommended for production use.


Roadmap

  • Shared-nothing design and io_uring support (PoC on experimental branch, WiP on the main branch)
  • Clustering & data replication based on VSR (on sandbox project using Raft, will be implemented after shared-nothing design is completed)

Supported languages SDK

C++ and Elixir are work in progress.


CLI

The interactive CLI is implemented under the cli project, to provide the best developer experience. This is a great addition to the Web UI, especially for all the developers who prefer using the console tools.

Iggy CLI can be installed with cargo install iggy-cli and then simply accessed by typing iggy in your terminal.

CLI

Web UI

There's a dedicated Web UI for the server, which allows managing the streams, topics, partitions, browsing the messages and so on. This is an ongoing effort to build a comprehensive dashboard for administrative purposes of the Iggy server. Check the Web UI in the /web directory. The docker image for Web UI is available, and can be fetched via docker pull apache/iggy-web-ui.

Web UI


Connectors

The highly performant and modular runtime for statically typed, yet dynamically loaded connectors. Ingest the data from the external sources and push it further to the Iggy streams, or fetch the data from the Iggy streams and push it further to the external sources. Create your own Rust plugins by simply implementing either the Source or Sink trait and build custom pipelines for the data processing.

## Configure a sink or source connector, depending on your needs
[sinks.quickwit]
enabled = true
name = "Quickwit sink"
path = "target/release/libiggy_connector_quickwit_sink"
config_format = "yaml"

[[sinks.quickwit.streams]]
stream = "qw"
topics = ["records"]
schema = "json"
batch_length = 1000
poll_interval = "5ms"
consumer_group = "qw_sink_connector"

[[sinks.quickwit.transforms.add_fields.fields]]
key = "random_id"
value.computed = "uuid_v7"

[sinks.quickwit.transforms.delete_fields]
enabled = true
fields = ["email", "created_at"]

Model Context Protocol

The Model Context Protocol (MCP) is an open protocol that standardizes how applications provide context to LLMs. The Iggy MCP Server is an implementation of the MCP protocol for the message streaming infrastructure. It can be used to provide context to LLMs in real-time, allowing for more accurate and relevant responses.

server


Docker

The official Apache Iggy images can be found in Docker Hub, simply type docker pull apache/iggy to pull the image.

You can also find the images for all the different tooling such as Connectors, MCP Server etc. at Docker Hub.

Please note that the images tagged as latest are based on the official, stable releases, while the edge ones are updated directly from latest version of the master branch.

You can find the Dockerfile and docker-compose in the root of the repository. To build and start the server, run: docker compose up.

Additionally, you can run the CLI which is available in the running container, by executing: docker exec -it iggy-server /iggy.

Keep in mind that running the container on the OS other than Linux, where the Docker is running in the VM, might result in the performance degradation.


Configuration

The default configuration can be found in server.toml file in configs directory.

The configuration file is loaded from the current working directory, but you can specify the path to the configuration file by setting IGGY_CONFIG_PATH environment variable, for example export IGGY_CONFIG_PATH=configs/server.toml (or other command depending on OS).

When config file is not found, the default values from embedded server.toml file are used.

For the detailed documentation of the configuration file, please refer to the configuration section.


Quick start

Build the project (the longer compilation time is due to LTO enabled in release profile:

cargo build

Run the tests:

cargo test

Set root user credentials (OPTIONAL):

Iggy requires credentials to authenticate request to the server. You can set the root user before starting the server.

(macOS/Linux)

export IGGY_ROOT_USERNAME=iggy
export IGGY_ROOT_PASSWORD=iggy

(Windows(Powershell))

$env:IGGY_ROOT_USERNAME = "iggy"
$env:IGGY_ROOT_PASSWORD = "iggy"

By default, iggy-server will generate a randomized root user password and print it to stdout, when there's NO users created.

Start the server:

cargo run --bin iggy-server

One can use default root credentials with optional --with-default-root-credentials. This flag is equivalent to setting IGGY_ROOT_USERNAME=iggy and IGGY_ROOT_PASSWORD=iggy, plus it should only be used for development and testing.

cargo run --bin iggy-server -- --with-default-root-credentials

For configuration options and detailed help:

cargo run --bin iggy-server -- --help

You can also use environment variables to override any configuration setting:

  • Override TCP address IGGY_TCP_ADDRESS=0.0.0.0:8090 cargo run --bin iggy-server

  • Set custom data path IGGY_SYSTEM_PATH=/data/iggy cargo run --bin iggy-server

  • Enable HTTP transport IGGY_HTTP_ENABLED=true cargo run --bin iggy-server

  • Set custom root user credentials IGGY_ROOT_USERNAME=iggy IGGY_ROOT_PASSWORD=iggy cargo run --bin iggy-server

To quickly generate the sample data:

cargo run --bin data-seeder-tool

Please note that all commands below are using iggy binary, which is part of release (cli sub-crate).

Create a stream with name dev (numerical ID will be assigned by server automatically) using default credentials and tcp transport (available transports: quic, tcp, http, default tcp):

cargo run --bin iggy -- --transport tcp --username <iggy_username> --password <iggy_password> stream create dev

List available streams:

cargo run --bin iggy -- --username <iggy_username> --password <iggy_password> stream list

Get dev stream details:

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> stream get dev

Create a topic named sample (numerical ID will be assigned by server automatically) for stream dev, with 2 partitions (IDs 1 and 2), disabled compression (none) and disabled message expiry (skipped optional parameter):

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> topic create dev sample 2 none

List available topics for stream dev:

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> topic list dev

Get topic details for topic sample in stream dev:

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> topic get dev sample

Send a message 'hello world' (message ID 1) to the stream dev to topic sample and partition 1:

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> message send --partition-id 1 dev sample "hello world"

Send another message 'lorem ipsum' (message ID 2) to the same stream, topic and partition:

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> message send --partition-id 1 dev sample "lorem ipsum"

Poll messages by a regular consumer with ID 1 from the stream dev for topic sample and partition with ID 1, starting with offset 0, messages count 2, without auto commit (storing consumer offset on server):

cargo run --bin iggy -- -u <iggy_username> -p <iggy_password> message poll --consumer 1 --offset 0 --message-count 2 --auto-commit dev sample 1

Finally, restart the server to see it is able to load the persisted data.

The HTTP API endpoints can be found in server.http file, which can be used with REST Client extension for VS Code.

To see the detailed logs from the CLI/server, run it with RUST_LOG=trace environment variable. See images below:


Examples

You can find comprehensive sample applications under the examples/rust directory. These examples showcase various usage patterns of the Iggy client SDK, from basic operations to advanced multi-tenant scenarios.

For detailed information about available examples and how to run them, please see the Examples README.


SDK

Iggy comes with the Rust SDK, which is available on crates.io.

The SDK provides both, low-level client for the specific transport, which includes the message sending and polling along with all the administrative actions such as managing the streams, topics, users etc., as well as the high-level client, which abstracts the low-level details and provides the easy-to-use API for both, message producers and consumers.

You can find the more examples, including the multi-tenant one under the examples directory.

// Create the Iggy client
let client = IggyClient::from_connection_string("iggy://user:secret@localhost:8090")?;

// Create a producer for the given stream and one of its topics
let mut producer = client
    .producer("dev01", "events")?
    .direct( // Use either direct (instant) or background message sending
        DirectConfig::builder()
            .batch_length(1000)
            .linger_time(IggyDuration::from_str("1ms")?)
            .build(),
    )
    .partitioning(Partitioning::balanced())
    .build();

producer.init().await?;

// Send some messages to the topic
let messages = vec![IggyMessage::from_str("Hello Apache Iggy")?];
producer.send(messages).await?;

// Create a consumer for the given stream and one of its topics
let mut consumer = client
    .consumer_group("my_app", "dev01", "events")?
    .auto_commit(AutoCommit::IntervalOrWhen(
        IggyDuration::from_str("1s")?,
        AutoCommitWhen::ConsumingAllMessages,
    ))
    .create_consumer_group_if_not_exists()
    .auto_join_consumer_group()
    .polling_strategy(PollingStrategy::next())
    .poll_interval(IggyDuration::from_str("1ms")?)
    .batch_length(1000)
    .build();

consumer.init().await?;

// Start consuming the messages
while let Some(message) = consumer.next().await {
    // Handle the message
}

Benchmarks

Benchmarks should be the first-class citizens. We believe that performance is crucial for any system, and we strive to provide the best possible performance for our users. Please check, why we believe that the transparent benchmarking is so important.

We've also built the benchmarking platform where anyone can upload the benchmarks and compare the results with others. Source code for the platform is available in the core/bench/dashboard directory.

server

For the benchmarking purposes, we've developed the dedicated iggy-bench tool, which is a part of the iggy project. It is a command-line tool that allows you to run the variety of fully customizable benchmarks.

server

To benchmark the project, first build the project in release mode:

cargo build --release

Then, run the benchmarking app with the desired options:

  1. Sending (writing) benchmark

    cargo run --bin iggy-bench -r -- -v pinned-producer tcp
  2. Polling (reading) benchmark

    cargo run --bin iggy-bench -r -- -v pinned-consumer tcp
  3. Parallel sending and polling benchmark

    cargo run --bin iggy-bench -r -- -v pinned-producer-and-consumer tcp
  4. Balanced sending to multiple partitions benchmark

    cargo run --bin iggy-bench -r -- -v balanced-producer tcp
  5. Consumer group polling benchmark:

    cargo run --bin iggy-bench -r -- -v balanced-consumer-group tcp
  6. Parallel balanced sending and polling from consumer group benchmark:

    cargo run --bin iggy-bench -r -- -v balanced-producer-and-consumer-group tcp
  7. End to end producing and consuming benchmark (single task produces and consumes messages in sequence):

    cargo run --bin iggy-bench -r -- -v end-to-end-producing-consumer tcp

These benchmarks would start the server with the default configuration, create a stream, topic and partition, and then send or poll the messages. The default configuration is optimized for the best performance, so you might want to tweak it for your needs. If you need more options, please refer to iggy-bench subcommands help and examples.

For example, to run the benchmark for the already started server, provide the additional argument --server-address 0.0.0.0:8090.

Depending on the hardware, transport protocol (quic, tcp or http) and payload size (messages-per-batch * message-size) you might expect over 5000 MB/s (e.g. 5M of 1 KB msg/sec) throughput for writes and reads.

Iggy is already capable of processing millions of messages per second at the microseconds range for p99+ latency, and with the upcoming optimizations related to the io_uring support along with the shared-nothing design, it will only get better.

Please refer to the mentioned benchmarking platform where you can browse the results achieved on the different hardware configurations, using the different Iggy server versions.


Contributing

Please see Contributing

About

Iggy is the persistent message streaming platform written in Rust, supporting QUIC, TCP and HTTP transport protocols, capable of processing millions of messages per second.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Rust 64.6%
  • C# 10.3%
  • Java 6.7%
  • TypeScript 5.4%
  • Go 4.6%
  • Svelte 2.5%
  • Other 5.9%