Thanks to visit codestin.com
Credit goes to github.com

Skip to content

[Feature Request]: {Managed IO Iceberg} - Allow users to run streaming reads #33092

@eder001

Description

@eder001

What would you like to happen?

Icerberg users normally look for implement real-time pipeline tracking and consuming new data inserted to a Iceberg Table. Today, Spark can read in streaming using Spark Structured Streaming (through Iceberg Snapshots) so any new data loaded to an iceberg table is automatically identify and read by a Spark streaming job helping users to implement a real-time ingestion on a medallion architecture.

Issue Priority

Priority: 2 (default / most feature requests should be filed as P2)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam YAML
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Infrastructure
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions