Kadita is a config-driven data platform that ingests data from multiple sources into an S3-based data lake powered by Apache Iceberg.
Inspired by the legendary sea goddess Kadita, this platform unifies your data streams like waves flowing into one ocean.
- Pluggable data sources:
- Databases: Postgres, MySQL, MongoDB
- 3rd-party APIs: Jira, Zendesk
- File-based: JSON, CSV from S3
- Separation of concerns:
DataSource: defines connection info for databases, APIs, or buckets.TableConfig: defines ingestion rules (tables, primary keys, indexing, etc.).
- Target storage:
- All ingested data lands in S3 β Iceberg tables.
- Kubernetes-inspired YAML config:
- Declarative, easy to version-control, portable.