LogLens is a high-performance, scalable log analytics engine built in Go. It enables real-time ingestion, search, and analysis of structured logs at scale.
With support for high-throughput ingestion, efficient indexing using Bleve, batched writes, write-ahead logging (WAL), compression, and time-based retention, LogLens is designed to be the backbone of modern observability pipelines.
- ✅ **High Throughput Ingestion**: Handle thousands of logs per second with minimal overhead
- ✅ **Real-Time Search**: Instant querying using the Bleve full-text search index
- ✅ **Batching & Compression**: Efficient disk writes with Zstandard compression
- ✅ **Write-Ahead Logging (WAL)**: Ensures durability and crash recovery
- ✅ **Time-Based Retention**: Auto-delete logs beyond a configured threshold
- ✅ **REST API**: Simple HTTP endpoints for ingestion and querying
- ✅ **Performance Monitoring**: Built-in stats and metrics endpoint
- ✅ **Load Testing Ready**: Comes with Vegeta scripts for benchmarking
*Diagram generated by GoTypeGraph.*
The system processes logs through several stages:
- Ingestion
- Buffering
- Indexing
- Storage
- Search
- Retention
- Accepts JSON-formatted logs via HTTP POST requests.
- Tags logs using custom headers (`KV-environment`, `KV-level`, etc.).
- Sends logs asynchronously to an internal channel for processing (see the sketch below).
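For illustration, here is a minimal sketch of what such a handler could look like. The `LogEntry` type, the `entryChan` buffer size, and the handler itself are hypothetical stand-ins for LogLens internals, not the actual code:

```go
package main

import (
	"encoding/json"
	"log"
	"net/http"
	"strings"
	"time"
)

// LogEntry is a hypothetical stand-in for the internal log record type.
type LogEntry struct {
	Fields   map[string]any
	Tags     map[string]string
	Ingested time.Time
}

var entryChan = make(chan LogEntry, 1024) // illustrative buffer size

func handleIngest(w http.ResponseWriter, r *http.Request) {
	var fields map[string]any
	if err := json.NewDecoder(r.Body).Decode(&fields); err != nil {
		http.Error(w, "invalid JSON", http.StatusBadRequest)
		return
	}
	// Collect KV-* headers as tags. Go canonicalizes header names,
	// so "KV-environment" arrives as "Kv-Environment".
	tags := map[string]string{}
	for name, values := range r.Header {
		if strings.HasPrefix(name, "Kv-") && len(values) > 0 {
			tags[strings.TrimPrefix(name, "Kv-")] = values[0]
		}
	}
	// Hand off to the pipeline without blocking the HTTP request.
	select {
	case entryChan <- LogEntry{Fields: fields, Tags: tags, Ingested: time.Now()}:
		w.WriteHeader(http.StatusAccepted)
	default: // apply back-pressure instead of blocking
		http.Error(w, "ingest buffer full", http.StatusServiceUnavailable)
	}
}

func main() {
	http.HandleFunc("/", handleIngest)
	log.Fatal(http.ListenAndServe(":8080", nil))
}
```

Returning 202 before the entry is durable trades strict durability at the HTTP boundary for throughput; the WAL stage described below is what makes the entry crash-safe shortly after.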
- Stores recent logs in memory for fast access and partial indexing.
- Flushes logs to disk when the buffer reaches a threshold or the day changes.
- Periodically indexes batches in memory for fast queries.
- Ensures durability before logs are flushed to disk.
- Prevents data loss during crashes by replaying the WAL on startup.
- Logs are written synchronously with `Sync()` (this can be optimized; see the sketch below).
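A minimal sketch of the append-then-`Sync()` pattern this stage relies on (the `WAL` type and record framing here are illustrative, not the LogLens implementation):

```go
package main

import "os"

// WAL is an illustrative write-ahead log handle.
type WAL struct {
	f *os.File
}

func OpenWAL(path string) (*WAL, error) {
	f, err := os.OpenFile(path, os.O_CREATE|os.O_WRONLY|os.O_APPEND, 0o644)
	if err != nil {
		return nil, err
	}
	return &WAL{f: f}, nil
}

// Append writes a record and fsyncs, so the bytes survive a crash even
// before the in-memory buffer is flushed to a batch file.
func (w *WAL) Append(record []byte) error {
	if _, err := w.f.Write(record); err != nil {
		return err
	}
	// This per-write Sync is the cost noted above as "can be optimized",
	// e.g. by grouping several appends per fsync.
	return w.f.Sync()
}

func main() {
	w, err := OpenWAL("loglens.wal")
	if err != nil {
		panic(err)
	}
	defer w.f.Close()
	if err := w.Append([]byte("example record\n")); err != nil {
		panic(err)
	}
}
```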
- Uses Bleve to build a full-text search index.
- Indexes logs in the background via a worker pool.
- Handles complex queries with filters, time ranges, and text matching.
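As a self-contained illustration of the Bleve API this stage builds on (the document fields and query string are invented for the demo; LogLens performs the `Index` calls from a background worker pool rather than inline):

```go
package main

import (
	"fmt"

	"github.com/blevesearch/bleve/v2"
)

func main() {
	// Create an index with the default mapping.
	mapping := bleve.NewIndexMapping()
	index, err := bleve.New("example.bleve", mapping)
	if err != nil {
		panic(err)
	}
	defer index.Close()

	// Index a log-like document under an ID.
	doc := map[string]any{"level": "error", "message": "disk quota exceeded"}
	if err := index.Index("log-1", doc); err != nil {
		panic(err)
	}

	// Query-string syntax combines field filters and free text.
	query := bleve.NewQueryStringQuery("level:error quota")
	req := bleve.NewSearchRequest(query)
	res, err := index.Search(req)
	if err != nil {
		panic(err)
	}
	fmt.Println(res.Total, "hit(s)")
}
```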
- Serializes logs into a binary format and compresses them using Zstandard (a sketch of this pipeline follows).
- Writes compressed logs to disk in files named like `123456789-987654321.lens`; the numbers represent the start and end timestamps (in microseconds since epoch) of the batch's ingestion window.
- Retrieves logs from batches based on position offsets.
- Deletes old batches as part of retention cleanup.
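The sketch below isolates the serialize-then-compress step using `encoding/gob` and `github.com/klauspost/compress/zstd` (a common Go Zstandard binding; whether LogLens uses this exact library is an assumption). The `Log` type is a stand-in, and the size-prefixed `.lens` framing described later is omitted here:

```go
package main

import (
	"bytes"
	"encoding/gob"
	"os"
	"time"

	"github.com/klauspost/compress/zstd"
)

// Log is an illustrative record type; the real schema may differ.
type Log struct {
	Timestamp int64
	Level     string
	Message   string
}

// writeBatch gob-encodes the batch straight into a Zstandard encoder,
// then writes the compressed bytes to disk in one shot.
func writeBatch(path string, batch []Log) error {
	var buf bytes.Buffer
	enc, err := zstd.NewWriter(&buf)
	if err != nil {
		return err
	}
	if err := gob.NewEncoder(enc).Encode(batch); err != nil {
		return err
	}
	if err := enc.Close(); err != nil { // flush the final compressed frame
		return err
	}
	return os.WriteFile(path, buf.Bytes(), 0o644)
}

func main() {
	batch := []Log{{Timestamp: time.Now().UnixMicro(), Level: "info", Message: "hello"}}
	if err := writeBatch("example.lens", batch); err != nil {
		panic(err)
	}
}
```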
- Manages file I/O operations (read/write/delete).
- Generates file paths based on ingestion date (see the path sketch below).
- Caches open file handles for faster access.
- Compressed `.lens` files are stored in a directory structure organized by year/month/day.
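A sketch of date-based path generation matching the layout described here; the `/data` root and the exact name format are taken from the examples in this README:

```go
package main

import (
	"fmt"
	"path/filepath"
	"time"
)

// batchPath builds /data/YYYY/MM/DD/<start>-<end>.lens, with timestamps
// in microseconds since epoch as the README describes.
func batchPath(root string, start, end time.Time) string {
	dir := filepath.Join(root, start.Format("2006"), start.Format("01"), start.Format("02"))
	name := fmt.Sprintf("%d-%d.lens", start.UnixMicro(), end.UnixMicro())
	return filepath.Join(dir, name)
}

func main() {
	now := time.Now()
	fmt.Println(batchPath("/data", now.Add(-time.Minute), now))
}
```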
- Periodically scans for and deletes logs older than a configured number of days (see the sketch below).
- Queries the index to find expired logs.
- Deletes matching batches from disk.
- Reports statistics such as freed space and deleted log counts.
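A hedged sketch of the disk half of retention: walk the data directory and delete any batch whose end timestamp (the second number in the filename, in microseconds since epoch) falls outside the retention window. The real worker also queries the index and removes the matching index entries, which this sketch skips:

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
	"strconv"
	"strings"
	"time"
)

// purgeExpired deletes .lens batches whose ingestion window ended before
// the retention cutoff.
func purgeExpired(root string, retention time.Duration) (deleted int, err error) {
	cutoff := time.Now().Add(-retention).UnixMicro()
	err = filepath.WalkDir(root, func(path string, d os.DirEntry, err error) error {
		if err != nil || d.IsDir() || !strings.HasSuffix(path, ".lens") {
			return err
		}
		name := strings.TrimSuffix(filepath.Base(path), ".lens")
		parts := strings.SplitN(name, "-", 2)
		if len(parts) != 2 {
			return nil // not a batch file we recognize
		}
		end, perr := strconv.ParseInt(parts[1], 10, 64)
		if perr != nil {
			return nil
		}
		if end < cutoff {
			if rerr := os.Remove(path); rerr != nil {
				return rerr
			}
			deleted++
		}
		return nil
	})
	return deleted, err
}

func main() {
	n, err := purgeExpired("/data", 30*24*time.Hour)
	fmt.Println("deleted", n, "batches; err:", err)
}
```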
- Supports hybrid query execution:
  - First searches the in-memory buffer.
  - Then searches indexed logs from disk.
  - Merges results and returns unified output (see the merge sketch below).
- Includes frequency maps, pagination, and retrieval-time tracking.
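A small sketch of the merge step, assuming hits from both sources carry a timestamp; the `Result` type and the newest-first ordering are assumptions for the demo:

```go
package main

import (
	"fmt"
	"sort"
)

// Result is an illustrative hit type.
type Result struct {
	Timestamp int64
	Message   string
}

// hybridSearch merges in-memory and on-disk hits by timestamp and applies
// simple offset/limit pagination.
func hybridSearch(memHits, diskHits []Result, offset, limit int) []Result {
	merged := append(append([]Result{}, memHits...), diskHits...)
	sort.Slice(merged, func(i, j int) bool {
		return merged[i].Timestamp > merged[j].Timestamp // newest first
	})
	if offset >= len(merged) {
		return nil
	}
	end := offset + limit
	if end > len(merged) {
		end = len(merged)
	}
	return merged[offset:end]
}

func main() {
	mem := []Result{{3, "from buffer"}}
	disk := []Result{{1, "old"}, {2, "older-ish"}}
	fmt.Println(hybridSearch(mem, disk, 0, 2))
}
```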
Here's how a log flows through the system:
1. **Ingest**
   - HTTP request received → parsed → sent to `entryChan`
2. **Buffer**
   - Log is added to the in-memory buffer
   - Assigned a position (offset in buffer)
3. **WAL**
   - Log is written to the Write-Ahead Log (with sync)
   - Ensures crash recovery
4. **Indexing**
   - Logs are periodically indexed in memory
   - When the buffer reaches its threshold, logs are flushed
5. **Flush**
   - Buffered logs are grouped into a batch
   - The batch is compressed and written to disk
   - Logs are also indexed in Bleve asynchronously
6. **Search**
   - The query hits both the memory buffer and indexed logs
   - Matching positions are used to retrieve the actual logs
   - Results are merged and returned
7. **Retention**
   - A periodic scan finds old logs
   - Batches containing old logs are deleted
   - Index entries for those logs are removed
Each `.lens` file contains:

```
[Header: Magic "LENS"] [Version] [Length]
[Repeated: [Log Size] [Serialized Log]]
```
- Logs are serialized using `gob`
- The entire batch is compressed using Zstandard
- Files are organized by date directories:

```
/data/
└── 2025/
    └── 04/
        └── 05/
            └── 123456789-987654321.lens
```
Each filename like `123456789-987654321.lens` indicates the start and end timestamps (in microseconds since epoch) of the batch's ingestion window.
- **Buffer Flush Worker**
  - Listens on `entryChan`
  - Flushes the buffer when it reaches the threshold or the day changes (a sketch of this loop follows the list)
- **Indexing Worker**
  - Listens on `consumeBatchChan`
  - Indexes logs in the background
- **Retention Worker**
  - Runs daily
  - Deletes expired logs based on the time threshold
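A sketch of what the buffer-flush loop could look like; the channel payload type, the threshold, and the minute-level day-change check are all illustrative, not the actual worker code:

```go
package main

import (
	"fmt"
	"time"
)

// flushWorker drains entryChan into a buffer and flushes when the buffer
// hits its threshold or the day rolls over.
func flushWorker(entryChan <-chan string, threshold int) {
	var buffer []string
	day := time.Now().Day()
	ticker := time.NewTicker(time.Minute) // periodic day-change check
	defer ticker.Stop()

	flush := func(reason string) {
		if len(buffer) == 0 {
			return
		}
		fmt.Printf("flushing %d logs (%s)\n", len(buffer), reason)
		buffer = buffer[:0] // real code would compress and write a batch here
	}

	for {
		select {
		case entry, ok := <-entryChan:
			if !ok {
				flush("shutdown")
				return
			}
			buffer = append(buffer, entry)
			if len(buffer) >= threshold {
				flush("threshold reached")
			}
		case <-ticker.C:
			if d := time.Now().Day(); d != day {
				day = d
				flush("day changed")
			}
		}
	}
}

func main() {
	ch := make(chan string, 8)
	go flushWorker(ch, 3)
	for i := 0; i < 5; i++ {
		ch <- fmt.Sprintf("log %d", i)
	}
	close(ch)
	time.Sleep(100 * time.Millisecond) // let the worker drain
}
```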
- Batching: Logs are buffered and flushed in batches to reduce disk I/O.
- Compression: Uses Zstandard to reduce storage footprint.
- WAL: Ensures durability even if the system crashes before a flush.
- Efficient Search: Hybrid search between memory buffer and persisted logs.
- File Format Design: Binary format with header metadata + compressed payload.
Use the provided Vegeta scripts to simulate high-load scenarios:
```bash
# Run load test with 3000 RPS for 10 seconds
./vegeta.bash 3000 10s
```

Monitor system behavior using:

```bash
go tool pprof http://localhost:6060/debug/pprof/profile?seconds=30
```

| Method | Endpoint | Description |
|---|---|---|
| POST | `/` | Ingest a new log |
| GET | `/` | Search logs |
| GET | `/range-count?start=YYYY-MM-DD&end=YYYY-MM-DD` | Get count over time range |
Headers like `KV-environment` and `KV-level` allow tagging logs with metadata.
Each `.lens` file contains:

```
[Magic "LENS"] [Version (1 byte)] [Log Count (4 bytes)]
[Log Size (4 bytes)][Log Payload...]
[Log Size (4 bytes)][Log Payload...]
...
```
All payloads are compressed using Zstandard.
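A sketch of that layout in Go, following the magic, version, count, and size-prefix framing above; the byte order is an assumption, since the README does not specify it:

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

// encodeBatch lays out a batch as: "LENS" magic, 1-byte version,
// 4-byte log count, then [4-byte size][payload] per log.
func encodeBatch(version byte, logs [][]byte) []byte {
	var buf bytes.Buffer
	buf.WriteString("LENS")
	buf.WriteByte(version)
	binary.Write(&buf, binary.LittleEndian, uint32(len(logs)))
	for _, payload := range logs {
		binary.Write(&buf, binary.LittleEndian, uint32(len(payload)))
		buf.Write(payload)
	}
	return buf.Bytes()
}

func main() {
	raw := encodeBatch(1, [][]byte{[]byte("log one"), []byte("log two")})
	fmt.Printf("% x\n", raw[:9]) // magic + version + count
}
```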
The Write-Ahead Log ensures that logs are not lost during unexpected shutdowns. On startup, the system replays the WAL to rebuild the in-memory buffer.
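A minimal sketch of startup replay, under the assumption of one record per line; the real WAL record framing is a LogLens internal detail:

```go
package main

import (
	"bufio"
	"fmt"
	"os"
)

// replayWAL reads the WAL back record by record to rebuild the
// in-memory buffer after a restart.
func replayWAL(path string) ([]string, error) {
	f, err := os.Open(path)
	if err != nil {
		if os.IsNotExist(err) {
			return nil, nil // nothing to replay on first start
		}
		return nil, err
	}
	defer f.Close()

	var buffer []string
	scanner := bufio.NewScanner(f)
	for scanner.Scan() {
		buffer = append(buffer, scanner.Text())
	}
	return buffer, scanner.Err()
}

func main() {
	buf, err := replayWAL("loglens.wal")
	fmt.Printf("recovered %d entries, err=%v\n", len(buf), err)
}
```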
Old logs can be automatically deleted after a configurable number of days. A cron job scans and removes expired data from both index and disk.
- UI dashboard for visualization (WIP)
- Query caching layer (WIP)
- Support for TLS and authentication
- Alerting system with custom triggers
Contributions are welcome! Whether it's performance improvements, bug fixes, or feature additions, feel free to open an issue or PR.