tweaks & bugfixes #35

thushan · 2025-08-02T10:58:27Z

This PR does some maintenance work:

Added a new CLI flag (-c or --config so you can pass in a config file),
Check on startup if port is in use and bail if it is (especially problematic in Windows it seems)
adds better logging with context-aware (requestid etc)
Friendlier error messages for discovery, similarly styled to Proxy now.

One important bugfix was to resolve model unification when the same model differs on servers (by digest). This is usually when a model is updated on one but not the other, we would get things like:

hf.co/unsloth/Qwen3-32B-GGUF:Q4_K_XL-75fb8ad3
qwen3:32b-8ade7840

And automation tools (mostly our suite) would fail to query those because they dont exist - aren't routable. They now use the id properly.

hf.co/unsloth/Qwen3-32B-GGUF:Q4_K_XL
qwen3:32b

Summary by CodeRabbit

New Features

Added command-line flags to specify a custom configuration file.
Enhanced HTTP service startup to check for port availability and prevent conflicts.
Introduced user-friendly error messages for discovery-related issues.
Integrated enhanced and access logging middleware for structured request logging.
Improved context-aware logging throughout proxy and discovery services.

Improvements

Model IDs now use the first alias for better routing compatibility and consistency.
Configuration loading now prioritises command-line flags, then environment variables, then defaults.
Security and rate limiting middleware chains now include enhanced and access logging layers.

Bug Fixes

Updated tests to reflect changes in model ID handling and logging behaviour.

Tests

Added comprehensive unit tests for new logging middleware and user-friendly error messages.

…rrectly. Incorrectly using the model ID when the same model name has different digests. We'd get models like this: hf.co/unsloth/Qwen3-32B-GGUF:Q4_K_XL-75fb8ad3 When it needed to be: hf.co/unsloth/Qwen3-32B-GGUF:Q4_K_XL interestingly the digest clash isn't for LMStudio.

coderabbitai · 2025-08-02T10:58:32Z

Walkthrough

This update introduces enhanced logging middleware for HTTP requests, context-aware logging in proxy and discovery services, and user-friendly error messages for discovery errors. It changes model ID selection logic to prioritise aliases for routing compatibility, updates configuration loading to support a command-line flag, and adds a port availability check before starting the HTTP service. Associated tests are included for new utilities and behaviour.

Changes

Cohort / File(s)	Change Summary
Model ID Alias Handling `internal/adapter/converter/openai_converter.go`, `internal/adapter/converter/unified_converter.go`, `internal/adapter/converter/openai_converter_test.go`, `internal/adapter/converter/unified_converter_test.go`	Model ID assignment in converters now uses the first alias if available, instead of the original model ID, for routing compatibility. Corresponding tests updated to expect alias-based IDs.
Discovery Error Handling and Logging `internal/adapter/discovery/errors.go`, `internal/adapter/discovery/errors_test.go`, `internal/adapter/discovery/service.go`	Adds `GetUserFriendlyMessage` for concise error messages, introduces structured/contextual logging in discovery service, and provides tests for error message mapping.
Context-Aware Proxy Logging `internal/adapter/proxy/olla/service.go`, `internal/adapter/proxy/sherpa/service.go`	Logging in proxy services now uses context-derived loggers when available, with additional request metadata and improved log detail.
HTTP Logging Middleware `internal/app/middleware/logging.go`, `internal/app/middleware/logging_test.go`	Introduces enhanced and access logging middleware for HTTP handlers, with request ID propagation, structured logging, and byte formatting utilities. Comprehensive tests included.
Security Middleware Integration `internal/app/handlers/application.go`	SecurityAdapters now includes a logger and wraps handlers with enhanced logging middleware for both chain and rate limit middleware.
HTTP Service Port Check `internal/app/services/http.go`, `internal/util/network.go`	Adds a utility to check TCP port availability and applies it before HTTP server startup to prevent port conflicts.
Config Loading via Flag `internal/config/config.go`, `main.go`	Config loader now accepts an optional flag for the config file path, with priority over environment variable and defaults. Main function updated to support `-c`/`-config` flags.

Sequence Diagram(s)

Enhanced HTTP Request Logging Middleware

sequenceDiagram
    participant Client
    participant Middleware
    participant Handler
    participant Logger

    Client->>Middleware: HTTP Request
    Middleware->>Logger: Log request start (method, path, etc.)
    Middleware->>Handler: ServeHTTP (with context)
    Handler-->>Middleware: Response
    Middleware->>Logger: Log request completion (status, duration, bytes)
    Middleware-->>Client: HTTP Response

Model Conversion with Alias ID

sequenceDiagram
    participant Converter
    participant Model

    Converter->>Model: Get ID, Get Aliases
    alt Aliases exist
        Converter->>Converter: Use first alias as model ID
    else No aliases
        Converter->>Converter: Use original model ID
    end
    Converter-->>Caller: Return model data with selected ID

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~18 minutes

Possibly related PRs

feat: olla profile #32: Enhances model routing and profile handling, including capability-based routing and model aliasing, which is directly related to the changes in model ID selection and routing compatibility in this PR.

Note

⚡️ Unit Test Generation is now available in beta!

Learn more here, or try it out under "Finishing Touches" below.

📜 Recent review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e8bae0e and 2f7dc75.

📒 Files selected for processing (1)

internal/app/middleware/logging.go (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

internal/app/middleware/logging.go

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feature/tweaks

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Explain this complex logic.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai explain this code block.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and explain its main purpose.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai generate unit tests to generate unit tests for this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (5)

internal/adapter/converter/openai_converter.go (1)
51-57: Fix typo in comment.

There's a typo in line 52: "alas" should be "alias".
-	// we need to use first alas as ID for routing compatibility
+	// we need to use first alias as ID for routing compatibility
internal/adapter/converter/unified_converter.go (1)
78-84: Fix typo in comment.

There's a typo in line 79: "alas" should be "alias".
-	// we need to use first alas as ID for routing compatibility
+	// we need to use first alias as ID for routing compatibility
internal/util/network.go (1)
51-59: Potential race condition in port availability check.

The function correctly checks port availability, but there's a race condition between checking and the actual server start. Another process could claim the port after this check but before the HTTP server binds to it.

Consider these alternatives:

Move this check closer to the actual server start

Let the HTTP server handle port binding errors directly

Use a more robust approach that reserves the port until the server starts
-// IsPortAvailable checks if a port is available by attempting to bind to it
-func IsPortAvailable(host string, port int) bool {
-	listener, err := net.Listen("tcp", fmt.Sprintf("%s:%d", host, port))
-	if err != nil {
-		return false
-	}
-	defer listener.Close()
-	return true
-}
+// IsPortAvailable checks if a port is available by attempting to bind to it
+// Note: There's a race condition between checking and actual usage
+func IsPortAvailable(host string, port int) bool {
+	listener, err := net.Listen("tcp", fmt.Sprintf("%s:%d", host, port))
+	if err != nil {
+		return false
+	}
+	defer listener.Close()
+	return true
+}
internal/app/middleware/logging.go (1)
172-184: Add input validation to formatBytes function.

The function handles most cases well, but should validate input to prevent potential issues with negative values.
-func formatBytes(bytes int64) string {
-	const unit = 1024
-	if bytes < unit {
-		return fmt.Sprintf("%dB", bytes)
-	}
+func formatBytes(bytes int64) string {
+	const unit = 1024
+	if bytes < 0 {
+		return "0B"
+	}
+	if bytes < unit {
+		return fmt.Sprintf("%dB", bytes)
+	}
main.go (1)
48-49: Well-implemented command-line flags for config file specification.

Both short (-c) and long (--config) flag variants provide good user experience. The implementation correctly points both flags to the same variable.

Consider adding a brief comment to clarify that both flags serve the same purpose:
+	// Config file flags (both -c and -config point to the same variable)
 	flag.StringVar(&configFile, "c", "", "Config file path")
 	flag.StringVar(&configFile, "config", "", "Config file path")

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4f44e29 and e8bae0e.

📒 Files selected for processing (16)

internal/adapter/converter/openai_converter.go (1 hunks)
internal/adapter/converter/openai_converter_test.go (1 hunks)
internal/adapter/converter/unified_converter.go (1 hunks)
internal/adapter/converter/unified_converter_test.go (3 hunks)
internal/adapter/discovery/errors.go (2 hunks)
internal/adapter/discovery/errors_test.go (1 hunks)
internal/adapter/discovery/service.go (2 hunks)
internal/adapter/proxy/olla/service.go (5 hunks)
internal/adapter/proxy/sherpa/service.go (4 hunks)
internal/app/handlers/application.go (4 hunks)
internal/app/middleware/logging.go (1 hunks)
internal/app/middleware/logging_test.go (1 hunks)
internal/app/services/http.go (2 hunks)
internal/config/config.go (2 hunks)
internal/util/network.go (1 hunks)
main.go (2 hunks)

🧰 Additional context used

📓 Path-based instructions (2)

**/*_test.go

📄 CodeRabbit Inference Engine (CLAUDE.md)

Unit tests should test individual components in isolation.

Files:

internal/adapter/converter/openai_converter_test.go
internal/adapter/discovery/errors_test.go
internal/app/middleware/logging_test.go
internal/adapter/converter/unified_converter_test.go

internal/{app,adapter}/**/*.go

📄 CodeRabbit Inference Engine (CLAUDE.md)

Endpoints should be exposed at /internal/health and /internal/status.

Files:

internal/adapter/converter/openai_converter_test.go
internal/adapter/discovery/errors_test.go
internal/adapter/converter/openai_converter.go
internal/adapter/converter/unified_converter.go
internal/adapter/discovery/service.go
internal/adapter/discovery/errors.go
internal/adapter/proxy/sherpa/service.go
internal/app/middleware/logging_test.go
internal/app/middleware/logging.go
internal/app/services/http.go
internal/app/handlers/application.go
internal/adapter/proxy/olla/service.go
internal/adapter/converter/unified_converter_test.go

🧠 Learnings (9)

📚 Learning: applies to config.yaml : the main configuration should be defined in `config.yaml`....

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to config.yaml : The main configuration should be defined in `config.yaml`.

Applied to files:

main.go

📚 Learning: applies to internal/adapter/proxy/*_test.go : shared proxy tests should ensure compatibility between...

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to internal/adapter/proxy/*_test.go : Shared proxy tests should ensure compatibility between both proxy engines.

Applied to files:

internal/adapter/converter/openai_converter_test.go
internal/adapter/proxy/sherpa/service.go
internal/app/middleware/logging_test.go
internal/adapter/proxy/olla/service.go
internal/adapter/converter/unified_converter_test.go

📚 Learning: applies to **/*_test.go : unit tests should test individual components in isolation....

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to **/*_test.go : Unit tests should test individual components in isolation.

Applied to files:

internal/adapter/discovery/errors_test.go
internal/app/middleware/logging_test.go
internal/adapter/converter/unified_converter_test.go

📚 Learning: applies to {proxy_sherpa.go,proxy_olla.go} : proxy implementations should be in `proxy_sherpa.go` an...

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to {proxy_sherpa.go,proxy_olla.go} : Proxy implementations should be in `proxy_sherpa.go` and `proxy_olla.go`.

Applied to files:

internal/adapter/proxy/sherpa/service.go
internal/app/services/http.go
internal/app/handlers/application.go
internal/adapter/proxy/olla/service.go

📚 Learning: applies to internal/adapter/proxy/*_test.go : integration tests should test the full request flow th...

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to internal/adapter/proxy/*_test.go : Integration tests should test the full request flow through the proxy.

Applied to files:

internal/adapter/proxy/sherpa/service.go
internal/app/middleware/logging_test.go
internal/adapter/proxy/olla/service.go
internal/adapter/converter/unified_converter_test.go

📚 Learning: applies to handler_proxy.go : request routing logic should be implemented in `handler_proxy.go`....

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to handler_proxy.go : Request routing logic should be implemented in `handler_proxy.go`.

Applied to files:

internal/adapter/proxy/sherpa/service.go
internal/app/handlers/application.go
internal/adapter/proxy/olla/service.go

📚 Learning: applies to internal/adapter/proxy/*_test.go : benchmark tests should measure performance of critical...

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.787Z
Learning: Applies to internal/adapter/proxy/*_test.go : Benchmark tests should measure performance of critical paths, proxy engine comparisons, connection pooling efficiency, and circuit breaker behavior.

Applied to files:

internal/adapter/proxy/sherpa/service.go
internal/app/middleware/logging_test.go
internal/adapter/proxy/olla/service.go

📚 Learning: applies to internal/adapter/proxy/*.go : expose the following response headers: `x-olla-endpoint`, `...

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.788Z
Learning: Applies to internal/adapter/proxy/*.go : Expose the following response headers: `X-Olla-Endpoint`, `X-Olla-Model`, `X-Olla-Backend-Type`, `X-Olla-Request-ID`, `X-Olla-Response-Time`.

Applied to files:

internal/adapter/proxy/sherpa/service.go
internal/app/middleware/logging.go
internal/app/services/http.go
internal/app/handlers/application.go
internal/adapter/proxy/olla/service.go
internal/adapter/converter/unified_converter_test.go

📚 Learning: applies to internal/{app,adapter}/**/*.go : endpoints should be exposed at `/internal/health` and `/...

Learnt from: CR
PR: thushan/olla#0
File: CLAUDE.md:0-0
Timestamp: 2025-07-27T12:59:29.788Z
Learning: Applies to internal/{app,adapter}/**/*.go : Endpoints should be exposed at `/internal/health` and `/internal/status`.

Applied to files:

internal/adapter/proxy/sherpa/service.go
internal/app/services/http.go
internal/app/handlers/application.go
internal/adapter/proxy/olla/service.go

🧬 Code Graph Analysis (8)

main.go (1)

internal/config/config.go (1)

Load (112-153)

internal/adapter/discovery/errors_test.go (1)

internal/adapter/discovery/errors.go (4)

NewDiscoveryError (33-42)

NetworkError (69-72)

ParseError (54-58)

GetUserFriendlyMessage (83-124)

internal/config/config.go (2)

internal/config/types.go (1)

Config (10-18)

internal/logger/logger.go (1)

Config (17-26)

internal/app/middleware/logging_test.go (2)

internal/app/middleware/logging.go (5)

GetLogger (49-54)

GetRequestID (57-62)

EnhancedLoggingMiddleware (65-120)

AccessLoggingMiddleware (123-170)

FormatBytes (187-189)

internal/logger/styled.go (2)

StyledLogger (12-35)

LogContext (71-74)

internal/app/middleware/logging.go (3)

theme/default.go (1)

Default (45-81)

internal/logger/styled.go (1)

StyledLogger (12-35)

internal/logger/logger.go (1)

DefaultDetailedCookie (30-30)

internal/app/services/http.go (1)

internal/util/network.go (1)

IsPortAvailable (52-59)

internal/app/handlers/application.go (2)

internal/logger/styled.go (1)

StyledLogger (12-35)

internal/app/middleware/logging.go (2)

EnhancedLoggingMiddleware (65-120)

AccessLoggingMiddleware (123-170)

internal/adapter/proxy/olla/service.go (2)

internal/app/middleware/logging.go (3)

GetLogger (49-54)

GetRequestID (57-62)

FormatBytes (187-189)

internal/adapter/proxy/common/errors.go (1)

ErrNoHealthyEndpoints (16-16)

🔇 Additional comments (38)

internal/adapter/converter/openai_converter_test.go (1)

92-92: Test correctly updated to match new alias-based model ID behaviour.

The test assertion has been properly updated to expect the first alias name ("phi4:latest") instead of the original model ID, which aligns with the implementation changes for routing compatibility.

internal/adapter/converter/openai_converter.go (1)

54-57: LGTM: Alias prioritisation logic is correct.

The implementation correctly prioritises the first alias for routing compatibility whilst maintaining backwards compatibility by falling back to the original model ID when no aliases exist.

internal/adapter/converter/unified_converter_test.go (2)

87-88: Test comment and assertion correctly updated.

The comment accurately explains the new behaviour of using the first alias for routing compatibility, and the assertion correctly expects the alias name instead of the original model ID.

115-115: Consistent test assertions across all filter scenarios.

The test assertions have been consistently updated across all filtering scenarios to expect the alias-based model IDs, ensuring comprehensive coverage of the new behaviour.

Also applies to: 143-143

internal/adapter/converter/unified_converter.go (1)

81-84: LGTM: Consistent alias prioritisation implementation.

The implementation correctly mirrors the OpenAI converter's approach, ensuring consistent behaviour across different response formats. The logic properly prioritises the first alias whilst maintaining backwards compatibility.

Also applies to: 87-87

internal/app/middleware/logging.go (2)

64-120: Excellent middleware implementation with comprehensive logging.

The middleware provides excellent request tracing capabilities with proper context propagation and metrics capture. The request ID generation and response size tracking are well implemented.

122-170: Well-structured access logging with detailed context.

The access logging middleware complements the enhanced logging nicely, providing detailed information suitable for file-based analysis. The use of context flags for detailed logging is a good pattern.

internal/app/services/http.go (1)

9-9: Good addition of util import for network utilities.

Clean import addition to support the new port availability check functionality.

main.go (2)

42-42: Good addition of configFile variable for command-line configuration.

The global variable properly supports the new configuration file flag functionality.

105-105: Correct integration with updated config.Load signature.

The config loading properly passes the configFile argument, aligning with the updated Load function signature that accepts optional config file paths.

internal/adapter/discovery/errors_test.go (1)

9-98: Excellent comprehensive test coverage for error message formatting.

The test suite thoroughly covers all error scenarios handled by GetUserFriendlyMessage, including:

Network connectivity issues (connectex, timeout)

HTTP status error conditions (404, 500)

Parse errors and generic network errors

Unknown error fallback cases

The table-driven approach and clear test case names make this maintainable and easy to understand. This aligns perfectly with the coding guideline to test components in isolation.

internal/adapter/discovery/errors.go (1)

82-124: LGTM! Well-designed user-friendly error messaging.

The GetUserFriendlyMessage function provides a clear, structured approach to converting technical discovery errors into user-friendly messages. The implementation correctly uses errors.As for type assertion, handles HTTP status code categorisation appropriately, and includes comprehensive fallback handling for various error scenarios.

The string matching patterns for network errors (lines 101-109) cover common connection issues effectively.

internal/adapter/discovery/service.go (3)

99-99: Enhance error logging with context.

Good integration of the user-friendly error messaging. The call to LogErrorsWithContext provides better error visibility during regular discovery operations.

104-111: Well-structured logging helper method.

The LogErrorsWithContext method effectively combines user-friendly messaging with detailed technical information through structured logging. The separation of user-facing and technical details is appropriate for operational visibility.

202-210: Improved error handling with structured context.

The enhanced error logging in handleDiscoveryError provides better operational insight by combining user-friendly messages with detailed error context. This aligns well with the broader logging improvements across the system.

internal/config/config.go (3)

112-112: Good design choice for backwards compatibility.

The variadic parameter approach maintains backwards compatibility whilst enabling command-line flag support. This is a clean solution for extending the function's capability.

118-123: Clear priority hierarchy implementation.

The priority logic (flag > environment variable > default paths) is well-implemented and clearly documented. The conditional logic correctly handles the precedence order.

141-147: Enhanced error handling for different config sources.

The improved error handling provides specific messages based on the configuration source, which aids in troubleshooting. The logic correctly handles both flag-specified and environment variable scenarios.

internal/adapter/proxy/sherpa/service.go (4)

44-44: Appropriate middleware import for enhanced logging.

The addition of the middleware import enables context-aware logging capabilities, supporting the broader logging infrastructure improvements.

215-241: Well-implemented context-aware logging pattern.

The conditional logging approach effectively leverages context loggers when available whilst maintaining backwards compatibility. The enhanced debug information (endpoint count) provides better operational visibility.

255-263: Enhanced request dispatch logging.

Good addition of request ID to the dispatch logging when using context logger. This provides better request traceability across the system.

365-391: Comprehensive completion metrics logging.

Excellent enhancement to the completion logging with formatted byte counts and request ID. The detailed metrics (latency breakdown, byte formatting) provide valuable operational insights whilst maintaining the existing debug-level logging as fallback.

internal/adapter/proxy/olla/service.go (4)

43-43: Consistent middleware integration.

Good addition of middleware import to enable context-aware logging, maintaining consistency with the sherpa proxy implementation.

398-423: Consistent context-aware logging implementation.

The conditional logging pattern matches the sherpa service implementation, providing consistency across proxy engines. The enhanced debug information improves operational visibility.

444-452: Enhanced dispatch logging with request tracking.

Good integration of request ID into dispatch logging when context logger is available, providing better request traceability.

597-598: Comprehensive completion metrics with context awareness.

Excellent enhancement providing detailed completion metrics with formatted byte counts and request ID when context logger is available. The implementation maintains consistency with the sherpa service whilst preserving fallback behaviour.

Also applies to: 650-676

internal/app/handlers/application.go (6)

12-12: LGTM!

The middleware package import is correctly added to support the new logging middleware functionality.

23-23: LGTM!

The logger field addition to SecurityAdapters struct enables proper dependency injection for logging middleware.

26-32: Excellent logging middleware integration!

The middleware chaining is well-structured with clear ordering: logging → access logging → security → handler. The implementation correctly wraps the next handler with both logging middleware layers.

53-53: LGTM!

The handler correctly serves the wrapped middleware with access logging applied.

58-65: Good consistency in middleware application.

Both chain and rate limit middleware now consistently apply the same logging middleware layers, ensuring uniform logging behaviour across all request types.

125-125: LGTM!

The SecurityAdapters is correctly initialised with the logger dependency for middleware usage.

internal/app/middleware/logging_test.go (6)

15-72: Comprehensive test coverage for EnhancedLoggingMiddleware.

The test effectively validates:

Context logger injection and retrieval

Request ID propagation and header setting

Handler execution flow

Response verification

The test follows good isolation principles by using a mock logger and testing the middleware in isolation.

74-109: Good test coverage for AccessLoggingMiddleware.

The test validates the middleware behaviour with proper request setup including headers, content length, and query parameters. The response verification is thorough.

111-131: Excellent test coverage for FormatBytes utility.

The test cases cover all the important byte size ranges from bytes to terabytes, ensuring the formatting function works correctly across different scales.

133-141: Good edge case testing for GetLogger.

Testing the default behaviour when no logger is present in context ensures robustness.

143-151: Good edge case testing for GetRequestID.

Testing the default behaviour when no request ID is present in context ensures robustness.

153-178: Complete mock implementation of StyledLogger interface.

The mock implementation correctly implements all methods from the StyledLogger interface, enabling proper isolation testing of the middleware components. The implementation is minimal but sufficient for testing purposes.

internal/app/middleware/logging.go

internal/app/services/http.go

thushan added 6 commits August 1, 2025 21:07

pass in config file in the CLI args

567f5c0

avoid starting if the port is in use

490123e

context aware logging

367a9ea

Merge branch 'feature/tweaks-context-logs' into feature/tweaks

1e93e7e

friendlier error messages for discovery too.

e8bae0e

thushan marked this pull request as ready for review August 2, 2025 10:59

coderabbitai bot reviewed Aug 2, 2025

View reviewed changes

internal/app/middleware/logging.go Outdated Show resolved Hide resolved

internal/app/services/http.go Show resolved Hide resolved

Validated in testing, we can use our Util.GenerateRequestId now.

2f7dc75

thushan merged commit c332636 into main Aug 2, 2025
3 checks passed

thushan deleted the feature/tweaks branch August 2, 2025 11:14

coderabbitai bot mentioned this pull request Sep 26, 2025

feat: backend/sglang #69

Merged

coderabbitai bot mentioned this pull request Oct 22, 2025

feat: anthropic / message logger (development only) #77

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

tweaks & bugfixes #35

tweaks & bugfixes #35

Uh oh!

thushan commented Aug 2, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Aug 2, 2025 •

edited

Loading

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

tweaks & bugfixes #35

tweaks & bugfixes #35

Uh oh!

Conversation

thushan commented Aug 2, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Aug 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Enhanced HTTP Request Logging Middleware

Model Conversion with Alias ID

Estimated code review effort

Possibly related PRs

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

thushan commented Aug 2, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Aug 2, 2025 •

edited

Loading