This repository was archived by the owner on Jul 4, 2025. It is now read-only.
Tags: menloresearch/cortex.cpp
Tags
Hostfix: remove not needed params from load_model (#2209) * refactor: remove --pooling flag from model loading The --pooling flag was removed as the mean pooling functionality not needed in chat models. This fixes the regression * feat(local-engine): add ctx_len parameter support Adds support for the ctx_len parameter by appending --ctx-size with its value. Removed outdated parameter mappings from the kParamsMap to reflect current implementation details and ensure consistency. * feat: add conditional model parameters based on path When the model path contains both "jan" and "nano" (case-insensitive), automatically add speculative decoding parameters to adjust generation behavior. This improves flexibility by enabling environment-specific configurations without manual parameter tuning. Also includes necessary headers for string manipulation and fixes whitespace in ctx_len handling. * chore: remove redundant comment The comment was redundant as the code's purpose is clear without it, improving readability.
fix: check model is loaded before starting (#2206) * fix: check model is loaded before starting * chore: e2e test --------- Co-authored-by: sangjanai <[email protected]>
fix: remove jinja parameter (#2205) * fix: remove jinja parameter * chore: disable linux arm CI --------- Co-authored-by: sangjanai <[email protected]>
Merge pull request #2186 from menloresearch/s/chore/sync-dev fix: std::filesystem::equivalent does not work for non-exist path
PreviousNext