Tags: Zohan13666/cortex.cpp
Hotfix: remove unneeded params from load_model (menloresearch#2209)
* refactor: remove --pooling flag from model loading
  The --pooling flag was removed because mean pooling is not needed in chat models. This fixes the regression.
* feat(local-engine): add ctx_len parameter support
  Adds support for the ctx_len parameter by appending --ctx-size with its value. Removes outdated parameter mappings from kParamsMap so that it matches the current implementation.
* feat: add conditional model parameters based on path
  When the model path contains both "jan" and "nano" (case-insensitive), speculative decoding parameters are added automatically to adjust generation behavior, so that these models need no manual parameter tuning. Also includes the headers needed for string manipulation and fixes whitespace in ctx_len handling.
* chore: remove redundant comment
  The comment was removed because the code's purpose is clear without it.
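The ctx_len mapping and the path-based condition described in this entry can be sketched roughly as follows. This is a minimal illustration, not the code from the pull request: the helper names `ToLower` and `BuildLoadArgs` are hypothetical, and because the entry does not say which speculative decoding flags are added, they are passed in by the caller as an opaque list.

```cpp
#include <algorithm>
#include <cctype>
#include <optional>
#include <string>
#include <vector>

// Lower-case an ASCII string so that substring checks ignore case.
std::string ToLower(std::string s) {
  std::transform(s.begin(), s.end(), s.begin(), [](unsigned char c) {
    return static_cast<char>(std::tolower(c));
  });
  return s;
}

// Hypothetical helper (not the project's actual function): builds the
// extra command-line arguments for loading a model.
// `speculative_args` stands in for the speculative decoding flags, which
// the entry above does not name.
std::vector<std::string> BuildLoadArgs(
    const std::string& model_path, std::optional<int> ctx_len,
    const std::vector<std::string>& speculative_args) {
  std::vector<std::string> args;

  // ctx_len is forwarded as "--ctx-size <value>" when it is set.
  if (ctx_len) {
    args.push_back("--ctx-size");
    args.push_back(std::to_string(*ctx_len));
  }

  // Add the speculative decoding flags only when the path contains both
  // "jan" and "nano", compared case-insensitively.
  const std::string lowered = ToLower(model_path);
  if (lowered.find("jan") != std::string::npos &&
      lowered.find("nano") != std::string::npos) {
    args.insert(args.end(), speculative_args.begin(), speculative_args.end());
  }
  return args;
}
```

Passing the flags in as a parameter keeps the sketch free of guesses about the real flag names. A caller would supply whatever flags the engine actually expects.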
fix: check model is loaded before starting (menloresearch#2206)
* fix: check model is loaded before starting
* chore: e2e test
Co-authored-by: sangjanai <[email protected]>
fix: remove jinja parameter (menloresearch#2205)
* fix: remove jinja parameter
* chore: disable linux arm CI
Co-authored-by: sangjanai <[email protected]>
Merge pull request menloresearch#2186 from menloresearch/s/chore/sync… …-dev
fix: std::filesystem::equivalent does not work for a nonexistent path
Merge pull request menloresearch#2178 from menloresearch/dev
chore: sync dev to main
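For context on the std::filesystem::equivalent issue named in this entry: the function reports an error when a path it is given does not exist, so it cannot be used on its own to compare paths that may not have been created yet. One possible workaround is sketched below. The helper name `SamePath` and the fallback strategy are assumptions for illustration, not the fix applied in the pull request.

```cpp
#include <filesystem>
#include <system_error>

namespace fs = std::filesystem;

// Hypothetical helper: returns true when `a` and `b` refer to the same
// location, without throwing when either path does not exist.
bool SamePath(const fs::path& a, const fs::path& b) {
  std::error_code ec;

  // equivalent() is only reliable when both paths exist on disk.
  if (fs::exists(a, ec) && fs::exists(b, ec)) {
    const bool same = fs::equivalent(a, b, ec);
    if (!ec) return same;
  }

  // Fallback: resolve the part of each path that exists and normalize
  // the rest, then compare the results as paths.
  const fs::path na = fs::weakly_canonical(a, ec);
  if (ec) return a.lexically_normal() == b.lexically_normal();
  const fs::path nb = fs::weakly_canonical(b, ec);
  if (ec) return a.lexically_normal() == b.lexically_normal();
  return na == nb;
}
```

The fallback compares path text, so it cannot detect two different nonexistent spellings that would resolve to the same file through symlinks once created.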