Tags: hhao/ollama
Limit GPU lib search for now (ollama#4777) * fix oneapi errors on windows 10
revert tokenize ffi (ollama#4761) * Revert "use `int32_t` for call to tokenize (ollama#4738)" This reverts commit 763bb65. * Revert "vocab only" This reverts commit bf54c84. * Revert "use ffi for tokenizing/detokenizing" This reverts commit 26a00a0.
use `int32_t` for call to tokenize (ollama#4738) * use `int32_t` for call to tokenize * variable naming * cleanup * fix crash
working on integration of multi-byte and multi-width runes (ollama#4549) * integrated runewidth for display management - fixed cursor movement for multi-width char * updated input and deletion of multi-byte chars * fixed line history with some exceptions * improved insert and add * fixed issues with moving across lines * end of line extra space tracking * saved changes * fixed end of line issues with empty spaces * worked some more * worked on end of line * fixed failed test * fixed minor inserting bug * fixed movement hotkeys * adjusted hotkeys * removed comments * Update readline/buffer.go (repeated review suggestions) Co-authored-by: Bruce MacDonald * deleted comments and duplicate code * removed duplicate code * added comments, refactored add function to use addChar * added helper to retrieve lineSpacing, renamed lineFlags for clarity * fixed remove() --------- Co-authored-by: Bruce MacDonald
Use flash attention flag for now (ollama#4580) * put flash attention behind flag for now * add test * remove print * up timeout for scheduler tests
Merge pull request ollama#4543 from ollama/mxyng/simple-safetensors simplify safetensors reading
fix the cpu estimatedTotal memory + get the expiry time for loading models (ollama#4461)
Merge pull request ollama#4323 from dhiltgen/sort_by_free Always use the sorted list of GPUs