Description
I'm experimenting with LLaMA.cpp on an M1 laptop with 32 GB of RAM. Somehow inference is broken for me.
I'm expecting something reasonable for a simple prompt I took from the original LLaMA examples:
SQL code to create a table, that will keep CD albums data, such as album name and track\n\\begin{code}\n
But LLaMA.cpp returns just some weird text with any model size (7B, 13B, 30B quantised down to 4-bit).
What could be the reason?