ollama/llm at 80ee9b5e47fc0ea99d1f3f33224923266627c15c

third-party-mirrors/ollama

Jeffrey Morgan 5534f2cc6a

llm: consider head_dim in llama arch (#5817)

2024-07-20 21:48:12 -04:00

Introduce /api/embed endpoint supporting batch embedding (#5127)

2024-07-15 12:14:24 -07:00

Adjust windows ROCm discovery

2024-07-20 15:17:50 -07:00

llama.cpp @ a8db2a9ce6

Update llama.cpp submodule to a8db2a9c (#5530)

2024-07-07 13:03:09 -04:00

llm: consider head_dim in llama arch (#5817)

2024-07-20 21:48:12 -04:00

filetype.go

Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322)

2024-05-23 13:21:49 -07:00

ggla.go

llm: speed up gguf decoding by a lot (#5246)

2024-06-24 21:47:52 -07:00

ggml_test.go

llm: speed up gguf decoding by a lot (#5246)

2024-06-24 21:47:52 -07:00

ggml.go

chatglm graph

2024-07-10 13:43:47 -07:00

gguf.go

add chat and generate tests with mock runner

2024-07-16 09:39:31 -07:00

llm_darwin_amd64.go

…

llm_darwin_arm64.go

…

llm_linux.go

…

llm_windows.go

…

llm.go

fix: quant err message (#5616)

2024-07-11 17:24:29 -07:00

memory_test.go

llm: speed up gguf decoding by a lot (#5246)

2024-06-24 21:47:52 -07:00

memory.go

handle asymmetric embedding KVs

2024-06-20 09:57:27 -07:00

payload.go

Fix corner cases on tmp cleaner on mac

2024-07-03 13:10:14 -07:00

server.go

Adjust windows ROCm discovery

2024-07-20 15:17:50 -07:00

status.go

fix error detection by limiting model loading error parsing (#5472)

2024-07-03 20:04:30 -04:00