Commit Graph

  • 2ada1571f6
    Update docs/faq.md Jeffrey Morgan 2024-09-02 15:30:35 -04:00
  • 50a64aa99a Fix sprintf to snprintf fellowtraveler 2024-07-12 23:51:16 -05:00
  • e2b0e99843
    Merge branch 'ollama:main' into main Jonathan Hecl 2024-09-02 00:06:31 -03:00
  • 8bb9ad59bb
    Update faq.md SnoopyTlion 2024-09-02 09:37:34 +08:00
  • 6bdb00f63b
    Merge 1de766b9cf79c28610d2ec7e929cbfad25664791 into 5f7b4a5e3056d083997b744029c30614cd32397b Aarushi 2024-09-01 16:00:35 -04:00
  • cf2fb5e0ea
    docs: add tokenize and detokenize api Yurzs 2024-09-02 00:06:36 +07:00
  • 19a388bfb8
    api: expose tokenize and detokenize endpoints Yurzs 2024-09-01 23:35:58 +07:00
  • c074d3d5ad
    Add serve step to quickstart Anita Graser 2024-09-01 11:11:08 +02:00
  • 5f7b4a5e30
    fix(cmd): show info may have nil ModelInfo (#6579) Vimal Kumar 2024-09-01 09:42:17 +05:30
  • a0f6aae6e9
    Update README.md Jonathan Hecl 2024-09-01 00:53:15 -03:00
  • 1aad838707
    docs: update GGUF examples and references (#6577) rayfiyo 2024-09-01 11:34:25 +09:00
  • a1cef4d0a5
    Add findutils to base images (#6581) v0.3.9 Daniel Hiltgen 2024-08-31 10:40:05 -07:00
  • 532e74bbcc Add findutils to base images Daniel Hiltgen 2024-08-31 10:32:15 -07:00
  • 778d827d6f fix(cmd): show info may have nil ModelInfo Vimal Kumar 2024-08-31 20:06:27 +05:30
  • 318f0e7c9f
    docs: update GGUF examples and references rayfiyo 2024-08-31 22:32:20 +09:00
  • 1bab560415
    Merge e21b8aa4a469ba0e9d966f663de18f951710e25f into c41f0b9e6c09c3751ff4f10aff78e9da752c5f28 Marcin Szczygliński 2024-08-31 19:57:17 +08:00
  • c41f0b9e6c
    Merge pull request #6562 from ollama/mxyng/build-artifacts Michael Yang 2024-08-30 09:40:50 -07:00
  • 142cbb722d
    Merge pull request #6482 from ollama/mxyng/client-path Michael Yang 2024-08-30 09:40:34 -07:00
  • 9468c6824a
    Merge pull request #6534 from ollama/mxyng/messages Michael Yang 2024-08-30 09:39:59 -07:00
  • d0f34ac8d2
    Merge fbeaa39c630c56c7f62af0eaf1f695c727f55533 into 56346ccfa3e51eec51fc26ae8e91fc88cb74a9b8 Michael Vorburger 2024-08-30 07:48:53 +02:00
  • 9b636e9fbb
    Merge 69207b4987cd4b36e2168bed7d6137879e3d8efb into 56346ccfa3e51eec51fc26ae8e91fc88cb74a9b8 Michael Yang 2024-08-30 11:04:55 +10:00
  • 44f5603e57
    llama: delete unused files (#6523) Jeffrey Morgan 2024-08-29 17:30:11 -07:00
  • faf1a6ac5a update push to use model.Name mxyng/modelname-7 Michael Yang 2024-05-08 17:34:54 -07:00
  • 2f4f270957
    Merge 10c467c0c5d80ecbbe93fc82936df34a094f0dd5 into 56346ccfa3e51eec51fc26ae8e91fc88cb74a9b8 Lei Jitang 2024-08-30 10:12:55 +12:00
  • 11018196e0 remove any unneeded build artifacts Michael Yang 2024-08-29 13:40:43 -07:00
  • e513d8c740
    Merge 6761aca1e12f0a354caaedc9b07d28aeca619f38 into 9a394f936a2a85a5edf83619349b09cc2c1dc044 Michael Yang 2024-08-29 22:30:21 +05:30
  • 9a394f936a slog gin logging Michael Yang 2024-02-08 11:05:16 -08:00
  • 56346ccfa3
    doc: Add Nix and Flox to package manager listing (#6074) Bryan Honof 2024-08-29 18:45:35 +02:00
  • 1e5981ac23 Reduce docker image size Xiaodong Ye 2024-08-15 09:13:37 +08:00
  • 8e4e509fa4
    update the openai docs to explain how to set the context size (#6548) Patrick Devine 2024-08-28 17:11:46 -07:00
  • abb13b2e8d update the openai docs to explain how to set the context size Patrick Devine 2024-08-28 17:08:19 -07:00
  • 6de85f5c00 slog gin logging mxyng/gin-slog Michael Yang 2024-02-08 11:05:16 -08:00
  • dc08a27d54 remove merges jyan/convert-cmdr Josh Yan 2024-08-28 16:01:13 -07:00
  • cf8af774ab renaming Josh Yan 2024-08-28 15:44:21 -07:00
  • c41bbb45bd linter Josh Yan 2024-08-28 15:40:36 -07:00
  • d073220b65 rebased Josh Yan 2024-08-28 15:39:24 -07:00
  • 47c2b947a9
    Merge pull request #6546 from ollama/mxyng/fix-test Michael Yang 2024-08-28 15:37:47 -07:00
  • c6cb05a43b
    Merge 745706c76529a3c4d6ab6244921e85af34ffca4c into 6761aca1e12f0a354caaedc9b07d28aeca619f38 Michael Yang 2024-08-28 21:44:54 +00:00
  • 745706c765 refactor layer pruning mxyng/modelname-6 Michael Yang 2024-08-28 13:13:02 -07:00
  • 5eb77bf976
    Merge pull request #6539 from ollama/mxyng/validate-modelpath Michael Yang 2024-08-28 14:38:27 -07:00
  • e4d0a9c325 fix(test): do not clobber models directory Michael Yang 2024-08-28 14:07:48 -07:00
  • 7416ced70f
    add llama3.1 chat template (#6545) Patrick Devine 2024-08-28 14:03:20 -07:00
  • 6761aca1e1 update pull handler to use model.Name mxyng/modelname-5 Michael Yang 2024-08-28 13:06:41 -07:00
  • 6a809a726a add llama3.1 chat template Patrick Devine 2024-08-28 13:29:01 -07:00
  • 3e24edd9ed update push to use model.Name Michael Yang 2024-05-08 17:34:54 -07:00
  • 0e1ec461f9 import jyan/convert-prog Josh Yan 2024-08-28 11:18:23 -07:00
  • 52ef79bb7d last lint (hopefully) Josh Yan 2024-08-28 11:12:39 -07:00
  • 800edd7884 lint again Josh Yan 2024-08-28 11:10:03 -07:00
  • 01b20fe6f1 lint Josh Yan 2024-08-28 11:07:43 -07:00
  • 9cfd2dd3e3
    Merge pull request #6522 from ollama/mxyng/detect-chat Michael Yang 2024-08-28 11:04:18 -07:00
  • 340162fbc3 convert progress Josh Yan 2024-08-28 10:54:52 -07:00
  • 4da5d5beaa lint jyan/quant5 Josh Yan 2024-08-28 10:23:41 -07:00
  • cc17b02b23 update Josh Yan 2024-08-28 09:58:23 -07:00
  • 8e6da3cbc5 update deprecated warnings Michael Yang 2024-08-27 17:57:34 -07:00
  • d9d50c43cc validate model path Michael Yang 2024-08-27 17:56:04 -07:00
  • d33db8f7a9
    Update README.md RAPID ARCHITECT 2024-08-28 08:00:00 -05:00
  • f1aa5c7565 add support for choose visible ascend devices zhongtao 2024-08-28 02:27:27 +14:00
  • 73bc83337c
    Merge f436aabc9d8aa4e735c01122f7a3946ec7b3ceb1 into 6c1c1ad6a90e8fe23d63d2c431745e48e3fe9d81 Emir Sahin 2024-08-28 01:25:14 -04:00
  • 6c1c1ad6a9
    throw an error when encountering unsupport tensor sizes (#6538) Patrick Devine 2024-08-27 17:54:04 -07:00
  • c0302b05c0 feed the linter Patrick Devine 2024-08-27 17:34:34 -07:00
  • 48c304571f runner.go: Update TODOs Jesse Gross 2024-08-27 17:05:23 -07:00
  • 11e5a51308 throw an error when encountering unsupport tensor sizes Patrick Devine 2024-08-27 17:19:55 -07:00
  • 46df2b7c6d
    Merge pull request #6536 from ollama/jessegross/goserver-fixes Jesse Gross 2024-08-27 16:49:12 -07:00
  • 73d69bc90b remove types Josh Yan 2024-08-12 09:47:05 -07:00
  • 9bc42f532b rmv api type Josh Yan 2024-08-12 09:45:44 -07:00
  • 07c0f66f5e rm print Josh Yan 2024-08-01 16:49:03 -07:00
  • 4a7bfca902 change progress msg Josh Yan 2024-07-31 10:57:25 -07:00
  • 04f2154505 fixed cgo Josh Yan 2024-07-31 10:52:11 -07:00
  • de9b21b472 quantize progress Josh Yan 2024-07-31 10:48:18 -07:00
  • a0daca0f98
    Update openai_test.go Yaroslav 2024-08-28 01:39:13 +02:00
  • 71a692a25f runner.go: Fix embeddings endpoint Jesse Gross 2024-08-27 13:59:33 -07:00
  • d5d540b71a runner.go: Health endpoint comments Jesse Gross 2024-08-27 13:17:04 -07:00
  • 32b4eb2db9 runner.go: Cleanups Jesse Gross 2024-08-27 13:19:46 -07:00
  • 93ea9240ae
    Move ollama executable out of bin dir (#6535) v0.3.8 Daniel Hiltgen 2024-08-27 16:19:00 -07:00
  • 413ae39f3c update templates to use messages Michael Yang 2024-08-27 11:34:30 -07:00
  • 80ce0eebec Move ollama executable out of bin dir Daniel Hiltgen 2024-08-27 15:38:21 -07:00
  • cc142f259a Refactor payload logic and add buildx support for faster builds Daniel Hiltgen 2024-08-26 14:31:45 -07:00
  • 1e486dbc7d Optimize container images for startup Daniel Hiltgen 2024-08-23 17:40:20 -07:00
  • 60e47573a6 more tokenizer tests Michael Yang 2024-08-27 11:11:53 -07:00
  • d13c3daa0b
    add safetensors to the modelfile docs (#6532) Patrick Devine 2024-08-27 14:46:47 -07:00
  • 4c659c6b14 add safetensors to the modelfile docs Patrick Devine 2024-08-27 14:40:32 -07:00
  • 1713eddcd0
    Fix import image width (#6528) Patrick Devine 2024-08-27 14:19:47 -07:00
  • 4e1c4f6e0b
    Update manual instructions with discrete ROCm bundle (#6445) Daniel Hiltgen 2024-08-27 13:42:28 -07:00
  • 397cae7962
    llm: fix typo in comment (#6530) Sean Khatiri 2024-08-27 16:28:29 -04:00
  • 76ddbb8282 fix: comment typo seankhatiri 2024-08-27 15:05:15 -04:00
  • a6d30ecefe working causal attention paligemma-support Josh Yan 2024-08-27 11:34:32 -07:00
  • 5516515e9b fix the toc link Patrick Devine 2024-08-27 11:28:22 -07:00
  • ed731f1d76 fixup Patrick Devine 2024-08-27 11:24:26 -07:00
  • 3893f1f2ae fix widths Patrick Devine 2024-08-27 11:20:47 -07:00
  • 1c70a00f71 adjust image sizes Patrick Devine 2024-08-27 11:15:25 -07:00
  • eae3af6807 clean up convert tokenizer Michael Yang 2024-08-27 10:45:39 -07:00
  • 3eb08377f8 detect chat template from configs that contain lists Michael Yang 2024-08-26 16:36:50 -07:00
  • 4a7ac5bb2b
    Merge pull request #6521 from ollama/jessegross/goserver-fixes Jesse Gross 2024-08-27 10:49:12 -07:00
  • 80eef7c7b1 changes Josh Yan 2024-08-27 10:47:13 -07:00
  • aa0a49975f runner.go: Move pieces[] into sequence Jesse Gross 2024-08-27 10:24:33 -07:00
  • 3f4d203cf4 bugfix and improve for npu support zhongtao 2024-08-06 16:43:29 +14:00
  • d3f2eb0037 add ascend npu support zhongTao99 2024-07-24 01:57:01 +14:00
  • cb576a6b23 fix ref pdevine/import-docs Patrick Devine 2024-08-26 19:59:33 -07:00
  • ac80010db8
    update the import docs (#6104) Patrick Devine 2024-08-26 19:57:26 -07:00
  • 15b7ff3a89 more comments Patrick Devine 2024-08-26 19:56:45 -07:00