Commit Graph

  • d61ef8b954 update create handler to use model.Name Michael Yang 2024-05-08 14:36:08 -07:00
  • 89d9900152
    Merge pull request #4570 from ollama/mxyng/slices Michael Yang 2024-06-04 13:27:05 -07:00
  • 4a048715b6
    local wording was confusing people Michael 2024-06-04 13:25:25 -07:00
  • 6297f85606 gofmt, goimports Michael Yang 2024-06-04 11:53:23 -07:00
  • ed56428dd7 warn on intrange, usestdlibvars Michael Yang 2024-06-04 11:51:39 -07:00
  • ad40b92b6a disable intrange Michael Yang 2024-06-04 11:35:30 -07:00
  • 8ce4032e72 more lint Michael Yang 2024-05-29 18:22:03 -07:00
  • 42660466f8 no usestdlibvars Michael Yang 2024-05-23 11:04:46 -07:00
  • e919f6811f lint windows Michael Yang 2024-05-22 09:26:45 -07:00
  • bf7edb0d5d lint linux Michael Yang 2024-05-22 09:08:01 -07:00
  • f38353d6b9 stdin.fd Michael Yang 2024-05-22 09:00:38 -07:00
  • 201d853fdf nolintlint Michael Yang 2024-05-22 08:52:00 -07:00
  • e40145a39d lint Michael Yang 2024-05-21 22:21:04 -07:00
  • c895a7d13f some gocritic Michael Yang 2024-05-21 22:07:57 -07:00
  • dad7a987ae nosprintfhostport Michael Yang 2024-05-21 21:53:44 -07:00
  • 8ffb51749f nolintlint Michael Yang 2024-05-21 21:52:20 -07:00
  • 55f6eba049 gofmt Michael Yang 2024-05-21 21:32:43 -07:00
  • 04f3c12bb7 replace x/exp/slices with slices Michael Yang 2024-05-21 21:30:52 -07:00
  • 60323e0805
    add embed model command and fix question invoke (#4766) Shubham 2024-06-04 10:50:48 +05:30
  • 5dc5a295bf added testcase jyan/format Josh Yan 2024-06-03 17:28:05 -07:00
  • e21e6b2a33 added testcase Josh Yan 2024-06-03 17:27:38 -07:00
  • a240ea3367 humanNumbers formats to 3 digits, added trillion case for future Josh Yan 2024-06-03 17:26:02 -07:00
  • d4a86102fd
    update welcome prompt in windows to llama3 (#4779) Jeffrey Morgan 2024-06-01 21:05:51 -07:00
  • 476fb8e892
    Limit GPU lib search for now (#4777) v0.1.41 Jeffrey Morgan 2024-06-01 19:24:33 -07:00
  • 829ff87bd1
    revert tokenize ffi (#4761) v0.1.40 Michael Yang 2024-05-31 18:54:21 -07:00
  • 5f0403d208 Isolated Deletions royh-testdelete Roy Han 2024-05-31 17:40:11 -07:00
  • 5a3cb1064a Clean Up Delete Tests Roy Han 2024-05-31 16:44:54 -07:00
  • f6b622c4b3
    Merge pull request #4733 from ollama/jyan/isvalidname Josh 2024-05-31 14:08:45 -07:00
  • 77487ada72 Err Check Roy Han 2024-05-31 13:12:26 -07:00
  • a946b6f020 Adjust Response and Blob Check Roy Han 2024-05-31 13:08:59 -07:00
  • c62df6b3bf Check Blob Roy Han 2024-05-31 12:07:52 -07:00
  • 2e4da8eec2 added tests for IsValidNamespace Josh Yan 2024-05-31 11:48:07 -07:00
  • e8788ae8dd Specify DNE error Roy Han 2024-05-31 09:45:47 -07:00
  • 95af97b9f3 server: try github.com/minio/sha256-simd bmizerany/fastverify Blake Mizerany 2024-05-31 00:42:09 -07:00
  • 763bb65dbb
    use int32_t for call to tokenize (#4738) v0.1.40-rc1 Jeffrey Morgan 2024-05-30 21:43:30 -07:00
  • 7ca9605f54
    speed up tests by only building static lib (#4740) Jeffrey Morgan 2024-05-30 21:43:15 -07:00
  • eb2c443a79
    Merge pull request #4736 from ollama/mxyng/vocab-only Michael Yang 2024-05-30 17:21:00 -07:00
  • 278e25ea44
    Merge pull request #4737 from ollama/mxyng/less-generate Michael Yang 2024-05-30 17:17:50 -07:00
  • a50a87a7b8
    partial offloading: allow flash attention and disable mmap (#4734) Jeffrey Morgan 2024-05-30 16:58:01 -07:00
  • 98085015d5 only generate on relevant changes Michael Yang 2024-05-22 09:58:26 -07:00
  • bf54c845e9 vocab only Michael Yang 2024-05-30 16:49:28 -07:00
  • 8774e5d6a9 Deletion Unit Test Roy Han 2024-05-30 16:44:17 -07:00
  • c365f195a8 directly use isvalidpart Josh Yan 2024-05-30 16:40:04 -07:00
  • e91d0ef737
    Merge pull request #4728 from ollama/jyan/japanese Josh 2024-05-30 16:25:12 -07:00
  • 22f5c12ced
    Update llama.cpp submodule to 5921b8f0 (#4731) Jeffrey Morgan 2024-05-30 16:20:22 -07:00
  • 298c996e54 added IsValidNamespace function Josh Yan 2024-05-30 16:02:07 -07:00
  • 0fc0cfc6d2
    Merge pull request #4594 from dhiltgen/doc_container_workarounds Daniel Hiltgen 2024-05-30 13:10:54 -07:00
  • 914f68f021 replaced duplicate call with variable Josh Yan 2024-05-30 10:38:07 -07:00
  • bd1d119ba9 fixed japanese characters deleted at end of line Josh Yan 2024-05-30 10:24:21 -07:00
  • a03be18189
    Fix OLLAMA_LLM_LIBRARY with wrong map name and add more env vars to help message (#4663) Lei Jitang 2024-05-31 00:36:51 +08:00
  • c79fd5c168 Reincluding Numbers language_support Roy Han 2024-05-29 12:22:36 -07:00
  • 96bc232b43
    Merge pull request #4413 from ollama/mxyng/name-check Michael Yang 2024-05-29 12:06:58 -07:00
  • bca7b12284
    Merge pull request #3718 from ollama/mxyng/modelname-3 Michael Yang 2024-05-29 12:02:07 -07:00
  • 32cb1960c1
    Merge pull request #4380 from ollama/mxyng/tokenize Michael Yang 2024-05-29 12:00:59 -07:00
  • 73fb9ea36e Draft for Multi-Language Modelfile Creation Roy Han 2024-05-29 11:51:57 -07:00
  • de781b37c8 rm unused infill Michael Yang 2024-05-12 09:21:35 -07:00
  • 3e21799377 rm unused system prompt Michael Yang 2024-05-12 09:20:39 -07:00
  • 26a00a0410 use ffi for tokenizing/detokenizing Michael Yang 2024-05-11 12:49:24 -07:00
  • 646371f56d
    Merge pull request #3278 from zhewang1-intc/rebase_ollama_main Daniel Hiltgen 2024-05-28 16:30:50 -07:00
  • 1f5008544b
    Update install.sh Jeffrey Morgan 2024-05-28 15:01:22 -07:00
  • 45cbfc5aee
    fix wsl2 status check for nvidia cards (#4689) Jeffrey Morgan 2024-05-28 14:49:46 -07:00
  • 6d423b383b
    Improve install experience on WSL2 and Linux (#4653) Jeffrey Morgan 2024-05-28 14:41:50 -07:00
  • ad897080a2
    working on integration of multi-byte and multi-width runes (#4549) v0.1.39 Josh 2024-05-28 12:04:03 -07:00
  • b7d316d98d
    fix nvidia detection in install script (#4683) Jeffrey Morgan 2024-05-28 09:59:36 -07:00
  • d7339fad52
    Merge pull request #4682 from dhiltgen/more_time Daniel Hiltgen 2024-05-28 09:36:02 -07:00
  • 92c81e8117 Give the final model loading more time Daniel Hiltgen 2024-05-28 08:56:18 -07:00
  • 9db0996ed4
    Add OllamaSpring Project to Readme (#4672) Tai 2024-05-28 10:58:26 +08:00
  • 6f43898b17
    Adds olpaka flutter client (#4647) Orfeo Ciano 2024-05-28 01:22:01 +01:00
  • 7487229c34
    llm/server.go: Fix 2 minor typos (#4661) Lei Jitang 2024-05-28 08:21:10 +08:00
  • 8a8e7afa96
    small fix on examples/python-simplechat/client.py to actually get a streamed response and get tokens printed as we receive it (#4671) Rayan Mostovoi 2024-05-28 02:19:20 +02:00
  • c79f8c9c39
    Ensure nvidia and nvidia_uvm kernel modules are loaded in install.sh script and at startup (#4652) Jeffrey Morgan 2024-05-26 14:57:17 -07:00
  • 485016bfbb
    Update install.sh Jeffrey Morgan 2024-05-26 11:46:00 -07:00
  • 0165ba1651
    Merge pull request #4638 from dhiltgen/better_error Daniel Hiltgen 2024-05-25 14:32:28 -07:00
  • c4209d6d21 Report better warning on client closed abort of load Daniel Hiltgen 2024-05-25 09:23:28 -07:00
  • 6adca97f37
    Merge pull request #4619 from noxer/patch-1 Michael Yang 2024-05-24 17:21:57 -07:00
  • 9a3c8003c8
    Merge pull request #4624 from ollama/mxyng/fix-5 Michael Yang 2024-05-24 16:11:21 -07:00
  • d51f15257c
    Update llm/ggml.go Michael Yang 2024-05-24 16:10:43 -07:00
  • 8f440d579a fix q5_0, q5_1 Michael Yang 2024-05-24 16:01:37 -07:00
  • 4cc3be3035
    Move envconfig and consolidate env vars (#4608) Patrick Devine 2024-05-24 14:57:15 -07:00
  • db2ffa79f1
    Fix download retry issue Tim Scheuermann 2024-05-24 20:30:42 +02:00
  • afd2b058b4
    set codesign timeout to longer (#4605) Jeffrey Morgan 2024-05-23 22:46:23 -07:00
  • fd5971be0b support ollama run on Intel GPUs Wang,Zhe 2024-05-24 11:18:27 +08:00
  • 89bf98bcf2
    Merge pull request #4598 from dhiltgen/docs Daniel Hiltgen 2024-05-23 15:14:29 -07:00
  • 1b2d156094 Tidy up developer guide a little Daniel Hiltgen 2024-05-23 14:24:07 -07:00
  • 714adb8bd1
    bump (#4597) Michael Yang 2024-05-23 14:16:26 -07:00
  • 95b1133d0c
    Merge pull request #4547 from dhiltgen/load_progress Daniel Hiltgen 2024-05-23 14:06:02 -07:00
  • b37b496a12 Wire up load progress Daniel Hiltgen 2024-05-20 16:41:43 -07:00
  • d6f692ad1a
    Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) Bruce MacDonald 2024-05-23 13:21:49 -07:00
  • f77713bf1f Add isolated gpu test to troubleshooting Daniel Hiltgen 2024-05-23 09:33:25 -07:00
  • 38255d2af1
    Use flash attention flag for now (#4580) v0.1.39-rc2 Jeffrey Morgan 2024-05-22 21:52:09 -07:00
  • 73630a7e85
    add phi 3 medium (#4578) Michael 2024-05-22 12:53:45 -04:00
  • 955c317cab
    chore: update tokenizer.go (#4571) Ikko Eltociear Ashimine 2024-05-22 16:25:23 +09:00
  • 9f18b88a06
    Merge pull request #4566 from ollama/jyan/shortcuts Josh 2024-05-21 22:49:36 -07:00
  • 353f83a9c7 add Ctrl + W shortcut Josh Yan 2024-05-21 16:55:09 -07:00
  • 3bade04e10
    doc updates for the faq/troubleshooting (#4565) Patrick Devine 2024-05-21 15:30:09 -07:00
  • a6d0f443eb
    Merge pull request #4543 from ollama/mxyng/simple-safetensors v0.1.39-rc1 Michael Yang 2024-05-21 14:43:55 -07:00
  • 96236b7968
    Merge pull request #4268 from ollama/pdevine/llama3 Michael Yang 2024-05-21 14:43:37 -07:00
  • 4434d7f447
    Correct typo in error message (#4535) Sang Park 2024-05-22 05:39:01 +09:00
  • 171eb040fc simplify safetensors reading Michael Yang 2024-05-20 09:47:01 -07:00
  • 3591bbe56f add test Michael Yang 2024-05-21 11:28:16 -07:00