Commit Graph

  • 2d25199c7f rename server to runner jmorganca 2024-05-19 00:13:30 -04:00
  • 21c9faac20 Update README.md Jeffrey Morgan 2024-05-18 19:50:23 -07:00
  • a76728e41e Update README.md Jeffrey Morgan 2024-05-18 19:49:43 -07:00
  • d0b581926a Update README.md Jeffrey Morgan 2024-05-18 19:47:19 -07:00
  • 3328470557 Update README.md Jeffrey Morgan 2024-05-18 19:46:44 -07:00
  • 904f8f4ec7 Add missing hipcc flags jmorganca 2024-05-18 23:07:19 -04:00
  • 15eaa576e5 fix .gitattributes jmorganca 2024-05-18 22:39:41 -04:00
  • ca5efef7c6 Initial llama Go module jmorganca 2024-04-20 20:44:01 -04:00
  • 1611e2ac9f add sync of llama.cpp jmorganca 2024-04-20 18:08:09 -04:00
  • 4257809f33
    Merge branch 'ollama:main' into feature/kv-quant Sam 2024-08-07 09:33:20 +12:00
  • 1abf1cf351 Report GPU variant in log Daniel Hiltgen 2024-06-19 09:36:30 -07:00
  • e8923a3b52 Add Jetson cuda variants for arm Daniel Hiltgen 2024-05-30 21:54:07 -07:00
  • e04c7012c2
    update llama.cpp submodule to 1e6f6554 (#6208) Jeffrey Morgan 2024-08-06 15:11:45 -04:00
  • c818cd5834 Refactor linux packaging Daniel Hiltgen 2024-07-08 12:50:11 -07:00
  • 9e75f17e19 refactor: clean up gpu/amd_hip_*.go evshiron 2024-08-07 02:40:15 +08:00
  • d4a7216c82
    Fixed invalid option provided not displaying the invalid option name problem. (#6202) Chua Chee Seng 2024-08-07 02:37:16 +08:00
  • f386c41911 update llama.cpp submodule to 1e6f6554 jmorganca 2024-08-06 14:07:19 -04:00
  • a4fdd03c3b
    Merge pull request #6207 from dhiltgen/sparse_win Daniel Hiltgen 2024-08-06 11:06:06 -07:00
  • fc85f50a2b Ensure sparse files on windows during download Daniel Hiltgen 2024-08-06 10:46:31 -07:00
  • 81329a95d0 Fixed invalid option provided not displaying the invalid option name problem. Chua Chee Seng 2024-08-06 22:10:31 +08:00
  • 36b63efd00 feat: add support for running ollama on rocm in wsl evshiron 2024-08-06 21:34:59 +08:00
  • 3b32812019 resolve conflicts from upstream Sam McLeod 2024-08-06 15:16:25 +10:00
  • f2dcf42914 resolve conflicts from upstream Sam McLeod 2024-08-06 12:39:49 +10:00
  • 67ef34a994
    Merge branch 'main' into feature/kv-quant Sam 2024-08-06 14:36:15 +12:00
  • 86b907f82a
    sort batch results (#6189) royjhan 2024-08-05 19:55:34 -04:00
  • 10d49bce70
    Merge pull request #6190 from ollama/mxyng/fix-integration Michael Yang 2024-08-05 16:45:49 -07:00
  • 7ed367419e fix concurrency test Michael Yang 2024-08-05 16:34:54 -07:00
  • c05e030f19 sort batch results Roy Han 2024-08-05 15:57:17 -07:00
  • 50ee8b5f56
    Merge pull request #6186 from dhiltgen/numa Daniel Hiltgen 2024-08-05 15:20:06 -07:00
  • 1477d629e9
    Allow singular array for CompletionRequest prompt field Igor Drozdov 2024-08-05 23:37:58 +02:00
  • 03bdac0595
    Merge pull request #6146 from ollama/mxyng/testing Michael Yang 2024-08-05 13:00:05 -07:00
  • f457d63400 Implement linux NUMA detection Daniel Hiltgen 2024-08-05 12:56:20 -07:00
  • ba9a6fe80e Create systemd socket file in install script. Nicholas Schwab 2024-08-05 20:03:39 +02:00
  • 462aa5f800 Enable server to be activated via systemd sockets. Nicholas Schwab 2024-08-05 19:50:42 +02:00
  • e30825aa25 feat(server): remove hardcode gin mode haunt98 2024-07-29 01:28:37 +07:00
  • ae66894614
    Merge branch 'ollama:main' into cors frob 2024-08-05 19:02:38 +02:00
  • c38592c3e4 lint Roy Han 2024-08-05 09:56:50 -07:00
  • f944783e88
    Merge branch 'main' into royh-openai-testing royjhan 2024-08-05 12:50:24 -04:00
  • 04210aa6dd Catch one more error log Daniel Hiltgen 2024-08-05 09:28:07 -07:00
  • 66c894f4d2
    Merge branch 'ollama:main' into feature/kv-quant Sam 2024-08-05 23:25:06 +12:00
  • c59c2caf9d
    Merge 3b100a91f838ec404e10ee7c5f9948403ce211f5 into 39f2bc6bfccef2d82aa1d7e8a7aecf0e14bc126c NormalFish 2024-08-05 09:52:48 +02:00
  • 43f9d92008 close pid file Michael Yang 2024-08-05 00:34:09 -07:00
  • ed6c8bfe57 removeall to remove non-empty temp dirs Michael Yang 2024-08-05 00:02:47 -07:00
  • 39f2bc6bfc
    Merge pull request #6167 from ollama/mxyng/line-feed Michael Yang 2024-08-05 00:06:28 -07:00
  • 001a52cb0a cmd: better version info when client/server not equal alwqx 2024-08-05 14:27:08 +08:00
  • b73b0940ef
    Disable paging for journalctl (#6154) frob 2024-08-05 06:10:53 +02:00
  • 6a07344786 line feed Michael Yang 2024-08-04 17:25:33 -07:00
  • 0c06132239
    Update gpu.go to support older amdgpu Vishal Rao 2024-08-05 00:12:21 +05:30
  • 8b920f35a4
    Add Gemma 2 2b (#6151) sryu1 2024-08-05 00:58:39 +10:00
  • 141a7c1ff9 align with new linting settings upstream Sam McLeod 2024-08-04 08:47:57 +10:00
  • c401a227e1
    Disable paging for journalctl frob 2024-08-03 19:33:01 +02:00
  • 43f81399da
    Add Gemma 2 2b sryu1 2024-08-03 21:57:09 +10:00
  • 7c3098e8c1 Merge branch 'main' into feature/cors-chrome-extension potato-digua 2024-08-03 14:59:36 +08:00
  • 44db8454e3 feat: api allow moz-extension and safari-web-extension origin potato-digua 2024-08-03 14:01:54 +08:00
  • 33f94ca1da
    Merge branch 'ollama:main' into feature/kv-quant Sam 2024-08-03 17:38:13 +12:00
  • 90c84adcf4 server/download.go: Avoid using magic number Jitang Lei 2024-08-03 09:34:14 +08:00
  • 437154501b server/download.go: Fix downloading retry issues Jitang Lei 2024-08-03 09:31:42 +08:00
  • 4221e39867
    Reference ollama integration with Harbor (#6147) Ivan Charapanau 2024-08-03 02:03:46 +02:00
  • 0b3060c2dc
    Reference ollama integration with Harbor Ivan Charapanau 2024-08-03 01:58:53 +02:00
  • f4a3b1e164
    Merge d9dd76e4e7efcde568e3e24904edebdadd8a2a28 into 77ccbf04dc8d3854dc2c0aafe7d1d03a50fe81a0 Kevin Thomas 2024-08-03 01:36:28 +02:00
  • a091fadfda use testing tempdirs Michael Yang 2024-08-02 15:55:34 -07:00
  • 77ccbf04dc
    Merge pull request #6128 from ollama/mxyng/lint Michael Yang 2024-08-02 14:58:40 -07:00
  • 4addf6b587
    Update OpenAI Compatibility Docs with /v1/completions (#5311) royjhan 2024-08-02 16:16:23 -04:00
  • d812d88a53 merge conflicts Roy Han 2024-08-02 13:15:42 -07:00
  • 9003290259
    Merge branch 'main' into royh-completions-docs royjhan 2024-08-02 16:13:33 -04:00
  • dbfdea6550 merge conflicts Roy Han 2024-08-02 13:09:58 -07:00
  • 85c7f11170
    Update docs (#5310) royjhan 2024-08-02 16:05:57 -04:00
  • e1e0d59a16
    Merge branch 'ollama:main' into cors frob 2024-08-02 20:53:20 +02:00
  • 6ec19682ea
    Merge branch 'main' into royh-completions-docs royjhan 2024-08-02 14:09:24 -04:00
  • 525de066cb
    Merge branch 'main' into royh-vision-docs royjhan 2024-08-02 13:24:52 -04:00
  • b9b5e58dd1
    Merge 97d5582c24e16c844af9b54f1f415eb2a51e563b into ce1fb4447efc9958dcf279f7eb2ae6941bec1220 Michael Yang 2024-08-02 04:30:27 -07:00
  • 8c0fa97f35
    Merge 695d7fea2231446a09c19032351271e7287937e2 into ce1fb4447efc9958dcf279f7eb2ae6941bec1220 Daniel Hiltgen 2024-08-02 04:23:03 -07:00
  • 3258ca7392 feat: support num_ctx for /v1/chat/completions chen 2024-08-02 13:14:13 +08:00
  • 1972b7962c Update api.md farwish 2024-08-02 11:37:04 +08:00
  • df3802a65f Adjust arm cuda repo paths Daniel Hiltgen 2024-08-01 17:22:25 -07:00
  • b732beba6a lint Michael Yang 2024-08-01 14:52:15 -07:00
  • ce1fb4447e
    Fix models/{model} URL (#6132) v0.3.3 Kim Hallberg 2024-08-02 01:31:47 +02:00
  • 3023f26096
    Fix models/{model} URL Kim Hallberg 2024-08-02 01:04:08 +02:00
  • 558a54b098
    Update OpenAI Compatibility Docs with /v1/embeddings (#5470) royjhan 2024-08-01 19:00:29 -04:00
  • 4d0b3cdbcf
    Merge branch 'main' into royh-openai-embed-docs royjhan 2024-08-01 18:59:54 -04:00
  • ed52833bb1
    Add to docs (#5309) royjhan 2024-08-01 18:58:13 -04:00
  • b2827499ea
    Merge branch 'main' into royh-retrieve-docs royjhan 2024-08-01 18:57:34 -04:00
  • e584f14e78
    Merge pull request #6123 from dhiltgen/go_server_unified Daniel Hiltgen 2024-08-01 15:51:51 -07:00
  • 6f133a0bdd
    OpenAI: Add Usage to v1/embeddings (#5886) royjhan 2024-08-01 18:49:37 -04:00
  • f561eecfb8
    Update OpenAI Compatibility Docs with /v1/models (#5151) royjhan 2024-08-01 18:48:44 -04:00
  • fc51ffb6a1 update response testing Roy Han 2024-08-01 15:48:03 -07:00
  • ff7c9060ec
    Merge pull request #6115 from slouffka/fix-context Michael Yang 2024-08-01 15:13:59 -07:00
  • 3cc7ec4429 fix dolphin-mistral Daniel Hiltgen 2024-08-01 14:47:00 -07:00
  • 343aba9fca harden integration tests Daniel Hiltgen 2024-08-01 14:41:23 -07:00
  • 7b8ca2dc0f add q4_0_4_4/4_8/8_8 jyan/q4_4/8 Josh Yan 2024-08-01 14:31:36 -07:00
  • 973ee69c92 rm error pointer type Roy Han 2024-08-01 14:23:16 -07:00
  • 0ff42e84b0
    Merge pull request #4756 from ollama/mxyng/convert2 Michael Yang 2024-08-01 14:16:30 -07:00
  • c75d88a545
    Merge branch 'ollama:main' into feature/kv-quant Sam 2024-08-02 09:01:54 +12:00
  • 8a9f946ca7 Refactor and format code. Vyacheslav Moskalev 2024-08-02 03:50:05 +07:00
  • cde1aa9e2f Introduce GPU Overhead env var Daniel Hiltgen 2024-07-24 11:22:46 -07:00
  • 46c5f5fd9e Runtime selection of new or old runners Daniel Hiltgen 2024-08-01 08:54:44 -07:00
  • c1384c8bcc Implement timings response in Go server Daniel Hiltgen 2024-07-29 14:09:55 -07:00
  • b9db0d385c Implement timings response in Go server Daniel Hiltgen 2024-07-29 14:09:55 -07:00
  • d0e239b85f
    Merge pull request #6110 from dhiltgen/go_server_embeds Daniel Hiltgen 2024-08-01 07:59:35 -07:00
  • d9dd76e4e7
    Merge branch 'main' into main Kevin Thomas 2024-08-01 10:40:22 -04:00