Commit Graph

  • 51214ddef5 Update docs/openai.md royjhan 2024-07-08 14:46:05 -07:00
  • b950d749a9 Update docs/openai.md royjhan 2024-07-08 14:44:16 -07:00
  • 3702ed7532 token bug corrected Roy Han 2024-07-03 15:03:54 -07:00
  • 6266603b17 Update docs Roy Han 2024-06-26 14:30:28 -07:00
  • c279f96371 remove ToolCall from GenerateResponse Michael Yang 2024-07-16 14:51:19 -07:00
  • 23d5beeb9c input royh-imgembed Roy Han 2024-07-16 15:19:13 -07:00
  • 499e87c9ba Merge pull request #5730 from ollama/mxyng/cleanup Michael Yang 2024-07-16 14:42:13 -07:00
  • cd0853f2d5 Merge pull request #5207 from ollama/mxyng/suffix Michael Yang 2024-07-16 14:37:32 -07:00
  • d290e87513 add suffix support to generate endpoint Michael Yang 2024-06-20 19:13:36 -07:00
  • 97c20ede33 README: Added AI Studio to the list of UIs (#5721) Thorsten Sommer 2024-07-16 23:24:27 +02:00
  • 703ecccc6b clean Josh Yan 2024-07-16 14:17:44 -07:00
  • 873f334783 IT WORKS Josh Yan 2024-07-16 14:12:07 -07:00
  • 5a83f79afd remove unneeded tool calls Michael Yang 2024-07-16 13:48:38 -07:00
  • 987dbab0b0 OpenAI: /v1/embeddings compatibility (#5285) royjhan 2024-07-16 13:36:08 -07:00
  • fa49bfc0bd FIXED TESTS Josh Yan 2024-07-16 12:14:10 -07:00
  • a8388beb94 Merge pull request #5726 from ollama/mxyng/tools-templates Michael Yang 2024-07-16 12:12:10 -07:00
  • 5afbb60fc4 fix unmarshal type errors Michael Yang 2024-07-16 09:38:46 -07:00
  • fc1b3ee9bf test Josh Yan 2024-07-16 11:21:13 -07:00
  • 4cb5d7decc server: omit model system prompt if empty (#5717) Jeffrey Morgan 2024-07-16 11:09:00 -07:00
  • 8eac50dd4f Merge pull request #5684 from ollama/mxyng/tests Michael Yang 2024-07-16 09:44:45 -07:00
  • 4a565cbf94 add chat and generate tests with mock runner Michael Yang 2024-07-13 17:46:24 -07:00
  • 64039df6d7 Merge pull request #5284 from ollama/mxyng/tools Michael Yang 2024-07-15 18:03:37 -07:00
  • 7ac6d462ec server: return empty slice on empty /api/embed request (#5713) Jeffrey Morgan 2024-07-15 17:39:44 -07:00
  • ef5136a745 tools test Michael Yang 2024-07-15 12:17:38 -07:00
  • 8288ec8824 Merge pull request #5710 from dhiltgen/rocm_bump Daniel Hiltgen 2024-07-15 15:32:18 -07:00
  • d02bbebb11 tools Michael Yang 2024-06-20 13:45:47 -07:00
  • 224337b32f Bump linux ROCm to 6.1.2 Daniel Hiltgen 2024-07-15 15:10:22 -07:00
  • 25be20949c test Josh Yan 2024-07-15 15:08:24 -07:00
  • 9e35d9bbee server: lowercase roles for compatibility with clients (#5695) Jeffrey Morgan 2024-07-15 13:55:57 -07:00
  • b9f5e16c80 Introduce /api/embed endpoint supporting batch embedding (#5127) royjhan 2024-07-15 12:14:24 -07:00
  • eb7cc2d1ce image embeddings Roy Han 2024-07-15 12:13:06 -07:00
  • 903e9df46f test Josh Yan 2024-07-15 11:46:49 -07:00
  • 907b038ff0 reduce error footprint royh-embed-parallel Roy Han 2024-07-15 10:57:01 -07:00
  • 8476ef2bd8 atomic for race jyan/quant4 Josh Yan 2024-07-15 10:44:35 -07:00
  • 766ca1cd7d clean up Roy Han 2024-07-15 09:54:19 -07:00
  • 40c0f9612e unneccesary Josh Yan 2024-07-14 18:41:16 -07:00
  • e9f7f36029 Support image input for OpenAI chat compatibility (#5208) royjhan 2024-07-13 22:07:45 -07:00
  • 057d31861e remove template (#5655) Patrick Devine 2024-07-13 20:56:24 -07:00
  • f7ee012300 server: prepend system message in chat handler v0.2.5 jmorganca 2024-07-13 15:08:00 -07:00
  • 1ed0aa8fea server: fix context, load_duration and total_duration fields (#5676) v0.2.4 Jeffrey Morgan 2024-07-13 09:25:31 -07:00
  • ef98803d63 llm: looser checks for minimum memory (#5677) Jeffrey Morgan 2024-07-13 09:20:05 -07:00
  • 02fea420e5 Add Kerlig AI, an app for macOS (#5675) Jarek 2024-07-13 17:33:46 +02:00
  • 22c5451fc2 fix system prompt (#5662) v0.2.3 Michael Yang 2024-07-12 21:04:44 -07:00
  • f08ac71a3b more clean up Roy Han 2024-07-12 17:30:30 -07:00
  • 424f3f81a9 clean up Roy Han 2024-07-12 17:28:08 -07:00
  • 15a0215203 running Josh Yan 2024-07-12 16:49:57 -07:00
  • 1f73889f34 Merge branch 'royh-batchembed' into royh-embed-parallel royjhan 2024-07-12 16:44:12 -07:00
  • 7e313e5964 remove redundant error check Roy Han 2024-07-12 16:37:29 -07:00
  • 5a8f8e96e0 clean up Roy Han 2024-07-12 16:35:25 -07:00
  • 7cddd6d741 parallelized Roy Han 2024-07-12 16:08:12 -07:00
  • ebc529cbb3 autodetect stop parameters from template Michael Yang 2024-07-05 17:31:23 -07:00
  • 23ebbaa46e Revert "remove template from tests" Patrick Devine 2024-07-12 15:47:17 -07:00
  • 9ac0a7a50b remove template from tests Patrick Devine 2024-07-12 15:41:31 -07:00
  • faa3c937cf writeto Josh Yan 2024-07-12 15:37:27 -07:00
  • e210f8763f merge conflicts royh-precision Roy Han 2024-07-12 15:09:05 -07:00
  • 3971c2333f Merge branch 'main' into royh-precision royjhan 2024-07-12 15:07:36 -07:00
  • 1f3aefd323 remove function closure Roy Han 2024-07-12 14:45:16 -07:00
  • 2d7048f410 Revert "remove function closure" Roy Han 2024-07-12 14:40:40 -07:00
  • 55d48c6ed1 remove function closure Roy Han 2024-07-12 14:35:43 -07:00
  • cf57246aba write Josh Yan 2024-07-12 12:59:51 -07:00
  • 6fafe4f753 gguf Josh Yan 2024-07-12 12:58:00 -07:00
  • e5c65a85df Merge pull request #5653 from ollama/mxyng/collect-system v0.2.2 Michael Yang 2024-07-12 12:32:34 -07:00
  • 33627331a3 app: also clean up tempdir runners on install (#5646) Jeffrey Morgan 2024-07-12 12:29:23 -07:00
  • 36c87c433b template: preprocess message and collect system Michael Yang 2024-07-12 11:48:06 -07:00
  • d7c8d4f3f4 ggufwritekv Josh Yan 2024-07-12 12:25:13 -07:00
  • 3d0fd31f0e TensorWriter Josh Yan 2024-07-12 12:18:46 -07:00
  • 4c9a160a08 race Josh Yan 2024-07-12 11:52:10 -07:00
  • c0b5bf0a36 testing clean up Roy Han 2024-07-12 11:45:45 -07:00
  • 554f3bdc0e interface Josh Yan 2024-07-12 10:02:58 -07:00
  • e75fb73839 types Josh Yan 2024-07-12 09:42:10 -07:00
  • 179737feb7 Clean up old files when installing on Windows (#5645) Jeffrey Morgan 2024-07-11 22:53:46 -07:00
  • 53e9576f46 testing clean up Roy Han 2024-07-11 20:20:14 -07:00
  • 2fdebffc8d sawp Josh Yan 2024-07-11 18:18:26 -07:00
  • 29ecfe493b write Josh Yan 2024-07-11 17:56:51 -07:00
  • 47353f5ee4 Merge pull request #5639 from ollama/mxyng/unaggregated-system Michael Yang 2024-07-11 17:48:50 -07:00
  • dbe9527305 clean up Roy Han 2024-07-11 17:28:55 -07:00
  • 10e768826c fix: quant err message (#5616) Josh 2024-07-11 17:24:29 -07:00
  • b6c7d01af3 more cmt rmv jyan/progress Josh Yan 2024-07-11 17:21:36 -07:00
  • 9d517cf556 rm comment Josh Yan 2024-07-11 17:20:09 -07:00
  • 5056bb9c01 rename aggregate to contents Michael Yang 2024-07-11 16:06:57 -07:00
  • c4cf8ad559 llm: avoid loading model if system memory is too small (#5637) Jeffrey Morgan 2024-07-11 16:42:57 -07:00
  • 57ec6901eb revert embedded templates to use prompt/response Michael Yang 2024-07-11 13:11:40 -07:00
  • e64f9ebb44 do no automatically aggregate system messages Michael Yang 2024-07-11 13:10:13 -07:00
  • 657a1102fc lint complained Josh Yan 2024-07-11 09:26:17 -07:00
  • 791650ddef sched: only error when over-allocating system memory (#5626) v0.2.2-rc2 Jeffrey Morgan 2024-07-11 00:53:12 -07:00
  • efbf41ed81 llm: dont link cuda with compat libs (#5621) v0.2.2-rc1 Jeffrey Morgan 2024-07-10 20:01:52 -07:00
  • cf15589851 Merge pull request #5620 from ollama/mxyng/templates Michael Yang 2024-07-10 17:16:24 -07:00
  • 19753c18c0 update embedded templates Michael Yang 2024-07-10 11:00:29 -07:00
  • 41be28096a add system prompt to first legacy template Michael Yang 2024-07-10 11:00:07 -07:00
  • 694388db90 set context length Roy Han 2024-07-10 15:21:46 -07:00
  • d352c68ffc move llama.h Josh Yan 2024-07-10 14:20:04 -07:00
  • 37a570f962 Merge pull request #5612 from ollama/mxyng/mem Michael Yang 2024-07-10 14:18:33 -07:00
  • 3c0d043b79 pass the template to the /api/chat endpoint pdevine/fix-template Patrick Devine 2024-07-10 14:17:39 -07:00
  • d82d25d70c patched Josh Yan 2024-07-10 13:57:28 -07:00
  • 60be9e2840 patch Josh Yan 2024-07-10 13:46:38 -07:00
  • 5a739ff4cb chatglm graph Michael Yang 2024-07-10 13:18:04 -07:00
  • a083852eb5 quantize progress Josh Yan 2024-07-10 13:21:22 -07:00
  • 4e262eb2a8 remove GGML_CUDA_FORCE_MMQ=on from build (#5588) Jeffrey Morgan 2024-07-10 13:17:13 -07:00
  • a548eb6003 a8db2a9 jyan/quant3 Josh Yan 2024-07-10 13:10:58 -07:00
  • f92818d90d patch again Josh Yan 2024-07-10 13:06:40 -07:00