Commit Graph

  • 037fb713e9 Update list local models API chyok 2024-07-04 22:51:01 +08:00
  • 58bd71b14d Add device cmd/api Xiaodong Ye 2024-07-04 19:18:19 +08:00
  • 3a0a90bb2e new backend draft Wang,Zhe 2024-07-04 16:15:17 +08:00
  • 0b01490f7a fix: Ricky Bobby 2024-07-04 00:55:50 +00:00
  • 52abc8acb7 Document older win10 terminal problems Daniel Hiltgen 2024-05-13 15:08:29 -07:00
  • 4d71c559b2 fix error detection by limiting model loading error parsing (#5472) Jeffrey Morgan 2024-07-03 20:04:30 -04:00
  • 0ea80531d9 fix error detection by limiting model loading error parsing jmorganca 2024-07-03 18:39:52 -04:00
  • 0d16eb310e fix: use envconfig.ModelsDir directly (#4821) Anatoli Babenia 2024-07-04 01:36:11 +03:00
  • 8072e205ff Merge pull request #5447 from dhiltgen/fix_keepalive Daniel Hiltgen 2024-07-03 15:34:38 -07:00
  • 955f2a4e03 Only set default keep_alive on initial model load Daniel Hiltgen 2024-07-02 15:12:43 -07:00
  • e1246a7a1c docs without usage Roy Han 2024-07-03 15:17:07 -07:00
  • 105e36765d token bug corrected Roy Han 2024-07-03 15:03:54 -07:00
  • 3c75113e37 Prevent loading models larger than total memory Daniel Hiltgen 2024-07-03 14:47:42 -07:00
  • c8eb5fe22f merge conflicts Roy Han 2024-07-03 14:18:31 -07:00
  • 0b386f6689 Merge branch 'main' into royh-vision royjhan 2024-07-03 14:16:30 -07:00
  • 6caac01494 clear comments Roy Han 2024-07-03 14:05:34 -07:00
  • 17de2b4405 Refactoring of legacy and new Roy Han 2024-07-03 14:02:25 -07:00
  • ccd7785859 Merge pull request #5243 from dhiltgen/modelfile_use_mmap Daniel Hiltgen 2024-07-03 13:59:42 -07:00
  • 3b5a4a77f3 Return Correct Prompt Eval Count Regardless of Cache Prompt (#5371) royjhan 2024-07-03 13:46:23 -07:00
  • daed0634a9 Merge pull request #5467 from dhiltgen/bogus_cpu_mac_error Daniel Hiltgen 2024-07-03 13:39:36 -07:00
  • 1353b1b99b Bubble up model load error messages Daniel Hiltgen 2024-07-03 13:27:38 -07:00
  • 0d4dd707bc Merge pull request #5465 from dhiltgen/better_cuda_logging Daniel Hiltgen 2024-07-03 13:12:22 -07:00
  • 0e982bc1f4 Fix corner cases on tmp cleaner on mac Daniel Hiltgen 2024-07-03 13:10:14 -07:00
  • 922b8f2584 input handling and handler testing Roy Han 2024-07-03 12:48:54 -07:00
  • c0fa2236cf integration float32 Roy Han 2024-07-03 12:47:57 -07:00
  • 6298f49816 Fix clip model loading with unicode paths Daniel Hiltgen 2024-07-03 12:37:40 -07:00
  • a413014aaf refactoring Roy Han 2024-07-03 11:20:55 -07:00
  • a5f23d766e Merge branch 'main' into royh-batchembed royjhan 2024-07-03 11:20:24 -07:00
  • ef757da2c9 Better nvidia GPU discovery logging Daniel Hiltgen 2024-07-03 10:30:07 -07:00
  • 95e46eeedf move normalize test Roy Han 2024-07-03 09:45:42 -07:00
  • 12f540f0ad Merge branch 'ollama:main' into patch-1 Cyril Blaecke 2024-07-03 13:36:27 +02:00
  • e7d9f098ea Update amd-igpu-780m.md alexhegit 2024-07-03 15:03:50 +08:00
  • 1e11edbcf7 Merge pull request #1 from alexhegit/alexhegit-patch-1 alexhegit 2024-07-03 14:51:58 +08:00
  • 0ceb0af8e1 Update amd-igpu-780m.md alexhegit 2024-07-03 14:50:45 +08:00
  • 38241c0067 Merge a56b01aea7d292196151a03275ddbb23a77eabc5 into e5352297d97b96101a7bd6944de420ed17ae62d3 Lei Jitang 2024-07-03 12:16:17 +08:00
  • e5352297d9 Merge pull request #5448 from ollama/mxyng/fix-generate Michael Yang 2024-07-02 16:48:06 -07:00
  • 65a5040e09 fix generate template Michael Yang 2024-07-02 16:42:17 -07:00
  • d626b99b54 OpenAI: v1/completions compatibility (#5209) royjhan 2024-07-02 16:01:45 -07:00
  • 222b7084f3 cleaning Roy Han 2024-07-02 15:48:39 -07:00
  • 03a32c5984 cleaning Roy Han 2024-07-02 15:47:33 -07:00
  • f4e5131797 Merge branch 'main' into royh-openai-docs royjhan 2024-07-02 15:09:26 -07:00
  • 3ff210b1bb merge conflicts Roy Han 2024-07-02 15:06:20 -07:00
  • 3db09b13de merge conflicts Roy Han 2024-07-02 15:05:08 -07:00
  • 64ee9001ac merge conflicts Roy Han 2024-07-02 15:04:34 -07:00
  • a8b595122c merge conflicts Roy Han 2024-07-02 15:02:46 -07:00
  • 61e0c785c8 Merge branch 'main' into royh-completions royjhan 2024-07-02 15:00:01 -07:00
  • 8928dfed50 Merge branch 'main' into royh-retrieve-docs royjhan 2024-07-02 14:54:50 -07:00
  • cb12afff24 Merge branch 'main' into royh-vision-docs royjhan 2024-07-02 14:54:01 -07:00
  • fa7be5aab4 Merge branch 'main' into royh-completions-docs royjhan 2024-07-02 14:52:56 -07:00
  • 4469418148 Switch to rocky 8 base Daniel Hiltgen 2024-07-02 12:39:22 -07:00
  • dddb58a38b Merge pull request #5051 from ollama/mxyng/capabilities Michael Yang 2024-07-02 14:26:07 -07:00
  • cd55715012 Merge branch 'main' into patch-1 Cyril Blaecke 2024-07-02 23:20:21 +02:00
  • 2921a366eb merge conflicts Roy Han 2024-07-02 14:09:14 -07:00
  • 400056e154 Merge pull request #5420 from ollama/mxyng/insecure-path Michael Yang 2024-07-02 14:03:23 -07:00
  • d2f19024d0 Merge pull request #5442 from dhiltgen/concurrency_docs Daniel Hiltgen 2024-07-02 12:47:47 -07:00
  • 69c04eecc4 Add windows radeon concurreny note Daniel Hiltgen 2024-07-02 12:46:14 -07:00
  • 2bbedd6b8f merge conflicts Roy Han 2024-07-02 12:01:52 -07:00
  • ceebcf1580 Merge branch 'main' into royh-vision royjhan 2024-07-02 11:56:50 -07:00
  • 996bb1b85e OpenAI: /v1/models and /v1/models/{model} compatibility (#5007) royjhan 2024-07-02 11:50:56 -07:00
  • 9bd9d39b42 OpenAI: /v1/models/{model} compatibility (#5028) royjhan 2024-07-02 11:40:48 -07:00
  • 422dcc3856 Merge pull request #5439 from dhiltgen/fix_centos_7_build v0.1.49-rc3 Daniel Hiltgen 2024-07-02 11:01:15 -07:00
  • 3d060e0ae9 move normalize Roy Han 2024-07-02 10:35:02 -07:00
  • 020bd60ab2 Switch amd container image base to rocky 8 Daniel Hiltgen 2024-07-02 10:23:05 -07:00
  • 00a4cb26ca use float32 Roy Han 2024-07-02 10:30:29 -07:00
  • 8e277b72bb Merge pull request #5438 from dhiltgen/fix_centos_7_build v0.1.49-rc2 Daniel Hiltgen 2024-07-02 09:28:00 -07:00
  • 4f67b39d26 Centos 7 EOL broke mirrors Daniel Hiltgen 2024-07-02 09:22:17 -07:00
  • b181b3b0b0 commit Ricky Bobby 2024-07-02 14:52:24 +00:00
  • edf0181887 commit Ricky Bobby 2024-07-02 13:34:07 +00:00
  • 022b9217aa Merge branch 'main' of https://github.com/ollama/ollama into vulkan pufferffish 2024-07-02 10:47:56 +01:00
  • 1df7ac813e Create amd-igpu-780m.md alexhegit 2024-07-02 12:03:53 +08:00
  • a914b774f9 remove erroneous subtraction of prompt cache Roy Han 2024-07-01 17:04:52 -07:00
  • 381ae2e488 Revert "openai compatibility" Roy Han 2024-07-01 17:01:20 -07:00
  • 2425281317 Merge pull request #5336 from ollama/jyan/from-errors v0.1.49-rc1 Josh 2024-07-01 16:32:46 -07:00
  • 0403e9860e Merge pull request #5421 from ollama/jyan/ver Josh 2024-07-01 16:32:14 -07:00
  • 512e0a7bde Clean up Roy Han 2024-07-01 16:29:54 -07:00
  • 1a0c8b363c Truncation Integration Tests Roy Han 2024-07-01 16:26:30 -07:00
  • 33a65e3ba3 error Josh Yan 2024-07-01 16:04:13 -07:00
  • 3803ecb6a6 cmd build context mxyng/create-context Michael Yang 2024-06-30 10:24:31 -07:00
  • b662e4706e Remove default auto from help message jyan/v0.146 Daniel Hiltgen 2024-07-01 09:48:05 -07:00
  • be31611ff1 Fix case for NumCtx Daniel Hiltgen 2024-07-01 09:43:59 -07:00
  • 02ba11b614 Document concurrent behavior and settings Daniel Hiltgen 2024-06-28 13:15:57 -07:00
  • 03bb60e036 Sort the ps output Daniel Hiltgen 2024-06-21 15:59:41 -07:00
  • 976fc86978 Disable concurrency for AMD + Windows Daniel Hiltgen 2024-06-19 13:35:38 -07:00
  • 9bceb3b55e Enable concurrency by default Daniel Hiltgen 2024-05-06 17:47:52 -07:00
  • 7add3e5267 Update README.md (#5214) RAPID ARCHITECT 2024-06-30 21:00:57 -05:00
  • c4f2236cf9 Update gpu.md (#5382) Eduard 2024-07-01 03:48:51 +02:00
  • b7ccdcef94 Update api.md Jeffrey Morgan 2024-06-29 16:22:49 -07:00
  • 1f4f46800c Do not shift context for sliding window models (#5368) Jeffrey Morgan 2024-06-28 19:39:31 -07:00
  • 42574d3b11 Include Show Info in Interactive (#5342) royjhan 2024-06-28 13:15:52 -07:00
  • 7bd7e113e3 Ollama Show: Check for Projector Type (#5307) royjhan 2024-06-28 11:30:16 -07:00
  • 20240927f8 Update docs (#5312) royjhan 2024-06-28 09:58:14 -07:00
  • 3af1c58146 gemma2 graph Michael Yang 2024-06-27 10:52:25 -07:00
  • d90b27a57f update readme for gemma 2 (#5333) Michael 2024-06-27 12:45:16 -04:00
  • b7ce14c764 zip: prevent extracting files into parent dirs (#5314) Michael Yang 2024-06-26 21:38:21 -07:00
  • 161229a153 llm: architecture patch (#5316) Jeffrey Morgan 2024-06-26 21:38:12 -07:00
  • bd8d680e26 refactor error Josh Yan 2024-07-01 15:57:57 -07:00
  • a562b9069f refactor error Josh Yan 2024-07-01 15:56:47 -07:00
  • 88bcd79bb9 err on insecure path Michael Yang 2024-06-30 11:10:40 -07:00
  • 5d76e78c2f add error message for unsupported arch Josh Yan 2024-07-01 15:43:03 -07:00
  • 826c3179b7 refactor convert Michael Yang 2024-05-31 20:00:49 -07:00