Commit Graph

  • 8fd3e47ac2 Remove Delete from Branch Roy Han 2024-06-14 16:25:06 -07:00
  • 818074c763 Docs Roy Han 2024-06-14 16:15:45 -07:00
  • 6be309e1bd Centralize GPU configuration vars Daniel Hiltgen 2024-05-08 11:11:50 -07:00
  • b005becdb9 Verbose functionality Roy Han 2024-06-14 15:58:22 -07:00
  • cb2a1d112a Use ModelName Parser Roy Han 2024-06-14 15:42:22 -07:00
  • da3bf23354 Workaround gfx900 SDMA bugs Daniel Hiltgen 2024-05-31 16:15:21 -07:00
  • 26ab67732b Bump ROCm linux to 6.1.1 Daniel Hiltgen 2024-06-06 10:43:55 -07:00
  • 45cacbaf05
    Merge pull request #4517 from dhiltgen/gpu_incremental Daniel Hiltgen 2024-06-14 15:35:00 -07:00
  • 17df6520c8 Remove mmap related output calc logic Daniel Hiltgen 2024-06-13 09:59:36 -07:00
  • 6f351bf586 review comments and coverage Daniel Hiltgen 2024-06-05 12:07:20 -07:00
  • ff4f0cbd1d Prevent multiple concurrent loads on the same gpus Daniel Hiltgen 2024-06-04 14:08:36 -07:00
  • fc37c192ae Refine CPU load behavior with system memory visibility Daniel Hiltgen 2024-06-03 19:09:23 -07:00
  • 434dfe30c5 Reintroduce nvidia nvml library for windows Daniel Hiltgen 2024-06-03 15:07:50 -07:00
  • 4e2b7e181d Refactor intel gpu discovery Daniel Hiltgen 2024-05-29 16:37:34 -07:00
  • 48702dd149 Harden unload for empty runners Daniel Hiltgen 2024-05-30 16:43:40 -07:00
  • 68dfc6236a refined test timing Daniel Hiltgen 2024-05-31 14:28:02 -07:00
  • 5e8ff556cb Support forced spreading for multi GPU Daniel Hiltgen 2024-05-08 14:32:42 -07:00
  • 6fd04ca922 Improve multi-gpu handling at the limit Daniel Hiltgen 2024-05-18 12:34:31 -07:00
  • 206797bda4 Fix concurrency integration test to work locally Daniel Hiltgen 2024-05-23 13:12:14 -07:00
  • 43ed358f9a Refine GPU discovery to bootstrap once Daniel Hiltgen 2024-05-15 15:13:16 -07:00
  • b32ebb4f29 Use DRM driver for VRAM info for amd Daniel Hiltgen 2024-05-14 16:18:42 -07:00
  • fb9cdfa723 Fix server.cpp for the new cuda build macros Daniel Hiltgen 2024-05-18 16:02:13 -07:00
  • efac488675 Revert "Limit GPU lib search for now (#4777)" Daniel Hiltgen 2024-06-03 08:31:48 -07:00
  • c037616d6b v1/models docs Roy Han 2024-06-14 14:39:34 -07:00
  • 6b800aa7b7
    openai: do not set temperature to 0 when setting seed (#5045) Jeffrey Morgan 2024-06-14 13:43:56 -07:00
  • 3cf19d9588 Add back envconfig Roy Han 2024-06-14 13:29:02 -07:00
  • fe1a625fee Update Test Roy Han 2024-06-14 12:00:41 -07:00
  • c826421f17 Use Namespace for Ownedby Roy Han 2024-06-14 11:59:15 -07:00
  • f46b4a6fa2 implement the vulkan C backend pufferffish 2024-06-14 19:56:35 +01:00
  • d55fa6f294 Merge remote-tracking branch 'upstream/main' Ricky Bobby 2024-06-14 14:35:26 -04:00
  • bfff252fa9 chore: converted from quoted strings to multiline JD Davis 2024-06-14 13:35:15 -05:00
  • c8e1bb5a66 Prevent wrapping from files Roy Han 2024-06-14 11:31:55 -07:00
  • 65c08507cb Empty List Testing Roy Han 2024-06-14 09:50:26 -07:00
  • dd7c9ebeaf
    server: longer timeout in TestRequests (#5046) Jeffrey Morgan 2024-06-14 09:48:25 -07:00
  • 08d1474d10 server: longer timeout in TestRequests jmorganca 2024-06-14 09:36:54 -07:00
  • ca1a151bbd openai: do not set temperature to 0 when setting seed jmorganca 2024-06-14 09:20:06 -07:00
  • 0c1d42fc67
    Merge 5c293fa0f63486599a6a28fbf8c8362c8d90886d into 4dc7fb952514673bcfff0aed0c2c8e40c1459987 Ashok Gelal 2024-06-14 02:11:41 -07:00
  • c323e43130
    Merge 6ab6a9029398376e56cdb4af70317d87f17f774a into 4dc7fb952514673bcfff0aed0c2c8e40c1459987 Glen 2024-06-14 02:10:40 -07:00
  • b6cee78fdd
    Merge 5fb8ad2babca5169a4b640ab334596c938a52a6b into 4dc7fb952514673bcfff0aed0c2c8e40c1459987 Elliot 2024-06-14 02:10:36 -07:00
  • 3e6f593e3a Adds an uninstall script to the installer Noufal Ibrahim 2024-06-14 13:46:00 +05:30
  • ef7c6cb43a chore: added spectral to lint OpenAPI spec JD Davis 2024-06-13 23:25:47 -05:00
  • 73cba3f0d7 chore: add openapi 3.1 spec for public api JD Davis 2024-06-13 23:09:04 -05:00
  • 4dc7fb9525
    update 40xx gpu compat matrix (#5036) Patrick Devine 2024-06-13 20:10:33 -04:00
  • 3fe56f373e update 40xx gpu compat matrix Patrick Devine 2024-06-13 17:03:34 -07:00
  • d17f98192d Error Handling Roy Han 2024-06-13 16:05:28 -07:00
  • 81791a0efd Retrieve Middleware Roy Han 2024-06-13 15:35:11 -07:00
  • 691e44869e
    Merge branch 'main' into royh-openai royjhan 2024-06-13 14:08:54 -07:00
  • 30da7da2bb Add Mod Time to Show Roy Han 2024-06-13 13:51:49 -07:00
  • c39761c552
    Merge pull request #5032 from dhiltgen/actually_skip v0.1.44 Daniel Hiltgen 2024-06-13 13:26:09 -07:00
  • aac367636d Actually skip PhysX on windows Daniel Hiltgen 2024-06-13 13:17:19 -07:00
  • 15a687ae4b
    Merge pull request #5031 from ollama/mxyng/fix-multibyte-utf16 Michael Yang 2024-06-13 13:14:55 -07:00
  • d528e1af75 fix utf16 for multibyte runes Michael Yang 2024-06-13 11:39:01 -07:00
  • cd234ce22c parser: add test for multibyte runes Michael Yang 2024-06-13 11:09:22 -07:00
  • 94618b2365
    add OLLAMA_MODELS to envconfig (#5029) Patrick Devine 2024-06-13 15:52:03 -04:00
  • 6a68a5936d
    Update README.md Lord Basil - Automate EVERYTHING 2024-06-13 15:37:56 -04:00
  • c2e4bc2894 add OLLAMA_MODELS to envconfig Patrick Devine 2024-06-13 11:23:58 -04:00
  • c7ec861493 OpenAI Delete Model Roy Han 2024-06-13 12:33:13 -07:00
  • eccd1918d3 Retrieve Model Roy Han 2024-06-13 11:27:24 -07:00
  • 1fd236d177
    server: remove jwt decoding error (#5027) Jeffrey Morgan 2024-06-13 11:21:15 -07:00
  • 62333a5597 server: remove jwt decoding error jmorganca 2024-06-13 11:14:53 -07:00
  • e87fc7200d
    Merge pull request #5025 from ollama/mxyng/revert-parser-scan Michael Yang 2024-06-13 10:31:25 -07:00
  • 20b9f8e6f4 Revert "proper utf16 support" Michael Yang 2024-06-13 10:22:16 -07:00
  • 02e25b32fd
    Merge branch 'ollama:main' into main Zeyo 2024-06-13 16:53:27 +05:30
  • 916bc46c42 fix(parser): proper UTF-8 CJK supports CDFMLR 2024-06-13 19:21:07 +08:00
  • d22cec1c1f
    add utf8 test case 007gzs 2024-06-13 18:21:48 +08:00
  • 08ebacb8c4
    fix utf8 parser error 007gzs 2024-06-13 17:39:49 +08:00
  • c69bc19e46
    move OLLAMA_HOST to envconfig (#5009) Patrick Devine 2024-06-12 18:48:16 -04:00
  • 310940d11e Credit Co-Author royjhan 2024-06-12 15:45:32 -07:00
  • 758a88be21 feed the linter Patrick Devine 2024-06-12 18:21:49 -04:00
  • 2e7acad7dc move OLLAMA_HOST to envconfig Patrick Devine 2024-06-12 18:11:51 -04:00
  • e57f0d19f3 Add Test Roy Han 2024-06-12 15:06:21 -07:00
  • 12209bd021 Remove Latest at API Level Roy Han 2024-06-12 14:43:17 -07:00
  • bba5d177aa
    Merge pull request #5004 from ollama/mxyng/fix-templates Michael Yang 2024-06-12 14:39:29 -07:00
  • 1114d9661b Refactor Writers Roy Han 2024-06-12 14:11:26 -07:00
  • cb6d5b0310 OpenAI v1 models Roy Han 2024-06-12 13:59:21 -07:00
  • c16f8af911 fix: multiple templates when creating from model Michael Yang 2024-06-12 13:30:08 -07:00
  • 6d172e8e51
    Merge 9051578ecbcabb6f1497376717ddd565a6381640 into 217f60c3d95edc7d5dcbeb4c3cffb0190c147f92 Bruce MacDonald 2024-06-12 23:18:51 +08:00
  • 1d5fd1d2c6
    Merge 6204941d069c46264c6b73736d65b071dbcbae32 into 217f60c3d95edc7d5dcbeb4c3cffb0190c147f92 Sam 2024-06-12 23:16:17 +08:00
  • 9ec08f8262
    Merge e1f383437ec96e27e27f124e4b9b73ebcb0a03f4 into 217f60c3d95edc7d5dcbeb4c3cffb0190c147f92 Noah GITsham 2024-06-12 11:16:15 -03:00
  • 0c0eb9969b
    Merge 6186c56c99b436ad967beb80b0bc26494933dcf0 into 217f60c3d95edc7d5dcbeb4c3cffb0190c147f92 Lei Jitang 2024-06-12 16:32:54 +03:00
  • 259aaead22
    Create 1 renjy0219 2024-06-12 11:48:49 +08:00
  • 217f60c3d9
    Merge pull request #4987 from ollama/mxyng/revert-byte-order v0.1.43 Michael Yang 2024-06-11 16:04:20 -07:00
  • 7bdcd1da94 Revert "Merge pull request #4938 from ollama/mxyng/fix-byte-order" Michael Yang 2024-06-11 15:55:44 -07:00
  • ead259d877
    llm: fix seed value not being applied to requests (#4986) Jeffrey Morgan 2024-06-11 14:24:41 -07:00
  • c8223d6ad0 llm: fix seed value not being applied to requests jmorganca 2024-06-11 13:39:59 -07:00
  • 2ff45d571d
    Add Ollama-hpp to Community Libraries in README. (#4983) James Montgomery 2024-06-11 14:15:05 -04:00
  • dbcfc65b06 Add Ollama-hpp to Community Libraries in README. James Montgomery 2024-06-11 14:07:47 -04:00
  • 157f09acdf
    fix: "Skip searching for network devices" jayson-cloude 2024-06-11 16:11:35 +08:00
  • be2c5fd71a Remove :latest from Ollama List Roy Han 2024-06-10 16:39:44 -07:00
  • 78e143453a Touches Roy Han 2024-06-10 16:32:58 -07:00
  • 3ad14e1bb2 Remove Chat Template Roy Han 2024-06-10 15:33:33 -07:00
  • c453d524b8 Second Draft of Show with Projectors Included Roy Han 2024-06-10 15:01:12 -07:00
  • 0f3cf1d42e
    Merge pull request #4715 from ollama/mxyng/utf16-parser Michael Yang 2024-06-10 11:41:29 -07:00
  • 5bc029c529
    Merge pull request #4921 from ollama/mxyng/import-md Michael Yang 2024-06-10 11:41:09 -07:00
  • e9a9c6a8e8
    Merge pull request #4965 from ollama/mxyng/skip-layer-remove Michael Yang 2024-06-10 11:40:03 -07:00
  • 515f497e6d fix: skip removing layers that no longer exist Michael Yang 2024-06-10 11:15:03 -07:00
  • b27268aaef add test Michael Yang 2024-06-10 11:31:34 -07:00
  • f5f245cc15
    Merge pull request #4938 from ollama/mxyng/fix-byte-order Michael Yang 2024-06-10 09:38:12 -07:00
  • d63e1f5b34 Lint royh-show-rigid Roy Han 2024-06-10 09:36:05 -07:00
  • 8d26aa84f5
    Add TypingMind Tony Dinh 2024-06-10 16:27:04 +07:00