Commit Graph

  • 12e7a96e5a remove whitespace change jmorganca 2024-07-08 22:09:55 -07:00
  • 77f4aedae1 server: fix unneeded model reloads when setting OLLAMA_NUM_PARALLEL jmorganca 2024-07-08 20:12:07 -07:00
  • e32de893ec punch the linter again pdevine/ggla Patrick Devine 2024-07-08 18:50:10 -07:00
  • c37ab3b9f2 punch the linter in the face Patrick Devine 2024-07-08 18:40:24 -07:00
  • b44320db13 Bundle missing CRT libraries Daniel Hiltgen 2024-07-08 18:24:21 -07:00
  • 6367b7449e try feeding the linter again Patrick Devine 2024-07-08 17:23:05 -07:00
  • 8ba3f38f82 feed the linter again + llama.cpp patches Patrick Devine 2024-07-08 17:03:13 -07:00
  • 3342e5f035 merge conflicts Roy Han 2024-07-08 15:15:09 -07:00
  • b7c622dd32 Merge branch 'main' into royh-batchembed royjhan 2024-07-08 15:10:52 -07:00
  • a3058002c4 feed the linter Patrick Devine 2024-07-08 12:15:31 -07:00
  • a451611761 add adapter conversion for modelfiles Patrick Devine 2024-07-06 18:19:56 -07:00
  • 5d4a331de3 more unittests Patrick Devine 2024-07-05 23:11:20 -07:00
  • 2e055e3af8 ggla checkin Patrick Devine 2024-07-05 22:48:21 -07:00
  • 9f32c634ae refactor convert Michael Yang 2024-05-31 20:00:49 -07:00
  • a4978a94b5 update convert test to check result data Michael Yang 2024-06-03 09:49:13 -07:00
  • 2644c4e682 Update docs/openai.md royjhan 2024-07-08 14:46:05 -07:00
  • 04cde43b2a Update docs/openai.md royjhan 2024-07-08 14:44:16 -07:00
  • 4b4e97ed10 tests mxyng/update-registry-domain Michael Yang 2024-05-20 14:04:27 -07:00
  • 8d62a65ca7 rebase main Michael Yang 2024-05-13 12:44:58 -07:00
  • bf5ba6065b migrate registry domain Michael Yang 2024-02-13 16:34:55 -08:00
  • 09c0972a6a filepath.ToSlash Michael Yang 2024-02-13 16:30:49 -08:00
  • 0bacb30007 Workaround broken ROCm p2p copy Daniel Hiltgen 2024-07-05 12:46:28 -07:00
  • 9fbba6e8eb Merge 4dc444bc55beba7fe32db3d6467ddd4d2f5299fe into 53da2c69654769c0c086af695722e1d9b9ee6ecc Marcus Vogel 2024-07-08 22:43:09 +08:00
  • 179fb688b3 Merge 316a97e43b6b381046e70edcc1320be4babdad99 into 53da2c69654769c0c086af695722e1d9b9ee6ecc Joan Fontanals 2024-07-08 18:49:54 +12:00
  • fb7ebd8f5d Merge 36cf87f31457732c0f0e3b25519f1ba9c6dd28d7 into 53da2c69654769c0c086af695722e1d9b9ee6ecc richardanaya2_2048b.Q6_K.gguf 2024-07-07 21:39:20 -07:00
  • 53da2c6965 llm: remove ambiguous comment when putting upper limit on predictions to avoid infinite generation (#5535) v0.2.0 Jeffrey Morgan 2024-07-07 14:32:05 -04:00
  • b0de40bc5d llm: remove ambiguous comment when putting upper limit on predictions to avoid infinite run-ons jmorganca 2024-07-07 14:21:06 -04:00
  • d8def1ff94 llm: allow gemma 2 to context shift (#5534) v0.1.49-rc14 Jeffrey Morgan 2024-07-07 13:41:51 -04:00
  • 91a38cbef1 llm: allow gemma 2 to context shift jmorganca 2024-07-07 13:02:04 -04:00
  • 571dc61955 Update llama.cpp submodule to a8db2a9c (#5530) Jeffrey Morgan 2024-07-07 13:03:09 -04:00
  • 0e09c380fc llm: print caching notices in debug only (#5533) Jeffrey Morgan 2024-07-07 12:38:04 -04:00
  • e13d321b23 llm: print caching notices in debug only jmorganca 2024-07-07 12:36:04 -04:00
  • f01a5f2fa3 update patches jmorganca 2024-07-07 12:11:09 -04:00
  • 5657fc22e0 update llama.cpp submodule to commit a8db2a9c jmorganca 2024-07-07 12:05:56 -04:00
  • e25faca879 Add Environment Variable For Row Split Thomas Liao 2024-07-07 01:22:30 -07:00
  • 0ee87615c7 sched: don't error if paging to disk on Windows and macOS (#5523) v0.1.49-rc13 Jeffrey Morgan 2024-07-06 22:01:52 -04:00
  • ee20e0288b add TODO for other OSes jmorganca 2024-07-06 21:57:40 -04:00
  • 886672a6b5 sched: dont error if paging for macOS and Windows jmorganca 2024-07-06 21:03:12 -04:00
  • f8241bfba3 gpu: report system free memory instead of 0 (#5521) v0.1.49-rc12 Jeffrey Morgan 2024-07-06 19:35:04 -04:00
  • c9d2bb28a2 gpu: report system free memory instead of 0 jmorganca 2024-07-06 19:27:42 -04:00
  • 4607c70641 llm: add -DBUILD_SHARED_LIBS=off to common cpu cmake flags (#5520) Jeffrey Morgan 2024-07-06 18:58:16 -04:00
  • b0b02c75e5 llm: add -DBUILD_SHARED_LIBS=off to common cpu cmake flags jmorganca 2024-07-06 18:57:32 -04:00
  • c12f1c5b99 release: move mingw library cleanup to correct job v0.1.49-rc11 jmorganca 2024-07-06 16:12:29 -04:00
  • a08f20d910 release: remove unwanted mingw dll.a files jmorganca 2024-07-06 15:21:15 -04:00
  • 6cea036027 Revert "llm: only statically link libstdc++" jmorganca 2024-07-06 15:10:48 -04:00
  • 5796bfc401 llm: only statically link libstdc++ v0.1.49-rc10 jmorganca 2024-07-06 14:06:20 -04:00
  • f1a379aa56 llm: statically link pthread and stdc++ dependencies in windows build v0.1.49-rc9 jmorganca 2024-07-06 12:54:02 -04:00
  • 6d7248207e Update amd-igpu-780m.md alexhegit 2024-07-06 16:11:31 +08:00
  • 9ae146993e llm: add GGML_STATIC flag to windows static lib v0.1.49-rc8 jmorganca 2024-07-06 03:27:05 -04:00
  • 9b58aecd3e llm: build with GGML_STATIC=on for static lib jmorganca/ggml-static jmorganca 2024-07-06 02:50:25 -04:00
  • 0f7ebb7a89 Update README.md Mateusz Migas 2024-07-06 08:15:31 +02:00
  • e0348d3fe8 llm: add COMMON_DARWIN_DEFS to arm static build (#5513) v0.1.49-rc7 Jeffrey Morgan 2024-07-05 22:42:42 -04:00
  • ddd875fdde llm: add COMMON_DARWIN_DEFS to arm static build jmorganca 2024-07-05 22:41:49 -04:00
  • 2cc854f8cb llm: fix missing dylibs by restoring old build behavior on Linux and macOS (#5511) v0.1.49-rc6 Jeffrey Morgan 2024-07-05 21:48:31 -04:00
  • b72ec5c2fe crlf -> lf jmorganca 2024-07-05 21:00:08 -04:00
  • 1f1fb2562e llm: fix missing dylibs by restoring old build behavior jmorganca 2024-07-05 20:59:13 -04:00
  • c0e6c22b04 Revert "fix cmake build (#5505)" jmorganca 2024-07-05 20:55:19 -04:00
  • 5304b765b2 llm: put back old include dir (#5507) Jeffrey Morgan 2024-07-05 19:34:21 -04:00
  • fb6cbc02fb update named templates Michael Yang 2024-06-27 14:15:17 -07:00
  • 9136beb08c llm: update link paths for old submodule commits jmorganca 2024-07-05 19:06:00 -04:00
  • d60ab2d480 llm: put back old include dir jmorganca 2024-07-05 18:43:07 -04:00
  • 4fd5f3526a fix cmake build (#5505) v0.1.49-rc5 v0.1.49-rc4 Jeffrey Morgan 2024-07-05 19:07:01 -04:00
  • 842f85f758 Merge pull request #5502 from dhiltgen/ci_fixes Daniel Hiltgen 2024-07-05 15:39:11 -07:00
  • 9d30f9f8b3 Always go build in CI generate steps Daniel Hiltgen 2024-07-05 12:25:53 -07:00
  • c5c20aedf3 fix cmake build jmorganca 2024-07-05 18:24:57 -04:00
  • 631cfd9e62 types/model: remove knowledge of digest (#5500) Blake Mizerany 2024-07-05 13:42:30 -07:00
  • fe7ef0edf3 types/model: remove knowledge of digest Blake Mizerany 2024-07-05 11:59:14 -07:00
  • 326363b3a7 no funcs Michael Yang 2024-07-03 13:49:14 -07:00
  • ac7a842e55 fix model reloading Michael Yang 2024-07-03 09:00:07 -07:00
  • 2c3fe1fd97 comments Michael Yang 2024-06-20 11:00:08 -07:00
  • 269ed6e6a2 update message processing Michael Yang 2024-06-17 10:38:55 -07:00
  • 78fb33dd07 fix typo in cgo directives in llm.go (#5501) Jeffrey Morgan 2024-07-05 15:18:36 -04:00
  • a1396f1282 fix typo in cgo directives in llm.go jmorganca 2024-07-05 15:18:03 -04:00
  • 8f8e736b13 update llama.cpp submodule to d7fd29f (#5475) Jeffrey Morgan 2024-07-05 13:25:58 -04:00
  • cf7cca8c1b fix pooling bug jmorganca 2024-07-04 21:23:18 -04:00
  • 9813954828 update commit to d7fd29f jmorganca 2024-07-04 14:00:34 -04:00
  • 987572b035 update patches jmorganca 2024-07-04 11:30:38 -04:00
  • 8134277c1a update commit to 807b0c4 jmorganca 2024-07-04 11:18:26 -04:00
  • 44a0ff860d maybe fix linux build jmorganca 2024-07-04 01:37:03 -04:00
  • e502e45670 maybe fix linux build jmorganca 2024-07-04 01:28:21 -04:00
  • 9d09ced369 link ggml and llama jmorganca 2024-07-04 00:16:39 -04:00
  • 287674ade3 fix windows build jmorganca 2024-07-03 23:56:33 -04:00
  • 97caca505e fix rocm build on windows jmorganca 2024-07-03 23:38:38 -04:00
  • 6bd11e2243 fix build on macOS jmorganca 2024-07-03 22:07:56 -04:00
  • 7bc0923872 fix build jmorganca 2024-07-03 20:03:17 -04:00
  • 2c32af211f update llama.cpp submodule to a27152b6 jmorganca 2024-07-02 21:21:47 -04:00
  • d89454de80 Use slot with cached prompt instead of least recently used (#5492) Jeffrey Morgan 2024-07-05 12:32:47 -04:00
  • f691bcf515 actually report longest jmorganca 2024-07-05 11:44:43 -04:00
  • f53679433d Use common prefix to select slot jmorganca 2024-07-05 11:34:38 -04:00
  • af28b94533 Merge pull request #5469 from dhiltgen/prevent_system_oom Daniel Hiltgen 2024-07-05 08:22:20 -07:00
  • e9188e971a Fix assert on small embedding inputs (#5491) Jeffrey Morgan 2024-07-05 11:20:57 -04:00
  • 78eddfc068 Merge pull request #4412 from dhiltgen/win_docs Daniel Hiltgen 2024-07-05 08:18:22 -07:00
  • 02c24d3d01 Merge pull request #5466 from dhiltgen/fix_clip_unicode Daniel Hiltgen 2024-07-05 08:16:58 -07:00
  • 79c1073a31 docs: add OpenGPA in Readme Web & Desktop Laurent Eschenauer 2024-07-05 16:34:58 +02:00
  • a5c7496841 Merge 97ce037489b1bf40495ee6072c7fc9c6bac834fc into 4d71c559b21ec9207a328b824ce534bdbaf59f2d Alessandro de Oliveira Faria (A.K.A.CABELO) 2024-07-05 02:10:39 -07:00
  • b87d0e4ad7 fix warning & better naming Zhe, Wang 2024-07-05 11:28:42 +08:00
  • 622fb981ce draft works on arc770m Zhe, Wang 2024-07-05 11:06:10 +08:00
  • 70ded27314 Update llm/patches/09-pooling.diff Jeffrey Morgan 2024-07-04 21:22:34 -04:00
  • 4683bdb241 Fix assert on small embedding inputs jmorganca 2024-07-04 21:18:55 -04:00
  • 000ce02098 Merge branch 'main' into main Maas Lalani 2024-07-04 17:50:44 -04:00