Commit Graph

  • 18662d1180 consistent whitespace jmorganca 2024-06-11 22:50:10 -07:00
  • 3d1f3569cf update .gitattributes jmorganca 2024-06-11 22:48:06 -07:00
  • 083a9e9b4e link metal jmorganca 2024-06-11 22:46:14 -07:00
  • d0703eaf44 wip jmorganca 2024-06-11 18:53:48 -07:00
  • ce00e387c3 wip meta jmorganca 2024-06-11 11:12:00 -07:00
  • 763d7b601c sync jmorganca 2024-06-10 17:23:09 -07:00
  • 4d0e6c55b0 remove perl docs jmorganca 2024-06-10 09:26:19 -07:00
  • 3375b82c56 remove build scripts jmorganca 2024-06-10 02:56:37 -04:00
  • b8c1065ab6 remove need for perl jmorganca 2024-06-10 00:04:21 -04:00
  • a632a04426 fix output jmorganca 2024-06-09 23:53:40 -04:00
  • 110f37ffb0 arch build jmorganca 2024-06-09 20:19:11 -07:00
  • f2f03ff7f2 add temporary makefile jmorganca 2024-06-09 22:33:31 -04:00
  • ba0ff1c46a fix cuda and rocm builds jmorganca 2024-06-09 19:49:22 -04:00
  • 9966a055e5 fix cgo flags for darwin amd64 jmorganca 2024-06-09 14:30:41 -07:00
  • 7aa7a3c1e5 remove -fPIC from build_hipblas.sh jmorganca 2024-06-07 12:52:49 -04:00
  • de634b7fd7 fix issues with runner jmorganca 2024-06-07 09:32:52 -07:00
  • 795753be7e move sync script back in for now jmorganca 2024-06-07 09:26:44 -07:00
  • 0eed68fed4 llama: sync jmorganca 2024-06-07 00:27:24 -07:00
  • 783134a3bb update to d5c938cd jmorganca 2024-06-07 00:15:58 -07:00
  • 74a158a79e add patches jmorganca 2024-06-06 23:55:47 -07:00
  • 8f79a2e86a cleanup stop code jmorganca 2024-06-04 00:58:58 -07:00
  • a4d402c403 fix example jmorganca 2024-06-04 00:43:03 -07:00
  • e1dfc757b3 revert llm changes jmorganca 2024-06-04 00:40:19 -07:00
  • 7d0a452938 num predict jmorganca 2024-05-28 23:38:44 -07:00
  • 43efc893d7 basic progress jmorganca 2024-05-28 23:11:48 -07:00
  • 20afaae020 add more runner params jmorganca 2024-05-28 00:02:01 -07:00
  • 72f3fe4b94 truncate stop properly jmorganca 2024-05-27 23:09:56 -07:00
  • a379d68aa9 wip stop tokens jmorganca 2024-05-27 14:38:44 -07:00
  • b2ef3bf490 embeddings jmorganca 2024-05-27 11:33:47 -07:00
  • ce15ed6d69 remove dependency on llm jmorganca 2024-05-26 23:23:09 -07:00
  • c0b94376b2 grammar jmorganca 2024-05-26 23:14:44 -07:00
  • 72be8e27c4 sampling jmorganca 2024-05-26 23:01:05 -07:00
  • d12db0568e better example module, add port jmorganca 2024-05-25 20:11:57 -07:00
  • ec17359a68 wip jmorganca 2024-05-24 10:09:35 -07:00
  • fbc8572859 add llava to runner jmorganca 2024-05-23 18:22:15 -07:00
  • 87af27dac0 fix output in build_hipblas.sh jmorganca 2024-05-20 16:43:53 -07:00
  • 54f391309f mods to build_hipblas.sh for linux jmorganca 2024-05-20 16:15:16 -07:00
  • 28bedcd807 wip jmorganca 2024-05-20 15:27:10 -07:00
  • 922d0acbdb improve cuda and hipblas build scripts jmorganca 2024-05-20 16:17:13 -04:00
  • b22d78720e cuda linux jmorganca 2024-05-19 23:11:30 -07:00
  • 905568a47f Update README.md Jeffrey Morgan 2024-05-19 16:47:50 -07:00
  • a15ac52fbe Update README.md Jeffrey Morgan 2024-05-19 16:47:19 -07:00
  • 9547aa53ff disable log file jmorganca 2024-05-19 16:36:32 -07:00
  • e29205ad6d fix readme for llava jmorganca 2024-05-19 16:33:37 -07:00
  • a8f91d3cc1 add llava jmorganca 2024-05-19 16:30:11 -07:00
  • a9884ae136 llama: add clip dependencies jmorganca 2024-05-19 14:06:46 -07:00
  • e37651cca0 add clip and parallel requests to the todo list jmorganca 2024-05-19 14:01:52 -07:00
  • 593d6836ab fix cuda build jmorganca 2024-05-19 03:34:24 -04:00
  • 533a7e7d50 fix build on windows jmorganca 2024-05-19 03:19:41 -04:00
  • 0873d28b16 fix ggml-metal.m build constraints jmorganca 2024-05-19 00:10:15 -07:00
  • bb795faa6c fix ggml-metal.m jmorganca 2024-05-19 00:06:26 -07:00
  • e86db9381a avx2 should only add avx2 jmorganca 2024-05-18 23:53:29 -07:00
  • 4a5633e4bc fix sync script jmorganca 2024-05-18 23:50:50 -07:00
  • 86f453252b fix ggml-metal.m jmorganca 2024-05-18 23:34:58 -07:00
  • dfd8f34806 fix ggml-metal.m jmorganca 2024-05-18 23:31:41 -07:00
  • beb847b40f add license headers jmorganca 2024-05-18 23:30:28 -07:00
  • 785f76d390 pre-patch jmorganca 2024-05-18 23:27:01 -07:00
  • 9fe48978a8 move runner package down jmorganca 2024-05-18 23:15:51 -07:00
  • 01ccbc07fe replace static build in llm jmorganca 2024-05-18 22:22:46 -07:00
  • ec09be97e8 fix build jmorganca 2024-05-18 21:23:53 -07:00
  • 6129f30479 wip... jmorganca 2024-05-16 13:52:38 -07:00
  • eb1aa97961 rename server to runner jmorganca 2024-05-19 00:13:30 -04:00
  • 5e921e06ac Update README.md Jeffrey Morgan 2024-05-18 19:50:23 -07:00
  • 02089baf70 Update README.md Jeffrey Morgan 2024-05-18 19:49:43 -07:00
  • 870e91be76 Update README.md Jeffrey Morgan 2024-05-18 19:47:19 -07:00
  • 7ecc8e86c4 Update README.md Jeffrey Morgan 2024-05-18 19:46:44 -07:00
  • b1696e308e Add missing hipcc flags jmorganca 2024-05-18 23:07:19 -04:00
  • c646115b31 fix .gitattributes jmorganca 2024-05-18 22:39:41 -04:00
  • 0110994d06 Initial llama Go module jmorganca 2024-04-20 20:44:01 -04:00
  • 2ef3a217d1 add sync of llama.cpp jmorganca 2024-04-20 18:08:09 -04:00
  • 5e2653f9fe llm: update llama.cpp commit to 8962422 (#6618) Jeffrey Morgan 2024-09-03 21:12:39 -04:00
  • f29b167e1a Use cuda v11 for driver 525 and older (#6620) Daniel Hiltgen 2024-09-03 17:15:31 -07:00
  • 209fb2a0d6 Use cuda v11 for driver 525 and older Daniel Hiltgen 2024-09-03 15:45:13 -07:00
  • ce18653192 Merge a6d30ecefe71502a8a493031737ae5df32e7a5f3 into 037a4d103edff143db2f82e1feb8b5b80afea6f1 royjhan 2024-09-04 00:23:34 +02:00
  • d07564f7bf Merge pull request #6619 from ollama/jessegross/goserver-health Jesse Gross 2024-09-03 15:19:34 -07:00
  • 037a4d103e Log system memory at info (#6617) Daniel Hiltgen 2024-09-03 14:55:20 -07:00
  • ef23219fbd llm: update llama.cpp commit to 8962422 jmorganca 2024-09-03 17:47:48 -04:00
  • fc6b8b3ed8 Log system memory at info Daniel Hiltgen 2024-09-03 14:42:48 -07:00
  • 2ba5ae582b runner.go: Improve health status reporting when laoding model Jesse Gross 2024-09-03 12:05:46 -07:00
  • 5f8338cd36 api: add Client.BaseURL method presbrey 2024-09-03 17:24:28 -04:00
  • ed8477b45e runner.go: Break out of loops for final tokens Jesse Gross 2024-09-03 10:25:39 -07:00
  • ddb25559c5 Update README.md Raymond Camden 2024-09-03 15:54:24 -05:00
  • 3ff54aed11 Merge pull request #6559 from ollama/jessegross/goserver-options Jesse Gross 2024-09-03 13:53:53 -07:00
  • 56cd7c30c0 bump jmorganca 2024-09-03 16:38:13 -04:00
  • 04b188bca4 cli: Enhance 'ps' command with watch mode and filtering Yash_1124 2024-09-04 02:06:01 +05:30
  • 50c05d57e0 readme: add Painting Droid community integration (#5514) Mateusz Migas 2024-09-03 22:15:54 +02:00
  • 35159de18a readme: update Ollama4j link and add link to Ollama4j Web UI (#6608) Amith Koujalgi 2024-09-04 01:38:50 +05:30
  • d9f500d915 runner.go: Support GGUF LoRAs Jesse Gross 2024-08-28 17:12:06 -07:00
  • c5cd67e7a8 runner.go: Don't cast a Go handle to a C void * Jesse Gross 2024-08-28 21:07:16 -07:00
  • 6f50a8633a runner.go: Support resource usage command line options Jesse Gross 2024-08-28 09:29:09 -07:00
  • 3f22481cc7 llama: fix sync script ggml-metal_darwin_arm64.m filename (#6610) Jeffrey Morgan 2024-09-03 14:01:52 -04:00
  • 5bcf506be1 llama: fix sync script ggml-metal_darwin_arm64.m filename jmorganca 2024-09-03 13:50:05 -04:00
  • d5e9198ef7 Updated Ollama4j link and added link to Ollama4j Web UI tool. Amith Koujalgi 2024-09-03 22:40:51 +05:30
  • 94fff5805f Fix sprintf to snprintf (#5664) FellowTraveler 2024-09-03 11:32:59 -05:00
  • 14d5093cd0 readme: add PartCAD tool to readme for generating 3D CAD models using Ollama (#6605) OpenVMP 2024-09-03 09:28:01 -07:00
  • 9df5f0e8e4 Reduce docker image size (#5847) R0CKSTAR 2024-09-04 00:25:31 +08:00
  • 95d989fc08 Added the tool to generate 3D CAD models using Ollama Roman Kuzmenko 2024-09-03 02:17:20 -07:00
  • ad3eb00bee readme: add OllamaFarm project (#6508) presbrey 2024-09-02 16:05:36 -04:00
  • bfc2d61549 readme: add go-crew and Ollamaclient projects (#6583) Jonathan Hecl 2024-09-02 16:34:26 -03:00
  • 741affdfd6 docs: update faq.md for OLLAMA_MODELS env var permissions (#6587) SnoopyTlion 2024-09-03 03:31:29 +08:00