Commit Graph

  • 34c7321e11 clean up on server stop Bruce MacDonald 2023-09-20 22:32:19 +01:00
  • 4b444e0ad5 remove tmp directories created by previous servers Bruce MacDonald 2023-09-20 21:12:20 +01:00
  • 8c83701e9f
    Merge pull request #566 from jmorganca/mxyng/api-check-model-exists Michael Yang 2023-09-21 10:35:14 -07:00
  • 6137b12799 validate existence and pull model using api Michael Yang 2023-09-21 09:50:52 -07:00
  • 1fabba474b refactor default allow origins Michael Yang 2023-09-21 09:42:16 -07:00
  • 6dfd296e8b Add an example GBNF JSON model Valentin Anger 2023-09-21 17:27:26 +02:00
  • d5335df9c2 Add support for GBNF grammar definitions Valentin Anger 2023-09-21 17:27:07 +02:00
  • 2d5f513dfd context saving/loading Abdullah Ali 2023-09-21 12:08:12 +03:00
  • 765770efdb
    Merge pull request #562 from jmorganca/mxyng/fix-ollama-host Michael Yang 2023-09-20 19:54:47 -07:00
  • 9297ff8330 fix OLLAMA_HOST parsing for ip6 Michael Yang 2023-09-20 17:49:48 -07:00
  • ee4fd16f2c
    Merge pull request #556 from jmorganca/pack-cuda Michael Yang 2023-09-20 15:02:36 -07:00
  • a9ed7cc6aa rename generate.go Michael Yang 2023-09-20 14:39:15 -07:00
  • 6c6a31a1e8 embed libraries using cmake Michael Yang 2023-09-20 12:15:23 -07:00
  • fc6ec356fc remove libcuda.so Bruce MacDonald 2023-09-20 20:36:14 +01:00
  • 1255bc9b45 only package 11.8 runner Bruce MacDonald 2023-09-20 20:00:41 +01:00
  • 084e4c782a
    Merge pull request #557 from jmorganca/mxyng/cleanup Michael Yang 2023-09-20 11:51:01 -07:00
  • 58ffa03d8b fix impossible condition Michael Yang 2023-09-19 15:23:33 -07:00
  • 637f8bc6a5
    Merge pull request #536 from jmorganca/mxyng/redirect-uploads Michael Yang 2023-09-20 11:27:03 -07:00
  • 499e9007a5 pick chunksize based on location Michael Yang 2023-09-19 14:22:54 -07:00
  • b9bb5ca288 use cuda_version Bruce MacDonald 2023-09-20 17:58:16 +01:00
  • 4e8be787c7 pack in cuda libs Bruce MacDonald 2023-09-20 17:40:42 +01:00
  • aa45d7c1df draft: explicitly follow upload redirects Michael Yang 2023-09-14 15:42:50 -07:00
  • e35565c567
    Merge pull request #555 from jmorganca/mxyng/fix-windows-startup Michael Yang 2023-09-19 10:51:58 -07:00
  • a5520bfb42 fix build Michael Yang 2023-09-19 10:42:20 -07:00
  • 2627c464ba
    Merge pull request #554 from jmorganca/mxyng/fix-windows-startup Michael Yang 2023-09-19 09:42:12 -07:00
  • b58d5d16b0 fix mkdir on windows Michael Yang 2023-09-19 09:36:30 -07:00
  • 97a79f67ac Adding ability have cuda work on docker with the ubuntu image provided, along with a docker.md for commands that can be added documenting around docker usage thekevshow 2023-09-18 16:35:21 -05:00
  • 24580df958
    only add a layer if there is actual data (#535) Patrick Devine 2023-09-18 13:47:45 -07:00
  • 80dd44e80a
    Cmd changes (#541) Patrick Devine 2023-09-18 12:26:56 -07:00
  • 94e1d96b29
    Updated README section on community projects for table (#550) James Braza 2023-09-18 12:22:50 -07:00
  • 66003e1d05
    subprocess improvements (#524) Bruce MacDonald 2023-09-18 15:16:32 -04:00
  • c9345fae7f tighten up the terminal output Patrick Devine 2023-09-18 12:04:14 -07:00
  • 4a878dd3dc Updated README section on community projects for table James Braza 2023-09-18 11:45:54 -07:00
  • afb955ee19 Initial commit Tauseef Bashir 2023-09-17 21:00:16 +00:00
  • 61f7f68a41
    Added link to ollama-ui in README James Braza 2023-09-16 15:36:47 -07:00
  • 66dfcad0ea Add placeholder text for the prompt line Patrick Devine 2023-09-16 12:18:43 -07:00
  • c345053a8b
    Merge pull request #537 from jmorganca/mxyng/upload Michael Yang 2023-09-15 17:48:39 -07:00
  • ce1f0089d8 Load the model when starting the repl Patrick Devine 2023-09-15 15:59:56 -07:00
  • 08d7c2a944 fix error on upload chunk Michael Yang 2023-09-15 15:59:30 -07:00
  • bdc01aa575 only add a layer if there is actual data Patrick Devine 2023-09-15 13:57:28 -07:00
  • bc9573dcb1
    Merge pull request #530 from jmorganca/mxyng/progresswriter Michael Yang 2023-09-15 12:43:46 -07:00
  • 762165979e simplify by using glob Bruce MacDonald 2023-09-15 11:24:35 -04:00
  • e53bc57d4d split uploadBlobChunked Michael Yang 2023-09-14 13:30:28 -07:00
  • f0b398d17f implement ProgressWriter Michael Yang 2023-09-14 09:54:05 -07:00
  • 8efbc5df55
    DRAFT: add a simple python client to access ollama (#522) Patrick Devine 2023-09-14 16:37:38 -07:00
  • cad4c59c2d comments Patrick Devine 2023-09-14 16:31:05 -07:00
  • ccc3e9ac6d
    Merge pull request #531 from jmorganca/mxyng/content-length Michael Yang 2023-09-14 13:33:11 -07:00
  • daa4f096f9 set request.ContentLength Michael Yang 2023-09-14 10:05:29 -07:00
  • 3ee85f1c6c
    Merge pull request #526 from jmorganca/mxyng/cleanup Michael Yang 2023-09-14 13:10:59 -07:00
  • 8c111ce539 Update llama.go Bruce MacDonald 2023-09-14 15:42:01 -04:00
  • 5f507b92af Update llama.go Bruce MacDonald 2023-09-14 15:39:47 -04:00
  • 3518d79c5d subprocess improvements Bruce MacDonald 2023-09-05 14:20:48 -04:00
  • 2540c9181c
    support for packaging in multiple cuda runners (#509) Bruce MacDonald 2023-09-14 15:08:13 -04:00
  • 83ffb154bc
    Merge pull request #507 from jmorganca/mxyng/build Michael Yang 2023-09-14 11:25:59 -07:00
  • 9aa192c812 update cuda docker image Michael Yang 2023-09-08 13:41:49 -07:00
  • fc8707686f
    Update API docs (#527) Matt Williams 2023-09-14 08:51:26 -07:00
  • bfeb127bf1
    Update api.md Michael Chiang 2023-09-14 08:50:48 -07:00
  • 3eac72b139
    Update docs/api.md Matt Williams 2023-09-14 08:30:20 -07:00
  • f92bc54e4f
    Update docs/api.md Matt Williams 2023-09-14 08:30:07 -07:00
  • 1b0db8e49b
    Update docs/api.md Matt Williams 2023-09-14 08:29:58 -07:00
  • 747a9a4002 cuda version env var Bruce MacDonald 2023-09-14 11:29:31 -04:00
  • 6e4ee08a88 Update generate_linux.go Bruce MacDonald 2023-09-12 17:12:47 -04:00
  • ec6da57cc3 cpu builds Bruce MacDonald 2023-09-12 13:44:10 -04:00
  • fb3bf845b7 use nvcc cuda version if available Bruce MacDonald 2023-09-12 12:59:14 -04:00
  • d8b4905d5a enable packaging multiple cuda versions Bruce MacDonald 2023-09-11 19:34:12 -04:00
  • 2b5caa37b7 add cuda docker image (#488) Michael Yang 2023-09-08 07:38:19 -07:00
  • 66061920e0 Update generate_linux.sh Bruce MacDonald 2023-09-07 17:55:04 -04:00
  • f54cc3b6b4 linux gpu support Bruce MacDonald 2023-09-05 14:20:48 -04:00
  • e2389b63aa add examples of streaming in python and node matt/streamingapi Matt Williams 2023-09-14 07:12:09 -07:00
  • fd682da42f strange TOC was getting auto generated Matt Williams 2023-09-13 17:03:58 -07:00
  • 82d12935e9 Update API docs Matt Williams 2023-09-13 16:59:45 -07:00
  • f89c23764b
    Merge pull request #525 from jmorganca/mxyng/falcon-decode Michael Yang 2023-09-13 15:08:47 -07:00
  • e6881cabd0 remove unused Michael Yang 2023-09-13 11:46:29 -07:00
  • d028853879 fix: add falcon.go Michael Yang 2023-09-13 14:47:32 -07:00
  • 949553db23
    Merge pull request #519 from jmorganca/mxyng/decode Michael Yang 2023-09-13 12:43:57 -07:00
  • 0ed358d77f add a simple python client to access ollama Patrick Devine 2023-09-12 17:26:58 -07:00
  • 0c5a454361 fix model type for 70b Michael Yang 2023-09-12 10:52:57 -07:00
  • f59c4d03f7
    fix ggml arm64 cuda build (#520) Bruce MacDonald 2023-09-12 17:06:48 -04:00
  • efc7757dfd fix ggml arm64 cuda build Bruce MacDonald 2023-09-12 16:29:14 -04:00
  • 7dee25a07f fix falcon decode Michael Yang 2023-09-12 10:01:20 -07:00
  • 6a6a4519ba amd64 linux build runner Bruce MacDonald 2023-09-12 15:20:04 -04:00
  • ce5cc397b9 cpu builds Bruce MacDonald 2023-09-12 13:44:10 -04:00
  • c01504ae5f use nvcc cuda version if available Bruce MacDonald 2023-09-12 12:59:14 -04:00
  • 1180b0d13f enable packaging multiple cuda versions Bruce MacDonald 2023-09-11 19:34:12 -04:00
  • 659d612e4c add cuda docker image (#488) Michael Yang 2023-09-08 07:38:19 -07:00
  • e7b3247151 Update generate_linux.sh Bruce MacDonald 2023-09-07 17:55:04 -04:00
  • e95bc890ec linux gpu support Bruce MacDonald 2023-09-05 14:20:48 -04:00
  • f221637053
    first pass at linux gpu support (#454) Bruce MacDonald 2023-09-12 11:04:35 -04:00
  • f1aab4f551 Update development.md Bruce MacDonald 2023-09-12 11:03:42 -04:00
  • 34caafc447 Allow customization of ollama models etc path Sasha Devol 2023-09-12 06:03:36 -05:00
  • 4f6f6e2659 Create build scripts Rafael Sundorf 2023-09-12 07:08:51 +02:00
  • 45ac07cd02
    create the blobs directory correctly (#508) v0.0.19 Patrick Devine 2023-09-11 14:54:52 -07:00
  • bb5e4438b4 create the blobs directory correctly Patrick Devine 2023-09-11 14:52:39 -07:00
  • 7d749cc787 fix darwin build script Jeffrey Morgan 2023-09-11 16:31:46 -04:00
  • e7e91cd71c
    add autoprune to remove unused layers (#491) Patrick Devine 2023-09-11 11:46:35 -07:00
  • 6d39bfa590 address comments Patrick Devine 2023-09-11 11:43:04 -07:00
  • e6093f7ab5 use total gpu memory Bruce MacDonald 2023-09-11 12:38:42 -04:00
  • 3a49b0b346 use cmake toolchain to simplify build Michael Yang 2023-09-08 16:07:08 -07:00
  • 3920e15386
    add model format to config layer (#497) Jeffrey Morgan 2023-09-09 17:53:44 -04:00
  • 2cc649f2f7 add model format to config layer Jeffrey Morgan 2023-09-09 11:03:00 -04:00