Commit Graph

  • 26d154d97a Load all layers on arm64 macOS if model is small enough Jeffrey Morgan 2024-01-22 17:04:14 -08:00
  • 0759d8996e
    Merge pull request #2148 from dhiltgen/intel_mac Daniel Hiltgen 2024-01-22 16:56:58 -08:00
  • 0f5b843319 Refine Accelerate usage on mac Daniel Hiltgen 2024-01-22 16:25:56 -08:00
  • ffaf52e1e9 update submodule to 011e8ec577fd135cbc02993d3ea9840c516d6a1c Jeffrey Morgan 2024-01-22 15:16:47 -08:00
  • 940b10b036
    Merge pull request #2144 from jmorganca/mxyng/update-faq Michael Yang 2024-01-22 13:46:57 -08:00
  • 3bc28736cd
    Merge pull request #2143 from dhiltgen/llm_verbosity Daniel Hiltgen 2024-01-22 13:19:16 -08:00
  • 93a756266c faq: update to use launchctl setenv Michael Yang 2024-01-22 12:30:58 -08:00
  • a0a829bf7a
    Merge pull request #2142 from dhiltgen/debug_on_fail Daniel Hiltgen 2024-01-22 12:29:22 -08:00
  • 730dcfcc7a Refine debug logging for llm Daniel Hiltgen 2024-01-22 12:26:49 -08:00
  • 27a2d5af54 Debug logging on init failure Daniel Hiltgen 2024-01-22 12:08:22 -08:00
  • 5f81a33f43
    update submodule to 6f9939d (#2115) Jeffrey Morgan 2024-01-22 11:56:40 -08:00
  • 5c17743b2c remove json stream errors mxyng/fix-status-code Michael Yang 2024-01-18 14:22:09 -08:00
  • 0bd5245acf fix: status on errors Michael Yang 2024-01-17 15:09:47 -08:00
  • 3c69ee1eaa update submodule to 6f9939d Jeffrey Morgan 2024-01-19 23:30:02 -05:00
  • b246c62d2c
    Merge pull request #545 from ollama-webui/openai-fix Timothy Jaeryang Baek 2024-01-22 09:46:39 -08:00
  • d67c632920 fix: openai Timothy J. Baek 2024-01-22 09:45:56 -08:00
  • 6225fde046
    Merge pull request #2102 from jmorganca/mxyng/fix-create-override Michael Yang 2024-01-22 09:37:48 -08:00
  • 4e426f2c0c
    Update README.md Pavel Frankov 2024-01-22 20:05:21 +03:00
  • 069184562b
    readline: drop not use min function (#2134) Meng Zhuo 2024-01-23 00:15:08 +08:00
  • 92d85ae2e6
    Merge pull request #543 from ollama-webui/message-refac Timothy Jaeryang Baek 2024-01-22 04:17:58 -08:00
  • 52e7684ff4 fix: code block unmount issue Timothy J. Baek 2024-01-22 04:14:07 -08:00
  • 917ab08f5c fix: code block styling Timothy J. Baek 2024-01-22 03:43:48 -08:00
  • 9c72f4da09 fix: styling Timothy J. Baek 2024-01-22 03:42:32 -08:00
  • e758855590 refac: response message Timothy J. Baek 2024-01-22 03:33:49 -08:00
  • d2c5f3d591 refac: convert str var to f-string Timothy J. Baek 2024-01-22 01:47:07 -08:00
  • 1abe5a5487
    Merge pull request #537 from CreatorGhost/fix-gpt-4-vision Timothy Jaeryang Baek 2024-01-22 01:41:57 -08:00
  • 83181b7968 fix: add max_token only when field not present Timothy J. Baek 2024-01-22 01:41:00 -08:00
  • b26e0fb7e7 refac Timothy J. Baek 2024-01-22 01:37:54 -08:00
  • b2b2a52248 readline: drop not use min function Meng Zhuo 2024-01-22 17:26:16 +08:00
  • 381d7e7f3a
    Update langchainpy.md vikesh001 2024-01-22 11:01:13 +05:30
  • 5576bb2348
    Merge pull request #2130 from dhiltgen/more_faster Daniel Hiltgen 2024-01-21 16:14:12 -08:00
  • 2738837786
    Merge pull request #2131 from dhiltgen/probe_cards_at_init Daniel Hiltgen 2024-01-21 16:13:47 -08:00
  • ec3764538d Probe GPUs before backend init Daniel Hiltgen 2024-01-21 15:39:59 -08:00
  • 95da5ab8e0
    Merge pull request #538 from Shiyinq/refactor-signup Timothy Jaeryang Baek 2024-01-21 15:36:09 -08:00
  • df54c723ae Make CPU builds parallel and customizable AMD GPUs Daniel Hiltgen 2024-01-21 12:57:13 -08:00
  • daa6bedcf9
    Merge pull request #535 from bhulston/fix/chat-imports Timothy Jaeryang Baek 2024-01-21 13:44:11 -08:00
  • fa8c990e58
    Merge pull request #2127 from dhiltgen/rocm_container Daniel Hiltgen 2024-01-21 11:49:01 -08:00
  • da72235ebf Combine the 2 Dockerfiles and add ROCm Daniel Hiltgen 2024-01-21 11:37:11 -08:00
  • 225fee8dfc
    Add update instructions for Linux & WSL2 David Heimann 2024-01-21 13:59:07 -05:00
  • 0776226e53 Add comment for Header field Christian Neff 2024-01-21 14:43:54 +01:00
  • 89c4aee29e
    Unlock mutex when failing to load model (#2117) Jeffrey Morgan 2024-01-20 20:54:46 -05:00
  • b18f3ad270 Unlock mutex when failing to load model Jeffrey Morgan 2024-01-20 20:05:15 -05:00
  • a447a083f2 Add compute capability 5.0, 7.5, and 8.0 Daniel Hiltgen 2024-01-20 12:15:50 -08:00
  • f32ea81b21
    increase minimum overhead to 1024MiB (#2114) Jeffrey Morgan 2024-01-20 17:11:38 -05:00
  • b7ca21a4c3 increase minimum overhead to 1024MiB Jeffrey Morgan 2024-01-20 16:19:59 -05:00
  • 8339e77e44 add client only target Fabrizio (Misto) Milo 2024-01-20 11:57:17 -08:00
  • 681a914990 Add support for CUDA 5.2 cards Daniel Hiltgen 2024-01-20 10:48:43 -08:00
  • 6a63c94153 feat: add guard clause to improve signup process Shiyinq 2024-01-20 21:54:53 +07:00
  • 8a49a2527c
    Merge branch 'main' into main Richard Macarthy 2024-01-20 11:36:22 +00:00
  • 01f0f1cdbe
    Update requirements.txt t-cool 2024-01-20 13:03:39 +09:00
  • 52194c2520
    Update main.py t-cool 2024-01-20 13:02:38 +09:00
  • 4c54f0ddeb
    sign dylibs on macOS (#2101) Jeffrey Morgan 2024-01-19 19:24:11 -05:00
  • 60afd6ecdd Add workaround for gpt-4-vision-preview model that support 4k tokens Aditya Pratap Singh 2024-01-20 04:34:47 +05:30
  • c08dfaa23d fix: remove overwritten model layers Michael Yang 2024-01-19 14:58:36 -08:00
  • 971b10c047 sign dylibs on macOS Jeffrey Morgan 2024-01-19 16:57:50 -05:00
  • 8662437a9f Add workaround for gpt-4-vision-preview model Aditya Pratap Singh 2024-01-20 04:17:06 +05:30
  • 3b76e736ae
    Merge pull request #2100 from dhiltgen/more_wsl_globs Daniel Hiltgen 2024-01-19 13:41:08 -08:00
  • 552db98bf1 More WSL paths Daniel Hiltgen 2024-01-19 13:23:29 -08:00
  • fdcdfef620
    Merge pull request #2099 from dhiltgen/fix_cuda_model_swap Daniel Hiltgen 2024-01-19 12:22:04 -08:00
  • 5b26d2a686 backend: make the data directory and the artifacts from the frontend customizable using environment variables lucasew 2024-01-19 17:13:09 -03:00
  • e3503d6617 backend: make dotenv optional lucasew 2024-01-19 17:12:14 -03:00
  • 6a042438af Switch to local dlopen symbols Daniel Hiltgen 2024-01-19 11:37:02 -08:00
  • 2789ed31a7 improve scratch buffer estimates scratch Jeffrey Morgan 2024-01-18 00:53:17 -05:00
  • dc88cc3981
    use gzip for runner embedding (#2067) Jeffrey Morgan 2024-01-19 13:23:03 -05:00
  • 4d85e2cb15 Add validation for chatGPT imports, stopping any breaking issues when imports are corrupted/not compatible Brandon Hulston 2024-01-19 11:22:28 -07:00
  • 35ace57784 add rst document for RAG Marclass 2024-01-19 10:48:04 -07:00
  • e2edbedede
    Merge pull request #1 from Marclass/rag-arbitrary-files Marclass 2024-01-19 10:44:53 -07:00
  • f559068186
    feat: Add epub support Dave Bauman 2024-01-13 08:46:56 -05:00
  • f079cb6b56
    Merge pull request #524 from Marclass/rag-arbitrary-files Timothy Jaeryang Baek 2024-01-19 00:09:04 -08:00
  • 31caa89cf1 use gzip for runner embedding Jeffrey Morgan 2024-01-18 23:53:48 -05:00
  • aa1d386042 Allow any file to be used for RAG. Marclass 2024-01-18 20:41:14 -07:00
  • 62976087c6
    Merge pull request #1999 from lainedfles/termux_android_cpu_only Daniel Hiltgen 2024-01-18 17:16:53 -08:00
  • 59b45b0ade fix matrix Michael Yang 2024-01-18 17:05:22 -08:00
  • 8e5fa44837 ci: use stubs libraries Michael Yang 2024-01-18 16:55:05 -08:00
  • 76258af092
    Merge branch 'main' into lwrless/json-schema Lwrless 2024-01-19 09:00:58 +08:00
  • ff33aa37ae
    Merge pull request #522 from ollama-webui/import-chat-fix Timothy Jaeryang Baek 2024-01-18 16:39:46 -08:00
  • 1a06b0cea6 fix: old chat log import issue Timothy J. Baek 2024-01-18 16:38:47 -08:00
  • 344342abdf Restore dyn_ext_server.c since RTLD_DEEPBIND has been removed Self Denial 2024-01-18 17:30:42 -07:00
  • eb76f3e379 Fix CPU-only build under Android Termux enviornment. Self Denial 2024-01-15 02:37:44 -07:00
  • d017e3d0a6
    Merge pull request #2060 from jmorganca/mxyng/fix-show Michael Yang 2024-01-18 16:02:27 -08:00
  • aac9ab4db7 fix show handler Michael Yang 2024-01-18 15:36:50 -08:00
  • 1f5b7ff976
    Merge pull request #1932 from jmorganca/mxyng/api-fields Michael Yang 2024-01-18 14:56:51 -08:00
  • e299831e2c
    Merge pull request #1958 from purificant/ci Michael Yang 2024-01-18 14:53:36 -08:00
  • 745b5934fa add model to ModelResponse Michael Yang 2024-01-18 14:32:55 -08:00
  • a38d88d828 api: add model for all requests Michael Yang 2024-01-11 14:07:54 -08:00
  • abec7f06e5
    Merge pull request #2056 from dhiltgen/slog Daniel Hiltgen 2024-01-18 14:27:24 -08:00
  • e5da190bac
    Merge pull request #2020 from jmorganca/mxyng/install-fedora Michael Yang 2024-01-18 14:23:42 -08:00
  • ecbfc0182f Go bump to v1.21 to pick up slog Daniel Hiltgen 2024-01-18 11:51:34 -08:00
  • fedd705aea Mechanical switch from log to slog Daniel Hiltgen 2024-01-18 10:52:01 -08:00
  • 82ee019bfc
    add open interpreter to list of extensions (#2016) Mike Bird 2024-01-18 16:59:39 -05:00
  • ad9dbc2a04
    Haystack Ollama Integration (#2021) Sachin Sachdeva 2024-01-18 22:38:32 +01:00
  • fccdf4c635
    Merge pull request #1987 from xyproto/archlinux Daniel Hiltgen 2024-01-18 13:32:10 -08:00
  • 4f6f68d475 formatting Bruce MacDonald 2024-01-18 16:26:35 -05:00
  • 171e22b0c4 fix lint Bruce MacDonald 2024-01-18 16:16:35 -05:00
  • 28e0293ce1 lint fix Bruce MacDonald 2024-01-18 15:59:54 -05:00
  • 94c21b5510 maintain system message in chat history Bruce MacDonald 2024-01-18 15:49:10 -05:00
  • d450fb1d1e
    Merge pull request #2055 from dhiltgen/cuda_docs Daniel Hiltgen 2024-01-18 12:07:31 -08:00
  • df40b11d03
    Merge pull request #2007 from dhiltgen/cpu_fallback Daniel Hiltgen 2024-01-18 11:32:29 -08:00
  • 9cd20b0ec8 Refine the linux cuda/rocm developer docs Daniel Hiltgen 2024-01-18 09:44:44 -08:00
  • 4e6207f888
    Merge pull request #511 from ollama-webui/tags Timothy Jaeryang Baek 2024-01-18 02:58:47 -08:00