Commit Graph

  • 1961a81f03 Set corret CUDA minimum compute capability version Daniel Hiltgen 2024-01-09 11:28:24 -08:00
  • f0d145da38
    Update README.md Herval Freire 2024-01-09 16:17:14 -03:00
  • 384d79170e
    Update README.md Herval Freire 2024-01-09 16:16:27 -03:00
  • 8a8c7e7f8d only build for metal on arm64 Jeffrey Morgan 2024-01-09 13:51:04 -05:00
  • 6df83e6daa update rough cuda overhead estimate to 15% + 384MiB Jeffrey Morgan 2024-01-09 11:47:30 -05:00
  • f921e2696e typo Michael Yang 2024-01-09 09:45:42 -08:00
  • 4a33cede20 remove unused fields and functions Michael Yang 2023-12-22 09:55:18 -08:00
  • f95d2f25f3 fix temporary history file permissions Michael Yang 2023-12-18 10:53:51 -08:00
  • 2b9892a808 fix(windows): modelpath and list Michael Yang 2023-12-15 15:50:51 -08:00
  • 2bb2bdd5d4 fix lint Michael Yang 2023-12-15 14:07:34 -08:00
  • acfc376efd add .golangci.yaml Michael Yang 2023-12-15 14:25:12 -08:00
  • 997253143f add lint and test on pull_request Michael Yang 2023-12-15 11:33:52 -08:00
  • 62023177f6
    Merge pull request #1614 from jmorganca/mxyng/fix-set-template Michael Yang 2024-01-09 09:36:24 -08:00
  • 940a536f81
    Update README.md Justin Hayes 2024-01-09 10:28:44 -05:00
  • db51c22093
    refactor(run-ollama-docker.sh): update with port variables Tanvir 2024-01-09 13:46:07 +06:00
  • fd33f68f28
    refactor(run.sh): update with port variables Tanvir 2024-01-09 13:43:55 +06:00
  • 8bb68cfcae
    refactor: update run.sh Tanvir 2024-01-09 13:33:20 +06:00
  • f87a1c9543
    refactor: update run-ollama-docker.sh Tanvir 2024-01-09 13:30:58 +06:00
  • 93c73d355b
    chore(docker): update ollama docker image tag to latest Tanvir 2024-01-09 13:28:35 +06:00
  • 0c0c5955c6
    feat(docker): add option to run ollama w/o GPU support Tanvir 2024-01-09 13:27:39 +06:00
  • e6e05d4641
    chore: add shebang to shell scripts Tanvir 2024-01-09 13:21:36 +06:00
  • 6164f378f2 revert cuda overhead to 20% Jeffrey Morgan 2024-01-09 00:54:25 -05:00
  • f387e9631b use runner if cuda alloc won't fit Jeffrey Morgan 2024-01-09 00:44:34 -05:00
  • 6566387ae3 add TODO for cuda overhead Jeffrey Morgan 2024-01-09 00:28:03 -05:00
  • 37708931fb update cuda overhead to 20% to fix crashes when switching between models and large context sizes Jeffrey Morgan 2024-01-09 00:05:23 -05:00
  • f6cb0a553c update cuda overhead to 15% or 400MiB Jeffrey Morgan 2024-01-08 23:45:45 -05:00
  • 2680078c13 fix build on linux Jeffrey Morgan 2024-01-08 23:44:13 -05:00
  • f1b7e5f560 update overhead to 15% Jeffrey Morgan 2024-01-08 23:37:45 -05:00
  • cb534e6ac2 use 10% vram overhead for cuda Jeffrey Morgan 2024-01-08 23:17:44 -05:00
  • 58ce2d8273 better estimate scratch buffer size Jeffrey Morgan 2024-01-08 21:32:44 -05:00
  • d71e3c9d82
    Merge pull request #431 from ollama-webui/regenerate-shortcut Timothy Jaeryang Baek 2024-01-08 18:10:43 -08:00
  • cae1bfe737 feat: regenerate keyboard shortcut Timothy J. Baek 2024-01-08 18:10:23 -08:00
  • 18ddf6d57d fix windows build Jeffrey Morgan 2024-01-08 20:04:01 -05:00
  • 61e6502449
    Merge pull request #1818 from jmorganca/mxyng/fix-alt-prompt Michael Yang 2024-01-08 13:48:34 -08:00
  • 08f1e18965
    Offload layers to GPU based on new model size estimates (#1850) Jeffrey Morgan 2024-01-08 16:42:00 -05:00
  • 591fd58bce address comments Jeffrey Morgan 2024-01-08 15:02:45 -05:00
  • 8ab3b0b9c8 fix build error on linux Jeffrey Morgan 2024-01-08 15:00:12 -05:00
  • 5c55808f4a
    Update llm/llm.go Jeffrey Morgan 2024-01-08 14:45:21 -05:00
  • 0b8e9ab6d7 add overhead for cuda memory Jeffrey Morgan 2024-01-08 14:43:08 -05:00
  • 0e4930719b
    Update llm/llm.go Jeffrey Morgan 2024-01-08 14:43:51 -05:00
  • 889aa5b8f2
    Update llm/llm.go Jeffrey Morgan 2024-01-08 14:43:31 -05:00
  • 7e8f7c8358
    remove ggml automatic re-pull (#1856) Bruce MacDonald 2024-01-08 14:41:01 -05:00
  • 3f3eb19a3b
    document response in modelfile template variables (#1428) Bruce MacDonald 2024-01-08 14:38:51 -05:00
  • b9b286f9dc document response in modelfile template variables Bruce MacDonald 2023-12-07 17:01:04 -08:00
  • 20a5803c52
    Update gpu/gpu_darwin.go Jeffrey Morgan 2024-01-08 14:18:21 -05:00
  • 3cc91b6276 better estmation for graph alloc Jeffrey Morgan 2024-01-08 14:14:47 -05:00
  • f63003254c dont load +1 layers Jeffrey Morgan 2024-01-08 13:27:44 -05:00
  • 59faadaa63
    Merge bd4b2f16a3ecdb165222dedbedc64ad151ea9856 into 059ae4585e100bb78b4769d759e87c2704f06925 unglazed276 2024-01-08 18:25:24 +01:00
  • bd4b2f16a3
    Make modelfile.md less confusing unglazed276 2024-01-08 18:23:36 +01:00
  • ac302497db remove ggml automatic re-pull Bruce MacDonald 2024-01-08 12:06:05 -05:00
  • 94993b8fd1 feat: support ~/.ollama/origins as config for CORS Ryan Scott Brown 2024-01-07 17:55:47 -05:00
  • 31976b82ee
    Update README.md Stefano Fiorucci 2024-01-08 15:46:10 +01:00
  • 72ba02cac2 forgot api.md janpf 2024-01-08 13:15:18 +01:00
  • 351638ad95 added working n_probs pass through janpf 2024-01-08 13:13:32 +01:00
  • f79ca91741
    Merge pull request #428 from ollama-webui/doc-update Timothy Jaeryang Baek 2024-01-08 02:26:52 -08:00
  • 679590f60c doc: roadmap update Timothy J. Baek 2024-01-08 02:26:28 -08:00
  • 81d26b9485
    Merge pull request #427 from ollama-webui/documents Timothy Jaeryang Baek 2024-01-08 02:12:29 -08:00
  • ad36768c42 doc: wording Timothy J. Baek 2024-01-08 02:12:08 -08:00
  • 594424743b doc: update Timothy J. Baek 2024-01-08 02:11:42 -08:00
  • f6a5d4b063
    Merge pull request #424 from ollama-webui/documents Timothy Jaeryang Baek 2024-01-08 02:05:19 -08:00
  • edceeba1b0 feat: help message added Timothy J. Baek 2024-01-08 02:04:20 -08:00
  • b858b9b236 feat: edit doc modal styling Timothy J. Baek 2024-01-08 01:57:09 -08:00
  • 78c493cfa0 fix: exec into container instead of using the API Hidayat Hamir 2024-01-08 16:56:44 +07:00
  • 1ff3eb8038 feat: doc upload error handling Timothy J. Baek 2024-01-08 01:54:03 -08:00
  • eddb6fc7b7 feat: documents backend integration Timothy J. Baek 2024-01-08 01:49:20 -08:00
  • fe997abc6d feat: transform filename to name Timothy J. Baek 2024-01-08 01:32:55 -08:00
  • 54c4e0761a feat: documents file upload Timothy J. Baek 2024-01-08 01:26:15 -08:00
  • df3d95bf2a refac: message drag file input Timothy J. Baek 2024-01-08 01:12:02 -08:00
  • cc3f84f916 feat: # to import doc Timothy J. Baek 2024-01-07 23:43:32 -08:00
  • 2603ac30bc feat: documents Timothy J. Baek 2024-01-07 22:14:08 -08:00
  • 34e5445424
    Merge pull request #423 from ollama-webui/dev Timothy Jaeryang Baek 2024-01-07 21:41:57 -08:00
  • 02f364bfd9 fix: dockerfile Timothy J. Baek 2024-01-07 21:22:37 -08:00
  • 706a8d4f8d fix Timothy J. Baek 2024-01-07 21:14:20 -08:00
  • 34b0802cac fix: docker Timothy J. Baek 2024-01-07 20:55:32 -08:00
  • fe587c87f7 fix: docker Timothy J. Baek 2024-01-07 20:52:15 -08:00
  • 87326a3157 fix: docker Timothy J. Baek 2024-01-07 20:50:09 -08:00
  • 2188377cba always account for scratch vram Jeffrey Morgan 2024-01-07 23:46:54 -05:00
  • 0345a80033 feat: include embedding weight to image Timothy J. Baek 2024-01-07 20:21:32 -08:00
  • 54424ce376
    Setup test Justin D. Harris 2024-01-08 04:19:47 +00:00
  • 9c44331663
    Merge pull request #422 from ollama-webui/dev Timothy Jaeryang Baek 2024-01-07 20:14:41 -08:00
  • 162641ee43 Update requirements.txt Timothy J. Baek 2024-01-07 19:54:52 -08:00
  • 323b475154
    Merge pull request #421 from ollama-webui/main Timothy Jaeryang Baek 2024-01-07 19:54:10 -08:00
  • a7fd2b5109 select layers based on estimated model memory usage Jeffrey Morgan 2024-01-07 21:47:10 -05:00
  • 3c8fe999c8
    Update README.md - Community Integrations - vscode, Sublime Text, CLI and Ruby icebaker 2024-01-07 10:31:39 -03:00
  • 2926b98e44
    Merge pull request #419 from ollama-webui/doc-update Timothy Jaeryang Baek 2024-01-07 13:59:11 -08:00
  • 32e4953e00 doc: wording Timothy J. Baek 2024-01-07 13:58:57 -08:00
  • b1d9d511a1
    Merge pull request #418 from ollama-webui/dev Timothy Jaeryang Baek 2024-01-07 13:56:41 -08:00
  • 57c050326c feat: docx support Timothy J. Baek 2024-01-07 13:56:01 -08:00
  • 059ae4585e
    Merge pull request #1834 from dhiltgen/old_cuda Daniel Hiltgen 2024-01-07 10:39:49 -08:00
  • 7cf53fc20b Workaround memory memory limitations Daniel Hiltgen 2024-01-07 10:11:45 -08:00
  • bc126f2f53
    Merge pull request #417 from ollama-webui/dev Timothy Jaeryang Baek 2024-01-07 09:46:17 -08:00
  • 537a7f5f00 fix Timothy J. Baek 2024-01-07 09:46:00 -08:00
  • 1dad423911
    Merge pull request #416 from ollama-webui/dev Timothy Jaeryang Baek 2024-01-07 09:38:41 -08:00
  • 9a63376e55 feat: file upload error handling Timothy J. Baek 2024-01-07 09:33:34 -08:00
  • b37b157638 feat: reset vectordb storage support Timothy J. Baek 2024-01-07 09:15:45 -08:00
  • d4b2578f6e feat: rag csv support Timothy J. Baek 2024-01-07 09:05:52 -08:00
  • 6347f501ca
    Merge pull request #1828 from dhiltgen/fix_llava Daniel Hiltgen 2024-01-07 09:05:46 -08:00
  • d6a1bf1406 refac: file upload Timothy J. Baek 2024-01-07 09:00:30 -08:00
  • ffd0a5a2a0 Update main.py Timothy J. Baek 2024-01-07 08:34:05 -08:00
  • c68bb3b950 docker: slim Timothy J. Baek 2024-01-07 08:28:35 -08:00