Commit Graph

  • 0959e471ad DO NOT MERGE - rebase on main - Fix incremental builds on linux (#6780) Daniel Hiltgen 2024-09-13 08:24:08 -07:00
  • e302027cc4 updates readme for rag minicheck Ryan Marten 2024-09-13 12:13:46 -07:00
  • 01ee107fdb adds rag example using minicheck to check grounded claims Ryan Marten 2024-09-13 11:58:49 -07:00
  • a1f917dbc7 add simple example for grounded factuality checking with bespoke-minicheck Ryan Marten 2024-09-13 09:41:10 -07:00
  • 56b9af336a
    Fix incremental builds on linux (#6780) Daniel Hiltgen 2024-09-13 08:24:08 -07:00
  • de538993a4
    readme: add Obsidian Quiz Generator plugin to community integrations Edward Cui 2024-09-13 00:35:13 -07:00
  • 8b0c677b43
    add Agents-Flex Libraries in README.md Michael Yang 2024-09-13 14:36:45 +08:00
  • 47ebc1a29b
    added more redirect codes Tobias Heinze 2024-09-13 07:35:39 +02:00
  • 7359c5ea5e usage templating mxyng/environ-2 Michael Yang 2024-07-05 15:26:42 -07:00
  • 220108d3f4 openai: support include_usage stream option to return final usage chunk Anuraag Agrawal 2024-09-13 12:24:43 +09:00
  • d03328b6ef Fix incremental builds on linux Daniel Hiltgen 2024-09-12 17:23:42 -07:00
  • e15bf559f3 DO NOT MERGE Use GOARCH for build dirs Daniel Hiltgen 2024-09-12 15:50:45 -07:00
  • fda0d3be52
    Use GOARCH for build dirs (#6779) Daniel Hiltgen 2024-09-12 16:38:05 -07:00
  • ff1e356e75 Use GOARCH for build dirs Daniel Hiltgen 2024-09-12 15:50:45 -07:00
  • ab4e7ea93c DO NOT MERGE - rebase on main instead - Optimize container images for startup (#6547) Daniel Hiltgen 2024-09-12 12:10:30 -07:00
  • cd5c8f6471
    Optimize container images for startup (#6547) Daniel Hiltgen 2024-09-12 12:10:30 -07:00
  • e0e6098599 Use docker buildx action for release Daniel Hiltgen 2024-09-11 13:48:20 -07:00
  • 63f78efd11 Converge to buildx based helper scripts Daniel Hiltgen 2024-09-11 09:37:22 -07:00
  • 1ea2994d42 Review comments Daniel Hiltgen 2024-09-04 11:29:56 -07:00
  • 9679377601 Notify systemd that ollama server is ready wujing 2024-09-12 21:15:58 +08:00
  • 0f9416f5f6
    Merge f1aa5c7565585015d90da67d28ffb8c086801407 into fef257c5c50347943bb1e5e06ebb5e22fd9b69a0 zhong 2024-09-12 10:06:13 +08:00
  • b04d8e9119
    Merge 732240af5992a3a958d044c141daa5ccb45adfa2 into fef257c5c50347943bb1e5e06ebb5e22fd9b69a0 nopoz 2024-09-12 10:05:17 +08:00
  • fef257c5c5
    examples: updated requirements.txt for privategpt example dcasota 2024-09-12 03:56:56 +02:00
  • 43fe5ba1d7
    Merge 2a00ff7408a81c4e01b9e47ff054749af4c96439 into d066d9b8e0995bbc2107791068892b61b81789cf Hernan Martinez 2024-09-11 18:55:58 -07:00
  • d066d9b8e0
    examples: polish loganalyzer example (#6744) Adrian Cole 2024-09-12 09:37:37 +08:00
  • 5a00dc9fc9
    readme: add ollama_moe to community integrations (#6752) RAPID ARCHITECT 2024-09-11 20:36:26 -05:00
  • c354e87809
    Merge pull request #6767 from ollama/jessegross/bug_6707 Jesse Gross 2024-09-11 17:20:22 -07:00
  • 93ac3760cb runner: Flush pending responses before returning Jesse Gross 2024-09-11 14:00:20 -07:00
  • 42550883e5
    Merge pull request #6765 from ollama/jessegross/bug_6707 Jesse Gross 2024-09-11 16:38:25 -07:00
  • abed273de3
    add "stop" command (#6739) Patrick Devine 2024-09-11 16:36:21 -07:00
  • 6a05cd715b documentation for stopping a model Patrick Devine 2024-09-11 16:02:45 -07:00
  • e5845d9690 runner: Flush pending responses before returning Jesse Gross 2024-09-11 14:00:20 -07:00
  • 034392624c
    Merge pull request #6762 from ollama/mxyng/show-output Michael Yang 2024-09-11 14:58:40 -07:00
  • 7467a29dfb runner.go: Move check for incomplete Unicode after stop Jesse Gross 2024-09-11 14:08:15 -07:00
  • a217d26417 runner.go: Pass back StoppedLimit to Ollama Jesse Gross 2024-09-11 14:34:08 -07:00
  • ecab6f1cc5 refactor show ouput Michael Yang 2024-09-11 11:01:30 -07:00
  • 9369836e84 Move payloads around Daniel Hiltgen 2024-08-27 20:02:45 -07:00
  • 4d75c9b477 Refactor payload logic and add buildx support for faster builds Daniel Hiltgen 2024-08-26 14:31:45 -07:00
  • c845b84884 Optimize container images for startup Daniel Hiltgen 2024-08-23 17:40:20 -07:00
  • 7d6900827d
    readme: add QodeAssist to community integrations (#6754) Petr Mironychev 2024-09-11 22:19:49 +02:00
  • 6778270485 WIP - CI steps for container image Daniel Hiltgen 2024-09-11 10:59:22 -07:00
  • 9246e6dd15
    Verify permissions for AMD GPU (#6736) Daniel Hiltgen 2024-09-11 11:38:25 -07:00
  • e8ecb06839 Converge to buildx based helper scripts Daniel Hiltgen 2024-09-11 09:37:22 -07:00
  • 7302a4cf28 Verify permissions for AMD GPU Daniel Hiltgen 2024-09-10 09:50:12 -07:00
  • 77ddaa155e Added QodeAssist link Petr Mironychev 2024-09-11 15:19:52 +02:00
  • 0d2bb4f0e4
    Update README.md RAPID ARCHITECT 2024-09-11 06:56:42 -05:00
  • f82ed23f56
    Merge 30823ec925cf21fcb82ed60a96db089719a82b87 into 735a0ca2e480b40fc714751b73848c08cf4eed43 royjhan 2024-09-11 17:01:45 +08:00
  • c9f9020699 Polish loganalyzer example Adrian Cole 2024-09-11 15:10:28 +08:00
  • 07dae7b4a3 Fixed no redirect URL scenario when downloading blobs wujing 2024-09-11 14:19:54 +08:00
  • dfaf43c00c comments Patrick Devine 2024-09-10 20:59:25 -07:00
  • a205b7f6ec
    Merge pull request #6735 from ollama/jessegross/prompt_cache Jesse Gross 2024-09-10 20:45:00 -07:00
  • 0273dfe0e7 runner.go: Prompt caching Jesse Gross 2024-08-29 17:21:28 -07:00
  • 0fbe7df8ce add "stop" command Patrick Devine 2024-07-31 16:49:08 -07:00
  • 735a0ca2e4
    Merge pull request #6732 from ollama/mxyng/debug-proxy Michael Yang 2024-09-10 16:13:25 -07:00
  • 93407bd6ef Quiet down dockers new lint warnings (#6716) Daniel Hiltgen 2024-09-09 17:22:20 -07:00
  • dddb72e084 add *_proxy for debugging Michael Yang 2024-09-10 09:36:42 -07:00
  • 5358d8e88b fix: Move to debug logging Gabe Goodhart 2024-07-24 07:27:43 -06:00
  • 17604d1c5d test: Unit tests for client/server in all flavors of (m)TLS config Gabe Goodhart 2024-07-23 16:28:12 -06:00
  • 4d4d4f53e1 test: Move TLS test data gen into a test package Gabe Goodhart 2024-07-23 15:32:03 -06:00
  • ed1bb03660 feat: Split blocking server.Serve into non-blocking and blocking Gabe Goodhart 2024-07-23 15:31:23 -06:00
  • 0cc78919f0 feat: Add mTLS config parsing and usage to the client side Gabe Goodhart 2024-08-01 14:08:10 -06:00
  • f42d8d1dd7 test: Unit tests for TLS config Gabe Goodhart 2024-08-01 14:06:32 -06:00
  • c4a6b1a11e feat: Set up serving with (m)TLS Gabe Goodhart 2024-07-22 13:39:36 -06:00
  • 04ebafc6f4 feat: Add validation logic for host scheme vs TLS config Gabe Goodhart 2024-07-22 13:37:57 -06:00
  • cdc5e69704 feat: Add config plumbing for TLS server options Gabe Goodhart 2024-08-01 13:55:20 -06:00
  • 8de8a80e34
    Merge 04b188bca4ce0331146a83fddee870036dd4f453 into 83a9b5271a68c7d1f8443f91c8d8b7d24ab581a9 Yash Parmar 2024-09-10 14:55:51 +02:00
  • 83a9b5271a
    docs: update examples to use llama3.1 (#6718) Jeffrey Morgan 2024-09-09 22:47:16 -07:00
  • 416fe1507c docs: update examples to use llama3.1 jmorganca 2024-09-09 15:22:07 -07:00
  • 65e0e0a9e8 Review comments Daniel Hiltgen 2024-09-04 11:29:56 -07:00
  • 31cb07b79e Move payloads around Daniel Hiltgen 2024-08-27 20:02:45 -07:00
  • 391c0ec0d6 Refactor payload logic and add buildx support for faster builds Daniel Hiltgen 2024-08-26 14:31:45 -07:00
  • f2b607c26b Optimize container images for startup Daniel Hiltgen 2024-08-23 17:40:20 -07:00
  • 4a8069f9c4
    Quiet down dockers new lint warnings (#6716) Daniel Hiltgen 2024-09-09 17:22:20 -07:00
  • 84b84ce2db
    catch when model vocab size is set correctly (#6714) Patrick Devine 2024-09-09 17:18:54 -07:00
  • 7be6019bd7 Fix go lint regression Daniel Hiltgen 2024-09-09 17:17:57 -07:00
  • fdcbf0bfeb Quiet down dockers new lint warnings Daniel Hiltgen 2024-09-09 14:40:40 -07:00
  • e3d43e373b catch when model vocab size is set correctly Patrick Devine 2024-09-09 14:17:39 -07:00
  • b342349b62
    Merge branch 'main' into patch-1 Ramiro Gómez 2024-09-09 22:54:50 +02:00
  • 4869971070
    Merge branch 'main' into patch-1 Deep Lakhani 2024-09-09 10:34:01 -04:00
  • 69a237acd0 readme: add crewAI to community integrations (#6699) Jeffrey Morgan 2024-09-08 00:36:24 -07:00
  • 36ea740d20 readme: add crewAI with mesop to community integrations RAPID ARCHITECT 2024-09-08 02:35:59 -05:00
  • 8688d2d968 openai: align chat temperature and frequency_penalty options with completion (#6688) frob 2024-09-07 18:08:08 +02:00
  • 1fd71f9ced docs: improve linux install documentation (#6683) Jeffrey Morgan 2024-09-06 22:05:37 -07:00
  • ba423b02e9 openai: don't scale temperature or frequency_penalty (#6514) Yaroslav 2024-09-07 02:45:45 +02:00
  • 1a0c50446f readme: add Archyve to community integrations (#6680) nickthecook 2024-09-06 17:06:01 -04:00
  • c51e3210af readme: add Plasmoid Ollama Control to community integrations (#6681) imoize 2024-09-07 04:04:12 +07:00
  • 6634871ad5 Improve logging on GPU too small (#6666) Daniel Hiltgen 2024-09-06 08:29:36 -07:00
  • bdb96b6d05 openai: fix "presence_penalty" typo and add test (#6665) frob 2024-09-06 10:16:28 +02:00
  • 66c8dec772 Fix gemma2 2b conversion (#6645) Patrick Devine 2024-09-05 17:02:28 -07:00
  • 156ad63066 Document uninstall on windows (#6663) Daniel Hiltgen 2024-09-05 15:57:38 -07:00
  • b9c21e78a0 Revert "Detect running in a container (#6495)" (#6662) Daniel Hiltgen 2024-09-05 14:26:00 -07:00
  • 8290da5144 llm: make load time stall duration configurable via OLLAMA_LOAD_TIMEOUT Daniel Hiltgen 2024-09-05 14:00:08 -07:00
  • f29ab37191 Introduce GPU Overhead env var (#5922) Daniel Hiltgen 2024-09-05 13:46:35 -07:00
  • 4adb9e121e Detect running in a container (#6495) Daniel Hiltgen 2024-09-05 13:24:51 -07:00
  • 0111d1f259 llama3.1 memory Michael Yang 2024-08-08 11:18:13 -07:00
  • 54f69c3570 readme: add AiLama to the list of community integrations (#4957) Zeyo 2024-09-06 01:40:44 +05:30
  • 08e2453459 Update gpu.md: Add RTX 3050 Ti and RTX 3050 Ti (#5888) Michael 2024-09-05 12:24:26 -06:00
  • 786461d697 server: fix blob download when receiving a 200 response (#6656) Tobias Heinze 2024-09-05 19:48:26 +02:00
  • 7d269c3303 readme: add Gentoo package manager entry to community integrations (#5714) Vitaly Zdanevich 2024-09-05 20:58:14 +04:00
  • b3159eb8a8 Update install.sh:Replace "command -v" with encapsulated functionality (#6035) 王卿 2024-09-06 00:49:48 +08:00