Commit Graph

  • 888b6fbb5a
    Merge branch 'main' into readme_add_opengpa Laurent Eschenauer 2024-09-05 09:22:18 +02:00
  • cf48603943
    readme: include Enchanted for Apple Vision Pro (#4949) Augustinas Malinauskas 2024-09-05 06:30:19 +01:00
  • 6e67be09b6
    readme: add lsp-ai to community integrations (#5063) Silas Marvin 2024-09-04 22:17:34 -07:00
  • aa64a24a62 Added LSP-AI to README Silas Marvin 2024-06-15 07:08:23 -07:00
  • 0f5f060d2b
    readme: add ollama-php library to community integrations (#6361) Arda Günsüren 2024-09-05 08:01:14 +03:00
  • a2093a6410
    Merge branch 'main' into patch-1 Arda Günsüren 2024-09-05 07:52:45 +03:00
  • f6b1ac4c9a
    Merge 0e1ec461f9bef752ef18a8c42edb75f8bd6722e7 into b3554778bd72587feb5571dd5c8998f27fe47192 Josh 2024-09-05 10:11:37 +05:30
  • 085c2d4bf0 add testdata Patrick Devine 2024-09-04 17:14:23 -07:00
  • db8c944498 fix gemma2-2b conversion pdevine/geems-2b Patrick Devine 2024-09-04 16:59:23 -07:00
  • b3554778bd
    readme: add vnc-lm discord bot community integration (#6644) jk011ru 2024-09-04 19:46:02 -04:00
  • bb7117599f runner.go: Vision models are not yet supported Jesse Gross 2024-09-04 11:12:49 -07:00
  • cbb9b83b16 llama.go: Pass through additional context params Jesse Gross 2024-09-04 15:56:55 -07:00
  • bbe7b96ded
    llm: use json.hpp from common (#6642) Pascal Patry 2024-09-04 19:34:42 -04:00
  • c561e1a4dd
    Update README.md jk011ru 2024-09-04 19:27:30 -04:00
  • eb39d63659 Detect running in a container Daniel Hiltgen 2024-08-24 17:01:02 -07:00
  • 8ab4f3a38c
    Merge d2f13c177a723089d05a0aa728319404894df756 into c18ff18b2c95a8a39dcd7148c0311e8409221ec5 Jakub Burkiewicz 2024-09-04 23:48:28 +02:00
  • c18ff18b2c
    readme: add confichat to community integrations (#6378) Rune Berg 2024-09-05 09:26:02 +12:00
  • eb2448746a Add confichat to README.md Rune Berg 2024-08-16 08:15:05 +12:00
  • e58e1f250e llm: use json.hpp from common Pascal Patry 2024-09-04 15:48:23 -04:00
  • 732240af59
    Update Dockerfile nopoz 2024-09-04 12:25:14 -07:00
  • a3616aa1f6
    llama: sync llama.cpp to commit 8962422 (#6621) Jeffrey Morgan 2024-09-04 15:14:49 -04:00
  • c42bad7a9f llama: sync llama.cpp to commit 8962422 jmorganca 2024-09-03 21:23:46 -04:00
  • 133770a548
    docs: add group to manual Linux isntructions and verify service is running (#6430) Tomoya Fujita 2024-09-04 11:45:09 -07:00
  • b3a737e1cd address review comments. Tomoya Fujita 2024-09-03 10:05:37 -07:00
  • f36ebfb478
    readme: add gollm to the list of community libraries (#6099) Teïlo M 2024-09-04 20:19:41 +02:00
  • 71dd1e286c
    Update README.md Jeffrey Morgan 2024-09-04 13:54:14 -04:00
  • ff3252cfb5
    Merge branch 'main' into main Teïlo M 2024-09-04 19:28:47 +02:00
  • 5b55379651
    readme: add Cherry Studio to community integrations (#6633) 亢奋猫 2024-09-04 22:53:36 +08:00
  • 93eb43d020
    readme: add Go fun package (#6421) Mitar 2024-09-04 16:52:46 +02:00
  • a0f820cf82 Add gitlab.com/tozd/go/fun go package. Mitar 2024-08-19 04:03:22 -07:00
  • 8fce27ef0d
    readme: add Cherry Studio to community integrations 亢奋猫 2024-09-04 22:06:14 +08:00
  • 369479cc30
    docs: fix spelling error (#6391) Carter 2024-09-04 09:42:33 -04:00
  • 7d89e48f5c
    install.sh: update instructions to use WSL2 (#6450) Erkin Alp Güney 2024-09-04 16:34:53 +03:00
  • c2e3fcf59f
    Update scripts/install.sh Jeffrey Morgan 2024-09-04 09:33:24 -04:00
  • 27bcce6d9f
    readme: add claude-dev to community integrations (#6630) Sam 2024-09-04 23:32:26 +10:00
  • 7e8e2cb160
    docs(integrations): add claude-dev Sam 2024-09-04 17:48:51 +10:00
  • b9faa5e958 Add preliminary support for riscv64 Meng Zhuo 2024-09-04 14:32:48 +08:00
  • 491fc312ae
    readme: add PyOllaMx project (#6624) Viz 2024-09-03 23:10:53 -04:00
  • 1802801e69
    Update README.md Jeffrey Morgan 2024-09-03 23:07:30 -04:00
  • 24d77565e6
    Update README.md with PyOllaMx Viz 2024-09-03 23:02:37 -04:00
  • f443dd7b81 llama: sync llama.cpp to commit 8962422 jmorganca/llama-bump jmorganca 2024-09-03 21:23:46 -04:00
  • 8db94469e0 runner.go: Support GGUF LoRAs Jesse Gross 2024-08-28 17:12:06 -07:00
  • c989321509 runner.go: Don't cast a Go handle to a C void * Jesse Gross 2024-08-28 21:07:16 -07:00
  • e4a091bafd runner.go: Support resource usage command line options Jesse Gross 2024-08-28 09:29:09 -07:00
  • fd4ecd1ff5 llama: fix sync script ggml-metal_darwin_arm64.m filename (#6610) Jeffrey Morgan 2024-09-03 14:01:52 -04:00
  • 9d8129b8bb llama: delete unused files (#6523) Jeffrey Morgan 2024-08-29 17:30:11 -07:00
  • c8a1741d9b runner.go: Update TODOs Jesse Gross 2024-08-27 17:05:23 -07:00
  • 46a7c682f2 runner.go: Fix embeddings endpoint Jesse Gross 2024-08-27 13:59:33 -07:00
  • 52e88ab7b3 runner.go: Health endpoint comments Jesse Gross 2024-08-27 13:17:04 -07:00
  • 4ca8579428 runner.go: Cleanups Jesse Gross 2024-08-27 13:19:46 -07:00
  • d022cfc9e6 runner.go: Move pieces[] into sequence Jesse Gross 2024-08-27 10:24:33 -07:00
  • 6ccd0644e1 runner.go: Fix deadlock if a connection is closed during decoding Jesse Gross 2024-08-26 14:20:50 -07:00
  • 0b73cca386 runner.go: Fix resource leaks when removing sequences Jesse Gross 2024-08-26 14:26:48 -07:00
  • 55fb0633db runner.go: Separate KV cache and context sizes Jesse Gross 2024-08-23 17:27:09 -07:00
  • 53b600921e runner.go: Hold mutex for entire time when processing batch Jesse Gross 2024-08-23 16:28:38 -07:00
  • 8e1554c91d runner.go: Scale batches to be processed by numParallel Jesse Gross 2024-08-23 13:44:30 -07:00
  • f52d4b9879 Make new tokenizer logic conditional (#6395) Daniel Hiltgen 2024-08-24 17:25:37 -07:00
  • 76718ead40 runner.go: Support MinP parameter Jesse Gross 2024-08-21 16:13:54 -07:00
  • 90d25d3b0a runner.go: Check for incomplete UTF-8 character Jesse Gross 2024-08-15 13:07:28 -07:00
  • 477f529d26 runner.go: Implement RepeatLastN to penalize repeated tokens Jesse Gross 2024-08-20 11:21:19 -07:00
  • eccd4dd8d2 runner.go: Use correct JSON field names for runners Jesse Gross 2024-08-20 16:58:09 -07:00
  • 69cc5795a7 runner.go: Shift context window when KV cache space is exceeded Jesse Gross 2024-08-14 10:35:49 -07:00
  • 5a441d227a runner.go: Don't decode if nothing has been added to the batch Jesse Gross 2024-08-14 10:32:05 -07:00
  • 8aa97b5e83 llama.go: Advance though tokens when processing multiple batches Jesse Gross 2024-08-13 16:53:35 -07:00
  • 523d84c563 llama.go: Use dynamic buffer for TokenToPiece Jesse Gross 2024-08-19 17:54:57 -07:00
  • ed19fad862 llama.go: Make batch memory allocation match configuration Jesse Gross 2024-08-13 11:18:02 -07:00
  • 5d34320b7c runner.go: Fix off by one in batch size check Jesse Gross 2024-08-13 10:51:50 -07:00
  • 1c36f36c41 llm: Fix array out-of-bounds memory access when tokenizing Jesse Gross 2024-08-15 15:25:21 -07:00
  • 0c2f95f3de runner: Initialize numPredict Jesse Gross 2024-08-13 10:38:03 -07:00
  • ebdf781397 server: Fix double free on runner subprocess error. Jesse Gross 2024-08-16 14:46:33 -07:00
  • 23c7c1326e llm: Fix lint Jesse Gross 2024-08-15 09:50:42 -07:00
  • 8fe30d161c Fix filename for non darwin arm builds Daniel Hiltgen 2024-08-13 14:33:11 -07:00
  • a483a4c4ed lint jmorganca 2024-08-13 11:24:46 -07:00
  • b267ab92b0 Add missing vendor headers to ggml sync Daniel Hiltgen 2024-08-09 16:45:18 -07:00
  • 189ca38f1d Wire up native source file dependencies Daniel Hiltgen 2024-08-05 08:56:47 -07:00
  • 80db43b7b4 Bump llama sync to 1e6f65 Daniel Hiltgen 2024-08-06 16:50:34 -07:00
  • 47b0e81219 fix dolphin-mistral Daniel Hiltgen 2024-08-01 14:47:00 -07:00
  • 21947d5c1b harden integration tests Daniel Hiltgen 2024-08-01 14:41:23 -07:00
  • 751009a5d7 Runtime selection of new or old runners Daniel Hiltgen 2024-08-01 08:54:44 -07:00
  • 8527028bf4 Implement timings response in Go server Daniel Hiltgen 2024-07-29 14:09:55 -07:00
  • e0241118d0 Get embeddings working Daniel Hiltgen 2024-07-31 11:08:09 -07:00
  • f97ee8c506 Fix parallel requests Daniel Hiltgen 2024-07-31 15:02:58 -07:00
  • e9dd656ff5 Update sync with latest llama.cpp layout, and run against b3485 Daniel Hiltgen 2024-07-29 16:21:09 -07:00
  • 6c0d892498 Prefix all build artifacts with an OS/ARCH dir Daniel Hiltgen 2024-06-24 09:23:34 -07:00
  • 13348e3629 Get linux building Daniel Hiltgen 2024-06-23 12:07:41 -07:00
  • 3d5a08c315 add note in readme jmorganca 2024-06-21 16:22:27 -04:00
  • a29851bc9b clean up metal code jmorganca 2024-06-15 10:06:36 -07:00
  • 8dda9293fa fix Makefile on windows jmorganca 2024-06-20 21:52:10 -04:00
  • b3c62dcafd remove printing jmorganca 2024-06-13 18:41:12 -07:00
  • 9b8b7cd9b5 dont apply license to stb_image.h and json.hpp jmorganca 2024-06-13 14:35:11 -07:00
  • 1da6c40f4f lint jmorganca 2024-06-13 14:21:55 -07:00
  • 76ca2de06e update sync header jmorganca 2024-06-13 14:12:23 -07:00
  • 0eabc2e34d remove unused script jmorganca 2024-06-13 14:07:05 -07:00
  • dded27dcfa fix metal jmorganca 2024-06-12 12:18:40 -07:00
  • 080b600865 add header to not edit jmorganca 2024-06-12 11:40:13 -07:00
  • d6b6de9a5a add header to not edit jmorganca 2024-06-12 11:38:42 -07:00
  • 24a741424f fix build on windows jmorganca 2024-06-12 02:47:12 -04:00
  • 4d476d894e fix Makefile jmorganca 2024-06-11 23:18:07 -07:00
  • bd94ddfc56 fix README.md jmorganca 2024-06-11 22:54:45 -07:00
  • f1f54c5bd5 fix README.md jmorganca 2024-06-11 22:54:31 -07:00