ollama

norohind/ollama

Fork 0

forked from third-party-mirrors/ollama

Commit Graph

Select branches

Hide Pull Requests

api

bmizerany/client-registry

bmizerany/embedspeedup

bmizerany/fastverify

bmizerany/filepathnobuild

bmizerany/filepathwithcoloninhost

bmizerany/hrm

bmizerany/mixedcasepullsandpushes

bmizerany/modenameenforcealphanum

bmizerany/noseek

bmizerany/nosillyggufslurps

bmizerany/replacecolon

bmizerany/types/model/defaultfix

bmizerany/x

bruce/iq-quants

brucemacd/allow-ollama

brucemacd/api-doc-formatting

brucemacd/community-docs

brucemacd/concurrent-fail

brucemacd/default-param-tag

brucemacd/encode

brucemacd/ignore-debug

brucemacd/lib-wpath

brucemacd/llama-mem-calc

brucemacd/mem-calc

brucemacd/models_dir_tilde

brucemacd/no-at-create

brucemacd/no-move-prompt-path

brucemacd/openai-chat

brucemacd/remove-ggml-runner

brucemacd/token-gen-timeout

brucemacd/tokenize

brucemacd/update-image-dep

brucemacd/use-req-model-chat

brucemacd/user-template

build_dist

cgo

cp-model

cuda-search

delete-fix

deletemodels

dhiltgen/remove_submodule

distribution

editor

fix-model-names

fix-unknown-model

format-config

go-opts

insecure-registry

jessegross/sample

jessegross/sem

jessegross/whitespace

jmorganca/batch-embeddings

jmorganca/done-reason

jmorganca/enable-fa

jmorganca/execstack

jmorganca/ga

jmorganca/ggml-static

jmorganca/if-none-match

jmorganca/initcmake

jmorganca/limit

jmorganca/llama-bump

jmorganca/llama-cpp-7c26775

jmorganca/llama-cpp-8960fe8

jmorganca/llama-vit

jmorganca/mllama

jmorganca/mm

jmorganca/native

jmorganca/no-error-template

jmorganca/openai-context

jmorganca/options

jmorganca/replace-assets

jmorganca/template-mistral

jmorganca/testing

jyan/auth

jyan/convert-cmdr

jyan/convert-prog

jyan/format

jyan/local

jyan/local2

jyan/ollama-v

jyan/p2

jyan/paligemma

jyan/palitest

jyan/parse-temp

jyan/progress

jyan/q4_4/8

jyan/quant3

jyan/quant4

jyan/quant5

jyan/reord-g

jyan/v0.146

language_support

license-layers

list-models

ls

main

matt/examplemodelfiles

matt/streamingapi

mattw/airenamer

mattw/allmodelsonhuggingface

mattw/communitylinks

mattw/faq-context

mattw/howtoquant

mattw/noprune

mattw/python-functioncalling

mattw/quantcontext

mattw/selfqueryingretrieval

mattw/whatneedstorun

modelfile-readme

modelpath

modenameenforcealphanum

mxyng/api-models

mxyng/create-context

mxyng/environ-2

mxyng/extra-args

mxyng/fix-cli-paste

mxyng/fix-memory

mxyng/fix-status-code

mxyng/func-checks

mxyng/gin-slog

mxyng/install

mxyng/layers-from-files

mxyng/mllama

mxyng/modelname-5

mxyng/modelname-6

mxyng/modelname-7

mxyng/modelname-8

mxyng/server-timestamp

mxyng/split-bin

mxyng/sync

mxyng/tune-concurrency

mxyng/update-registry-domain

native

nogogen

ollama.com

paligemma-support

pdevine/fix-template

pdevine/geems-2b

pdevine/ggla

pdevine/imageproc-redux

pdevine/import-docs

pdevine/newlines

pdevine/parserlines

pdevine/ps-glitches

pdevine/showggmlinfo

progressbar

pulse

readme-updates

remove-first

rename

revert-5963-revert-5924-mxyng/llama3.1-rope

revert-991-brucemacd/history-api

rmdisplaylong

roy-embed-parallel

royh-embed-parallel

royh-imgembed

royh-ls

royh-name

royh-openai-delete

royh-openai-suffixdocs

royh-params

royh-precision

royh-show-rigid

royh-testdelete

royh/embed-viz

royh/ep-methods

royh/stream-tools

royh/whisper

scratch

shell

skip-list

timeout

update-nous-hermes

upgrade-all

upload-progress

whitespace-detection

v0.0.1

v0.0.10

v0.0.11

v0.0.12

v0.0.13

v0.0.14

v0.0.15

v0.0.16

v0.0.17

v0.0.18

v0.0.19

v0.0.2

v0.0.20

v0.0.21

v0.0.3

v0.0.4

v0.0.5

v0.0.6

v0.0.7

v0.0.8

v0.0.9

v0.1.0

v0.1.1

v0.1.10

v0.1.11

v0.1.12

v0.1.13

v0.1.14

v0.1.15

v0.1.16

v0.1.17

v0.1.18

v0.1.19

v0.1.2

v0.1.20

v0.1.21

v0.1.22

v0.1.23

v0.1.24

v0.1.25

v0.1.26

v0.1.27

v0.1.28

v0.1.29

v0.1.3

v0.1.30

v0.1.31

v0.1.32

v0.1.32-rc1

v0.1.32-rc2

v0.1.33

v0.1.33-rc1

v0.1.33-rc2

v0.1.33-rc3

v0.1.33-rc4

v0.1.33-rc5

v0.1.33-rc6

v0.1.33-rc7

v0.1.34

v0.1.34-rc1

v0.1.35

v0.1.35-rc1

v0.1.36

v0.1.37

v0.1.38

v0.1.39

v0.1.39-rc1

v0.1.39-rc2

v0.1.4

v0.1.40

v0.1.40-rc1

v0.1.41

v0.1.42

v0.1.43

v0.1.44

v0.1.45

v0.1.45-rc1

v0.1.45-rc2

v0.1.45-rc3

v0.1.45-rc4

v0.1.45-rc5

v0.1.46

v0.1.47

v0.1.48

v0.1.49-rc1

v0.1.49-rc10

v0.1.49-rc11

v0.1.49-rc12

v0.1.49-rc13

v0.1.49-rc14

v0.1.49-rc2

v0.1.49-rc3

v0.1.49-rc4

v0.1.49-rc5

v0.1.49-rc6

v0.1.49-rc7

v0.1.49-rc8

v0.1.49-rc9

v0.1.5

v0.1.6

v0.1.7

v0.1.8

v0.1.9

v0.2.0

v0.2.1

v0.2.2

v0.2.2-rc1

v0.2.2-rc2

v0.2.3

v0.2.4

v0.2.5

v0.2.6

v0.2.7

v0.2.8

v0.2.8-rc1

v0.2.8-rc2

v0.3.0

v0.3.1

v0.3.10

v0.3.10-rc1

v0.3.11

v0.3.11-rc1

v0.3.11-rc2

v0.3.11-rc3

v0.3.11-rc4

v0.3.12

v0.3.12-rc1

v0.3.12-rc2

v0.3.12-rc3

v0.3.12-rc4

v0.3.12-rc5

v0.3.13

v0.3.14

v0.3.14-rc0

v0.3.2

v0.3.3

v0.3.4

v0.3.5

v0.3.6

v0.3.7

v0.3.7-rc1

v0.3.7-rc2

v0.3.7-rc3

v0.3.7-rc4

v0.3.7-rc5

v0.3.7-rc6

v0.3.8

v0.3.9

v0.4.0

v0.4.0-ci3

v0.4.0-rc0

v0.4.0-rc1

v0.4.0-rc2

v0.4.0-rc3

v0.4.0-rc4

v0.4.0-rc5

v0.4.0-rc6

v0.4.0-rc7

v0.4.0-rc8

v0.4.1

v0.4.1-rc0

51214ddef5 Update docs/openai.md royjhan 2024-07-08 14:46:05 -07:00
b950d749a9 Update docs/openai.md royjhan 2024-07-08 14:44:16 -07:00
3702ed7532 token bug corrected Roy Han 2024-07-03 15:03:54 -07:00
6266603b17 Update docs Roy Han 2024-06-26 14:30:28 -07:00
c279f96371 remove ToolCall from GenerateResponse Michael Yang 2024-07-16 14:51:19 -07:00
23d5beeb9c input royh-imgembed Roy Han 2024-07-16 15:19:13 -07:00
499e87c9ba

Merge pull request #5730 from ollama/mxyng/cleanup Michael Yang 2024-07-16 14:42:13 -07:00
cd0853f2d5

Merge pull request #5207 from ollama/mxyng/suffix Michael Yang 2024-07-16 14:37:32 -07:00
d290e87513 add suffix support to generate endpoint Michael Yang 2024-06-20 19:13:36 -07:00
97c20ede33

README: Added AI Studio to the list of UIs (#5721) Thorsten Sommer 2024-07-16 23:24:27 +02:00
703ecccc6b clean Josh Yan 2024-07-16 14:17:44 -07:00
873f334783 IT WORKS Josh Yan 2024-07-16 14:12:07 -07:00
5a83f79afd remove unneeded tool calls Michael Yang 2024-07-16 13:48:38 -07:00
987dbab0b0

OpenAI: /v1/embeddings compatibility (#5285) royjhan 2024-07-16 13:36:08 -07:00
fa49bfc0bd FIXED TESTS Josh Yan 2024-07-16 12:14:10 -07:00
a8388beb94

Merge pull request #5726 from ollama/mxyng/tools-templates Michael Yang 2024-07-16 12:12:10 -07:00
5afbb60fc4 fix unmarshal type errors Michael Yang 2024-07-16 09:38:46 -07:00
fc1b3ee9bf test Josh Yan 2024-07-16 11:21:13 -07:00
4cb5d7decc

server: omit model system prompt if empty (#5717) Jeffrey Morgan 2024-07-16 11:09:00 -07:00
8eac50dd4f

Merge pull request #5684 from ollama/mxyng/tests Michael Yang 2024-07-16 09:44:45 -07:00
4a565cbf94 add chat and generate tests with mock runner Michael Yang 2024-07-13 17:46:24 -07:00
64039df6d7

Merge pull request #5284 from ollama/mxyng/tools Michael Yang 2024-07-15 18:03:37 -07:00
7ac6d462ec

server: return empty slice on empty /api/embed request (#5713) Jeffrey Morgan 2024-07-15 17:39:44 -07:00
ef5136a745 tools test Michael Yang 2024-07-15 12:17:38 -07:00
8288ec8824

Merge pull request #5710 from dhiltgen/rocm_bump Daniel Hiltgen 2024-07-15 15:32:18 -07:00
d02bbebb11 tools Michael Yang 2024-06-20 13:45:47 -07:00
224337b32f Bump linux ROCm to 6.1.2 Daniel Hiltgen 2024-07-15 15:10:22 -07:00
25be20949c test Josh Yan 2024-07-15 15:08:24 -07:00
9e35d9bbee

server: lowercase roles for compatibility with clients (#5695) Jeffrey Morgan 2024-07-15 13:55:57 -07:00
b9f5e16c80

Introduce /api/embed endpoint supporting batch embedding (#5127) royjhan 2024-07-15 12:14:24 -07:00
eb7cc2d1ce image embeddings Roy Han 2024-07-15 12:13:06 -07:00
903e9df46f test Josh Yan 2024-07-15 11:46:49 -07:00
907b038ff0 reduce error footprint royh-embed-parallel Roy Han 2024-07-15 10:57:01 -07:00
8476ef2bd8 atomic for race jyan/quant4 Josh Yan 2024-07-15 10:44:35 -07:00
766ca1cd7d clean up Roy Han 2024-07-15 09:54:19 -07:00
40c0f9612e unneccesary Josh Yan 2024-07-14 18:41:16 -07:00
e9f7f36029

Support image input for OpenAI chat compatibility (#5208) royjhan 2024-07-13 22:07:45 -07:00
057d31861e

remove template (#5655) Patrick Devine 2024-07-13 20:56:24 -07:00
f7ee012300 server: prepend system message in chat handler v0.2.5 jmorganca 2024-07-13 15:08:00 -07:00
1ed0aa8fea

server: fix context, load_duration and total_duration fields (#5676) v0.2.4 Jeffrey Morgan 2024-07-13 09:25:31 -07:00
ef98803d63

llm: looser checks for minimum memory (#5677) Jeffrey Morgan 2024-07-13 09:20:05 -07:00
02fea420e5

Add Kerlig AI, an app for macOS (#5675) Jarek 2024-07-13 17:33:46 +02:00
22c5451fc2

fix system prompt (#5662) v0.2.3 Michael Yang 2024-07-12 21:04:44 -07:00
f08ac71a3b more clean up Roy Han 2024-07-12 17:30:30 -07:00
424f3f81a9 clean up Roy Han 2024-07-12 17:28:08 -07:00
15a0215203 running Josh Yan 2024-07-12 16:49:57 -07:00
1f73889f34

Merge branch 'royh-batchembed' into royh-embed-parallel royjhan 2024-07-12 16:44:12 -07:00
7e313e5964 remove redundant error check Roy Han 2024-07-12 16:37:29 -07:00
5a8f8e96e0 clean up Roy Han 2024-07-12 16:35:25 -07:00
7cddd6d741 parallelized Roy Han 2024-07-12 16:08:12 -07:00
ebc529cbb3 autodetect stop parameters from template Michael Yang 2024-07-05 17:31:23 -07:00
23ebbaa46e Revert "remove template from tests" Patrick Devine 2024-07-12 15:47:17 -07:00
9ac0a7a50b remove template from tests Patrick Devine 2024-07-12 15:41:31 -07:00
faa3c937cf writeto Josh Yan 2024-07-12 15:37:27 -07:00
e210f8763f merge conflicts royh-precision Roy Han 2024-07-12 15:09:05 -07:00
3971c2333f

Merge branch 'main' into royh-precision royjhan 2024-07-12 15:07:36 -07:00
1f3aefd323 remove function closure Roy Han 2024-07-12 14:45:16 -07:00
2d7048f410 Revert "remove function closure" Roy Han 2024-07-12 14:40:40 -07:00
55d48c6ed1 remove function closure Roy Han 2024-07-12 14:35:43 -07:00
cf57246aba write Josh Yan 2024-07-12 12:59:51 -07:00
6fafe4f753 gguf Josh Yan 2024-07-12 12:58:00 -07:00
e5c65a85df

Merge pull request #5653 from ollama/mxyng/collect-system v0.2.2 Michael Yang 2024-07-12 12:32:34 -07:00
33627331a3

app: also clean up tempdir runners on install (#5646) Jeffrey Morgan 2024-07-12 12:29:23 -07:00
36c87c433b template: preprocess message and collect system Michael Yang 2024-07-12 11:48:06 -07:00
d7c8d4f3f4 ggufwritekv Josh Yan 2024-07-12 12:25:13 -07:00
3d0fd31f0e TensorWriter Josh Yan 2024-07-12 12:18:46 -07:00
4c9a160a08 race Josh Yan 2024-07-12 11:52:10 -07:00
c0b5bf0a36 testing clean up Roy Han 2024-07-12 11:45:45 -07:00
554f3bdc0e interface Josh Yan 2024-07-12 10:02:58 -07:00
e75fb73839 types Josh Yan 2024-07-12 09:42:10 -07:00
179737feb7

Clean up old files when installing on Windows (#5645) Jeffrey Morgan 2024-07-11 22:53:46 -07:00
53e9576f46 testing clean up Roy Han 2024-07-11 20:20:14 -07:00
2fdebffc8d sawp Josh Yan 2024-07-11 18:18:26 -07:00
29ecfe493b write Josh Yan 2024-07-11 17:56:51 -07:00
47353f5ee4

Merge pull request #5639 from ollama/mxyng/unaggregated-system Michael Yang 2024-07-11 17:48:50 -07:00
dbe9527305 clean up Roy Han 2024-07-11 17:28:55 -07:00
10e768826c

fix: quant err message (#5616) Josh 2024-07-11 17:24:29 -07:00
b6c7d01af3 more cmt rmv jyan/progress Josh Yan 2024-07-11 17:21:36 -07:00
9d517cf556 rm comment Josh Yan 2024-07-11 17:20:09 -07:00
5056bb9c01 rename aggregate to contents Michael Yang 2024-07-11 16:06:57 -07:00
c4cf8ad559

llm: avoid loading model if system memory is too small (#5637) Jeffrey Morgan 2024-07-11 16:42:57 -07:00
57ec6901eb revert embedded templates to use prompt/response Michael Yang 2024-07-11 13:11:40 -07:00
e64f9ebb44 do no automatically aggregate system messages Michael Yang 2024-07-11 13:10:13 -07:00
657a1102fc lint complained Josh Yan 2024-07-11 09:26:17 -07:00
791650ddef

sched: only error when over-allocating system memory (#5626) v0.2.2-rc2 Jeffrey Morgan 2024-07-11 00:53:12 -07:00
efbf41ed81

llm: dont link cuda with compat libs (#5621) v0.2.2-rc1 Jeffrey Morgan 2024-07-10 20:01:52 -07:00
cf15589851

Merge pull request #5620 from ollama/mxyng/templates Michael Yang 2024-07-10 17:16:24 -07:00
19753c18c0 update embedded templates Michael Yang 2024-07-10 11:00:29 -07:00
41be28096a add system prompt to first legacy template Michael Yang 2024-07-10 11:00:07 -07:00
694388db90 set context length Roy Han 2024-07-10 15:21:46 -07:00
d352c68ffc move llama.h Josh Yan 2024-07-10 14:20:04 -07:00
37a570f962

Merge pull request #5612 from ollama/mxyng/mem Michael Yang 2024-07-10 14:18:33 -07:00
3c0d043b79 pass the template to the /api/chat endpoint pdevine/fix-template Patrick Devine 2024-07-10 14:17:39 -07:00
d82d25d70c patched Josh Yan 2024-07-10 13:57:28 -07:00
60be9e2840 patch Josh Yan 2024-07-10 13:46:38 -07:00
5a739ff4cb chatglm graph Michael Yang 2024-07-10 13:18:04 -07:00
a083852eb5 quantize progress Josh Yan 2024-07-10 13:21:22 -07:00
4e262eb2a8

remove GGML_CUDA_FORCE_MMQ=on from build (#5588) Jeffrey Morgan 2024-07-10 13:17:13 -07:00
a548eb6003 a8db2a9 jyan/quant3 Josh Yan 2024-07-10 13:10:58 -07:00
f92818d90d patch again Josh Yan 2024-07-10 13:06:40 -07:00