jmorganca
ce15ed6d69
remove dependency on llm
2024-09-03 21:15:12 -04:00
jmorganca
c0b94376b2
grammar
2024-09-03 21:15:12 -04:00
jmorganca
72be8e27c4
sampling
2024-09-03 21:15:12 -04:00
jmorganca
d12db0568e
better example
module, add port
2024-09-03 21:15:12 -04:00
jmorganca
ec17359a68
wip
2024-09-03 21:15:12 -04:00
jmorganca
fbc8572859
add llava
to runner
2024-09-03 21:15:12 -04:00
jmorganca
87af27dac0
fix output in build_hipblas.sh
2024-09-03 21:15:12 -04:00
jmorganca
54f391309f
mods to build_hipblas.sh
for linux
2024-09-03 21:15:12 -04:00
jmorganca
28bedcd807
wip
2024-09-03 21:15:12 -04:00
jmorganca
922d0acbdb
improve cuda and hipblas build scripts
2024-09-03 21:15:12 -04:00
jmorganca
b22d78720e
cuda linux
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
905568a47f
Update README.md
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
a15ac52fbe
Update README.md
2024-09-03 21:15:12 -04:00
jmorganca
9547aa53ff
disable log file
2024-09-03 21:15:12 -04:00
jmorganca
e29205ad6d
fix readme for llava
2024-09-03 21:15:12 -04:00
jmorganca
a8f91d3cc1
add llava
2024-09-03 21:15:12 -04:00
jmorganca
a9884ae136
llama: add clip dependencies
2024-09-03 21:15:12 -04:00
jmorganca
e37651cca0
add clip and parallel requests to the todo list
2024-09-03 21:15:12 -04:00
jmorganca
593d6836ab
fix cuda build
2024-09-03 21:15:12 -04:00
jmorganca
533a7e7d50
fix build on windows
2024-09-03 21:15:12 -04:00
jmorganca
0873d28b16
fix ggml-metal.m
build constraints
2024-09-03 21:15:12 -04:00
jmorganca
bb795faa6c
fix ggml-metal.m
2024-09-03 21:15:12 -04:00
jmorganca
e86db9381a
avx2
should only add avx2
2024-09-03 21:15:12 -04:00
jmorganca
4a5633e4bc
fix sync script
2024-09-03 21:15:12 -04:00
jmorganca
86f453252b
fix ggml-metal.m
2024-09-03 21:15:12 -04:00
jmorganca
dfd8f34806
fix ggml-metal.m
2024-09-03 21:15:12 -04:00
jmorganca
beb847b40f
add license headers
2024-09-03 21:15:12 -04:00
jmorganca
785f76d390
pre-patch
2024-09-03 21:15:12 -04:00
jmorganca
9fe48978a8
move runner
package down
2024-09-03 21:15:12 -04:00
jmorganca
01ccbc07fe
replace static build in llm
2024-09-03 21:15:12 -04:00
jmorganca
ec09be97e8
fix build
2024-09-03 21:15:12 -04:00
jmorganca
6129f30479
wip...
2024-09-03 21:15:12 -04:00
jmorganca
eb1aa97961
rename server
to runner
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
5e921e06ac
Update README.md
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
02089baf70
Update README.md
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
870e91be76
Update README.md
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
7ecc8e86c4
Update README.md
2024-09-03 21:15:12 -04:00
jmorganca
b1696e308e
Add missing hipcc flags
2024-09-03 21:15:12 -04:00
jmorganca
c646115b31
fix .gitattributes
2024-09-03 21:15:12 -04:00
jmorganca
0110994d06
Initial llama
Go module
2024-09-03 21:15:12 -04:00
jmorganca
2ef3a217d1
add sync of llama.cpp
2024-09-03 21:15:12 -04:00
Jeffrey Morgan
5e2653f9fe
llm: update llama.cpp commit to 8962422 ( #6618 )
2024-09-03 21:12:39 -04:00
Daniel Hiltgen
f29b167e1a
Use cuda v11 for driver 525 and older ( #6620 )
...
It looks like driver 525 (aka, cuda driver 12.0) has problems with the cuda v12 library
we compile against, so run v11 on those older drivers if detected.
2024-09-03 17:15:31 -07:00
Daniel Hiltgen
037a4d103e
Log system memory at info ( #6617 )
...
On systems with low system memory, we can hit allocation failures that are difficult to diagnose
without debug logs. This will make it easier to spot.
2024-09-03 14:55:20 -07:00
Mateusz Migas
50c05d57e0
readme: add Painting Droid community integration ( #5514 )
2024-09-03 16:15:54 -04:00
Amith Koujalgi
35159de18a
readme: update Ollama4j link and add link to Ollama4j Web UI ( #6608 )
2024-09-03 16:08:50 -04:00
FellowTraveler
94fff5805f
Fix sprintf to snprintf ( #5664 )
...
/Users/au/src/ollama/llm/ext_server/server.cpp:289:9: warning: 'sprintf' is deprecated: This function is provided for compatibility reasons only. Due to security concerns inherent in the design of sprintf(3), it is highly recommended that you use snprintf(3) instead.
2024-09-03 09:32:59 -07:00
OpenVMP
14d5093cd0
readme: add PartCAD tool to readme for generating 3D CAD models using Ollama ( #6605 )
2024-09-03 12:28:01 -04:00
R0CKSTAR
9df5f0e8e4
Reduce docker image size ( #5847 )
...
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
2024-09-03 09:25:31 -07:00
presbrey
ad3eb00bee
readme: add OllamaFarm project ( #6508 )
2024-09-02 16:05:36 -04:00