jmorganca
|
72f3fe4b94
|
truncate stop properly
|
2024-09-03 21:15:13 -04:00 |
|
jmorganca
|
a379d68aa9
|
wip stop tokens
|
2024-09-03 21:15:13 -04:00 |
|
jmorganca
|
b2ef3bf490
|
embeddings
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
ce15ed6d69
|
remove dependency on llm
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
c0b94376b2
|
grammar
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
72be8e27c4
|
sampling
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
d12db0568e
|
better example module, add port
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
ec17359a68
|
wip
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
fbc8572859
|
add llava to runner
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
87af27dac0
|
fix output in build_hipblas.sh
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
54f391309f
|
mods to build_hipblas.sh for linux
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
28bedcd807
|
wip
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
922d0acbdb
|
improve cuda and hipblas build scripts
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
b22d78720e
|
cuda linux
|
2024-09-03 21:15:12 -04:00 |
|
Jeffrey Morgan
|
905568a47f
|
Update README.md
|
2024-09-03 21:15:12 -04:00 |
|
Jeffrey Morgan
|
a15ac52fbe
|
Update README.md
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
9547aa53ff
|
disable log file
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
e29205ad6d
|
fix readme for llava
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
a8f91d3cc1
|
add llava
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
a9884ae136
|
llama: add clip dependencies
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
e37651cca0
|
add clip and parallel requests to the todo list
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
593d6836ab
|
fix cuda build
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
533a7e7d50
|
fix build on windows
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
0873d28b16
|
fix ggml-metal.m build constraints
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
bb795faa6c
|
fix ggml-metal.m
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
e86db9381a
|
avx2 should only add avx2
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
4a5633e4bc
|
fix sync script
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
86f453252b
|
fix ggml-metal.m
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
dfd8f34806
|
fix ggml-metal.m
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
beb847b40f
|
add license headers
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
785f76d390
|
pre-patch
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
9fe48978a8
|
move runner package down
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
01ccbc07fe
|
replace static build in llm
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
ec09be97e8
|
fix build
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
6129f30479
|
wip...
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
eb1aa97961
|
rename server to runner
|
2024-09-03 21:15:12 -04:00 |
|
Jeffrey Morgan
|
5e921e06ac
|
Update README.md
|
2024-09-03 21:15:12 -04:00 |
|
Jeffrey Morgan
|
02089baf70
|
Update README.md
|
2024-09-03 21:15:12 -04:00 |
|
Jeffrey Morgan
|
870e91be76
|
Update README.md
|
2024-09-03 21:15:12 -04:00 |
|
Jeffrey Morgan
|
7ecc8e86c4
|
Update README.md
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
b1696e308e
|
Add missing hipcc flags
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
0110994d06
|
Initial llama Go module
|
2024-09-03 21:15:12 -04:00 |
|
jmorganca
|
2ef3a217d1
|
add sync of llama.cpp
|
2024-09-03 21:15:12 -04:00 |
|
Michael Yang
|
fccf8d179f
|
partial decode ggml bin for more info
|
2023-08-10 09:23:10 -07:00 |
|
Bruce MacDonald
|
984c9c628c
|
fix embeddings invalid values
|
2023-08-09 16:50:53 -04:00 |
|
Bruce MacDonald
|
09d8bf6730
|
fix build errors
|
2023-08-09 10:45:57 -04:00 |
|
Bruce MacDonald
|
7a5f3616fd
|
embed text document in modelfile
|
2023-08-09 10:26:19 -04:00 |
|
Michael Yang
|
f2074ed4c0
|
Merge pull request #306 from jmorganca/default-keep-system
automatically set num_keep if num_keep < 0
|
2023-08-08 09:25:34 -07:00 |
|
Bruce MacDonald
|
a6f6d18f83
|
embed text document in modelfile
|
2023-08-08 11:27:17 -04:00 |
|
Jeffrey Morgan
|
5eb712f962
|
trim whitespace before checking stop conditions
Fixes #295
|
2023-08-08 00:29:19 -04:00 |
|