3525 Commits

Author SHA1 Message Date
jmorganca
8f79a2e86a cleanup stop code 2024-09-03 21:15:13 -04:00
jmorganca
a4d402c403 fix example 2024-09-03 21:15:13 -04:00
jmorganca
e1dfc757b3 revert llm changes 2024-09-03 21:15:13 -04:00
jmorganca
7d0a452938 num predict 2024-09-03 21:15:13 -04:00
jmorganca
43efc893d7 basic progress 2024-09-03 21:15:13 -04:00
jmorganca
20afaae020 add more runner params 2024-09-03 21:15:13 -04:00
jmorganca
72f3fe4b94 truncate stop properly 2024-09-03 21:15:13 -04:00
jmorganca
a379d68aa9 wip stop tokens 2024-09-03 21:15:13 -04:00
jmorganca
b2ef3bf490 embeddings 2024-09-03 21:15:12 -04:00
jmorganca
ce15ed6d69 remove dependency on llm 2024-09-03 21:15:12 -04:00
jmorganca
c0b94376b2 grammar 2024-09-03 21:15:12 -04:00
jmorganca
72be8e27c4 sampling 2024-09-03 21:15:12 -04:00
jmorganca
d12db0568e better example module, add port 2024-09-03 21:15:12 -04:00
jmorganca
ec17359a68 wip 2024-09-03 21:15:12 -04:00
jmorganca
fbc8572859 add llava to runner 2024-09-03 21:15:12 -04:00
jmorganca
87af27dac0 fix output in build_hipblas.sh 2024-09-03 21:15:12 -04:00
jmorganca
54f391309f mods to build_hipblas.sh for linux 2024-09-03 21:15:12 -04:00
jmorganca
28bedcd807 wip 2024-09-03 21:15:12 -04:00
jmorganca
922d0acbdb improve cuda and hipblas build scripts 2024-09-03 21:15:12 -04:00
jmorganca
b22d78720e cuda linux 2024-09-03 21:15:12 -04:00
Jeffrey Morgan
905568a47f Update README.md 2024-09-03 21:15:12 -04:00
Jeffrey Morgan
a15ac52fbe Update README.md 2024-09-03 21:15:12 -04:00
jmorganca
9547aa53ff disable log file 2024-09-03 21:15:12 -04:00
jmorganca
e29205ad6d fix readme for llava 2024-09-03 21:15:12 -04:00
jmorganca
a8f91d3cc1 add llava 2024-09-03 21:15:12 -04:00
jmorganca
a9884ae136 llama: add clip dependencies 2024-09-03 21:15:12 -04:00
jmorganca
e37651cca0 add clip and parallel requests to the todo list 2024-09-03 21:15:12 -04:00
jmorganca
593d6836ab fix cuda build 2024-09-03 21:15:12 -04:00
jmorganca
533a7e7d50 fix build on windows 2024-09-03 21:15:12 -04:00
jmorganca
0873d28b16 fix ggml-metal.m build constraints 2024-09-03 21:15:12 -04:00
jmorganca
bb795faa6c fix ggml-metal.m 2024-09-03 21:15:12 -04:00
jmorganca
e86db9381a avx2 should only add avx2 2024-09-03 21:15:12 -04:00
jmorganca
4a5633e4bc fix sync script 2024-09-03 21:15:12 -04:00
jmorganca
86f453252b fix ggml-metal.m 2024-09-03 21:15:12 -04:00
jmorganca
dfd8f34806 fix ggml-metal.m 2024-09-03 21:15:12 -04:00
jmorganca
beb847b40f add license headers 2024-09-03 21:15:12 -04:00
jmorganca
785f76d390 pre-patch 2024-09-03 21:15:12 -04:00
jmorganca
9fe48978a8 move runner package down 2024-09-03 21:15:12 -04:00
jmorganca
01ccbc07fe replace static build in llm 2024-09-03 21:15:12 -04:00
jmorganca
ec09be97e8 fix build 2024-09-03 21:15:12 -04:00
jmorganca
6129f30479 wip... 2024-09-03 21:15:12 -04:00
jmorganca
eb1aa97961 rename server to runner 2024-09-03 21:15:12 -04:00
Jeffrey Morgan
5e921e06ac Update README.md 2024-09-03 21:15:12 -04:00
Jeffrey Morgan
02089baf70 Update README.md 2024-09-03 21:15:12 -04:00
Jeffrey Morgan
870e91be76 Update README.md 2024-09-03 21:15:12 -04:00
Jeffrey Morgan
7ecc8e86c4 Update README.md 2024-09-03 21:15:12 -04:00
jmorganca
b1696e308e Add missing hipcc flags 2024-09-03 21:15:12 -04:00
jmorganca
c646115b31 fix .gitattributes 2024-09-03 21:15:12 -04:00
jmorganca
0110994d06 Initial llama Go module 2024-09-03 21:15:12 -04:00
jmorganca
2ef3a217d1 add sync of llama.cpp 2024-09-03 21:15:12 -04:00