This website requires JavaScript.
Explore
Help
Sign In
norohind
/
ollama
Watch
1
Star
0
Fork
0
You've already forked ollama
forked from
third-party-mirrors/ollama
Code
Pull Requests
Activity
ollama
/
server
History
Jeffrey Morgan
38255d2af1
Use flash attention flag for now (
#4580
)
...
* put flash attention behind flag for now * add test * remove print * up timeout for sheduler tests
2024-05-22 21:52:09 -07:00
..
envconfig
Use flash attention flag for now (
#4580
)
2024-05-22 21:52:09 -07:00
auth.go
Revert "use post token"
2024-05-11 22:19:14 -07:00
download.go
return on part done
2024-05-14 13:04:30 -07:00
fixblobs_test.go
…
fixblobs.go
…
images.go
fix quantize file types
2024-05-20 15:22:11 -07:00
layer.go
cache and reuse intermediate blobs
2024-05-20 13:25:10 -07:00
manifest.go
update list handler to use model.Name
2024-05-07 09:38:45 -07:00
model.go
tidy intermediate blobs
2024-05-20 15:15:06 -07:00
modelpath_test.go
…
modelpath.go
prune partial downloads (
#4272
)
2024-05-09 16:35:20 -07:00
prompt_test.go
…
prompt.go
…
routes_test.go
Move the parser back + handle utf16 files (
#4533
)
2024-05-20 11:26:45 -07:00
routes.go
tidy intermediate blobs
2024-05-20 15:15:06 -07:00
sched_test.go
Use flash attention flag for now (
#4580
)
2024-05-22 21:52:09 -07:00
sched.go
Correct typo in error message (
#4535
)
2024-05-21 13:39:01 -07:00
upload.go
…