ollama

Daniel Hiltgen 5e8ff556cb Support forced spreading for multi GPU

Our default behavior today is to try to fit the model into a single GPU
if possible. Some users would prefer the old behavior of always spreading
across multiple GPUs, even if the model can fit into one. This change
exposes that behavior as a tunable.

2024-06-14 14:51:40 -07:00
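
The behavior described in this commit message can be illustrated with a minimal Go sketch. It is not the code from config.go: the `gpu` type and the `spreadFromEnv` and `pickGPUs` helpers are hypothetical, and the environment variable name `OLLAMA_SCHED_SPREAD` is an assumption, since this listing does not name the tunable. The sketch only shows the decision being made: prefer a single GPU that fits the model unless spreading is forced.

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// gpu is a hypothetical stand-in for a device record, not Ollama's real type.
type gpu struct {
	ID       string
	FreeVRAM uint64 // free memory in bytes
}

// spreadFromEnv reads a boolean tunable from the named environment variable.
// An unset or unparsable value is treated as false (the default behavior).
func spreadFromEnv(name string) bool {
	v, err := strconv.ParseBool(os.Getenv(name))
	return err == nil && v
}

// pickGPUs returns the devices a model should be loaded on.
// Without forced spreading, the first single GPU with enough free memory wins.
// With forced spreading, or when no single GPU fits, all GPUs are returned.
func pickGPUs(gpus []gpu, need uint64, spread bool) []gpu {
	if len(gpus) == 0 {
		return nil
	}
	if !spread {
		for _, g := range gpus {
			if g.FreeVRAM >= need {
				return []gpu{g}
			}
		}
	}
	return gpus
}

func main() {
	gpus := []gpu{{"0", 24 << 30}, {"1", 24 << 30}}
	// Assumed variable name; check the envconfig package for the real one.
	spread := spreadFromEnv("OLLAMA_SCHED_SPREAD")
	fmt.Println("GPUs selected:", len(pickGPUs(gpus, 8<<30, spread)))
}
```

Keeping the environment lookup separate from the selection logic means the selection can be tested without touching process state.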

config_test.go

move OLLAMA_HOST to envconfig (#5009)

2024-06-12 18:48:16 -04:00

config.go

Support forced spreading for multi GPU

2024-06-14 14:51:40 -07:00