Daniel Hiltgen ff4f0cbd1d Prevent multiple concurrent loads on the same gpus
While models are loading, the VRAM metrics are dynamic, so try
to load on a GPU that doesn't have a model actively loading, or wait
to avoid races that lead to OOMs
2024-06-14 14:51:40 -07:00
..
2024-05-11 22:19:14 -07:00
2024-06-06 15:19:03 -07:00
2024-06-04 11:13:30 -07:00