Daniel Hiltgen
345420998e
Prevent partial loading on mixed GPU brands
...
In mult-brand GPU setups, if we couldn't fully load the model we
would fall through the scheduler and mistakenly try to load across
a mix of brands. This makes sure we find the set of GPU(s) that
best fit for the partial load.
2024-07-30 11:00:55 -07:00
..
2024-07-22 12:38:03 -04:00
2024-07-26 14:14:48 -07:00
2024-07-26 14:24:24 -07:00
2024-03-14 20:18:06 -07:00
2024-03-14 20:18:06 -07:00
2024-07-25 15:58:30 -07:00
2024-05-29 12:02:07 -07:00
2024-07-01 10:40:54 -07:00
2024-07-01 10:40:54 -07:00
2024-07-22 12:38:03 -04:00
2024-07-26 13:48:23 -07:00
2024-06-13 12:52:03 -07:00
2024-07-03 15:36:11 -07:00
2024-07-16 09:39:31 -07:00
2024-07-15 15:26:16 -07:00
2024-07-26 13:48:23 -07:00
2024-07-16 09:39:31 -07:00
2024-07-19 19:11:25 -07:00
2024-07-16 09:39:31 -07:00
2024-07-15 17:39:44 -07:00
2024-07-22 15:48:15 -07:00
2024-07-30 11:00:55 -07:00
2024-07-30 11:00:55 -07:00
2024-07-26 14:14:48 -07:00