Daniel Hiltgen 345420998e Prevent partial loading on mixed GPU brands
In mult-brand GPU setups, if we couldn't fully load the model we
would fall through the scheduler and mistakenly try to load across
a mix of brands.  This makes sure we find the set of GPU(s) that
best fit for the partial load.
2024-07-30 11:00:55 -07:00
..
2024-07-26 14:14:48 -07:00
2024-07-15 15:26:16 -07:00
2024-07-22 15:48:15 -07:00
2024-07-26 14:14:48 -07:00