Jeffrey Morgan b24e8d17b2
Increase minimum CUDA memory allocation overhead and fix minimum overhead for multi-GPU (#1896)
* increase minimum CUDA overhead and fix minimum overhead for multi-GPU

* fix multi-GPU overhead

* limit overhead to 10% of memory across all GPUs

* better wording

* allocate fixed amount before layers

* fixed amount only includes graph allocation
2024-01-10 19:08:51 -05:00