ollama/softmax.cuh at d12db0568e18eea8981ff0e4ccdfdf9a3dd3ecd5 - ollama - Gitea: Git with a cup of tea

third-party-mirrors/ollama

jmorganca 01ccbc07fe replace static build in llm

2024-09-03 21:15:12 -04:00

6 lines

142 B

Plaintext

Vendored

Raw Blame History

 #include "common.cuh"
 #define CUDA_SOFT_MAX_BLOCK_SIZE 1024
 void ggml_cuda_op_soft_max(ggml_backend_cuda_context & ctx, ggml_tensor * dst);