ollama/diagmask.cuh at 5e921e06acb08d7e2676d3977cd82f48b51f5254


jmorganca 0110994d06 Initial llama Go module

2024-09-03 21:15:12 -04:00



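#include "common.cuh"
// Threads per CUDA block used when launching the diag_mask_inf kernel.
#define CUDA_DIAG_MASK_INF_BLOCK_SIZE 32
// Applies the diag_mask_inf operation on the CUDA backend; the result is written to dst.
void ggml_cuda_op_diag_mask_inf(ggml_backend_cuda_context & ctx, ggml_tensor * dst);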
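
The declaration above only names the operation, so the following is a minimal host-side sketch of what a diagonal "mask to negative infinity" step typically computes. It assumes the usual causal-masking convention, in which row `r` of a row-major matrix keeps columns `0..n_past + r` and every later column is replaced with negative infinity. The function `diag_mask_inf_reference` and its parameters `rows`, `cols`, and `n_past` are hypothetical names chosen for illustration. They are not part of this header, and the actual CUDA kernel may differ in layout and in how it represents the masked value.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical CPU reference, NOT part of ggml or this header.
// Assumption: x is a row-major rows x cols matrix, and row r keeps
// columns 0..n_past + r; all later columns are masked.
std::vector<float> diag_mask_inf_reference(const std::vector<float> & x,
                                           std::size_t rows,
                                           std::size_t cols,
                                           std::size_t n_past) {
    assert(x.size() == rows * cols);
    const float neg_inf = -std::numeric_limits<float>::infinity();

    std::vector<float> out(x);  // unmasked entries are copied through unchanged
    for (std::size_t r = 0; r < rows; ++r) {
        // First masked column for this row is n_past + r + 1.
        for (std::size_t c = n_past + r + 1; c < cols; ++c) {
            out[r * cols + c] = neg_inf;
        }
    }
    return out;
}
```

Under these assumptions, a 3 x 3 input with `n_past = 0` keeps its lower triangle (including the diagonal) and masks the three entries above it. A softmax applied afterwards would then assign those positions zero weight.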