ollama/concat.cuh at d12db0568e18eea8981ff0e4ccdfdf9a3dd3ecd5


jmorganca 01ccbc07fe replace static build in llm

2024-09-03 21:15:12 -04:00


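#include "common.cuh"
// Threads per block for the concat kernels.
#define CUDA_CONCAT_BLOCK_SIZE 256
// CUDA backend entry point for the concat op: concatenates dst's source tensors into dst.
void ggml_cuda_op_concat(ggml_backend_cuda_context & ctx, ggml_tensor * dst);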
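
The header only fixes the block size; the kernel launch itself lives elsewhere and is not shown here. As a minimal sketch, assuming the usual one-thread-per-element pattern, the number of blocks for a given element count could be derived by ceiling division. The helper name `concat_num_blocks` is hypothetical and not part of the original source.

```cpp
#include <cstdint>

#define CUDA_CONCAT_BLOCK_SIZE 256

// Hypothetical helper (an assumption, not from the original header):
// number of thread blocks needed so that `n` elements are covered when
// each block runs CUDA_CONCAT_BLOCK_SIZE threads, one element per thread.
static int64_t concat_num_blocks(int64_t n) {
    // Ceiling division: round up so a partially filled last block is counted.
    return (n + CUDA_CONCAT_BLOCK_SIZE - 1) / CUDA_CONCAT_BLOCK_SIZE;
}
```

Under that assumption, 1000 elements would need 4 blocks, with the last block only partly used; a kernel launched this way has to bounds-check its element index.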