Daniel Hiltgen 90ca84172c
Fix embeddings memory corruption (#6467)
* Fix embeddings memory corruption

The patch was leading to a buffer overrun corruption.  Once removed though, parallism
in server.cpp lead to hitting an assert due to slot/seq IDs being >= token count.  To
work around this, only use slot 0 for embeddings.

* Fix embed integration test assumption

The token eval count has changed with recent llama.cpp bumps (0.3.5+)
2024-08-22 14:51:42 -07:00
..
2024-07-26 14:14:48 -07:00
2024-08-14 16:55:45 -07:00
2024-08-01 17:06:06 -07:00
2024-08-15 10:29:14 -07:00
2024-08-12 11:46:09 -07:00
2024-07-22 11:49:02 -07:00
2024-07-22 11:26:12 -07:00
2024-08-01 17:06:06 -07:00
2024-07-15 15:26:16 -07:00
2024-08-02 16:04:06 -07:00
2024-07-31 15:58:55 -07:00
2024-08-01 17:06:06 -07:00
2024-08-21 11:49:31 -07:00