3212 Commits

Author SHA1 Message Date
Josh Yan
f1b5d939f5 changes 2024-07-22 15:41:26 -07:00
Josh Yan
d1b7f8bb07 testing auth 2024-07-22 15:41:26 -07:00
Josh Yan
6d4724a06d auth changes' 2024-07-22 15:41:26 -07:00
Josh Yan
c507325288 isLocal testing 2024-07-22 15:41:26 -07:00
Josh Yan
09431f353d timecheck 2024-07-22 15:41:26 -07:00
Josh Yan
8548d1d596 cmt 2024-07-22 15:41:26 -07:00
Josh Yan
478b58dd77 remove knownhosts 2024-07-22 15:41:26 -07:00
Josh Yan
24c5e172ca lint 2024-07-22 15:41:26 -07:00
Josh Yan
d12717e7dc clean 2024-07-22 15:41:26 -07:00
Josh Yan
a80d79536a removed cmt and prints 2024-07-22 15:41:26 -07:00
Josh Yan
4c1e188200 removed client isLocal() 2024-07-22 15:41:26 -07:00
Josh Yan
689a7cb90d lint 2024-07-22 15:41:26 -07:00
Josh Yan
93a8054693 lint 2024-07-22 15:41:26 -07:00
Josh Yan
7769602b75 lint 2024-07-22 15:41:26 -07:00
Josh Yan
8048ce0816 syscopy windows 2024-07-22 15:41:26 -07:00
Josh Yan
72314bf4b5 os copy 2024-07-22 15:41:26 -07:00
Josh Yan
d4ab994ade rmv prints 2024-07-22 15:41:26 -07:00
Josh Yan
c44f4825c4 local copy 2024-07-22 15:41:26 -07:00
Josh Yan
154b59c0b6 isLocal firstdraft 2024-07-22 15:41:26 -07:00
Josh Yan
8ee1ada22a clean 2024-07-22 15:41:26 -07:00
Josh Yan
e9a2ead87a rm bench 2024-07-22 15:41:26 -07:00
Josh Yan
a7721cb1d2 rm config 2024-07-22 15:41:26 -07:00
Josh Yan
1a6197abb1 rm config 2024-07-22 15:41:26 -07:00
Josh Yan
9fbd474bf7 clean 2024-07-22 15:41:26 -07:00
Josh Yan
7e8d8cc72f local path 2024-07-22 15:41:26 -07:00
Josh Yan
cbd98a2e37 still works 2024-07-22 15:41:26 -07:00
Josh Yan
ad36d4ff1b rebase 2024-07-22 15:41:26 -07:00
Josh Yan
461c964941 benchmark 2024-07-22 15:41:26 -07:00
Josh Yan
a993a3a85c on disk copy 2024-07-22 15:41:26 -07:00
Josh Yan
f7d64856d5 start tests 2024-07-22 15:41:26 -07:00
Josh Yan
6b1b85ba3d hide initialize keypair 2024-07-22 15:41:04 -07:00
Josh Yan
5603441538 test 2024-07-22 13:58:50 -07:00
Josh Yan
76b4dfcc9e auth 2024-07-22 13:54:02 -07:00
Daniel Hiltgen
5784c05397
Merge pull request #5854 from dhiltgen/win_exit_status
Refine error reporting for subprocess crash
2024-07-22 10:40:22 -07:00
Daniel Hiltgen
f14aa5435d
Merge pull request #5855 from dhiltgen/remove_max_vram
Remove no longer supported max vram var
2024-07-22 10:35:29 -07:00
Jeffrey Morgan
f8fedbda20
Update llama.cpp submodule commit to d94c6e0c (#5805) v0.2.8-rc2 2024-07-22 12:42:00 -04:00
Jeffrey Morgan
b3e5491e41
server: collect nested tool call objects when parsing (#5824) 2024-07-22 12:38:03 -04:00
Daniel Hiltgen
cc269ba094 Remove no longer supported max vram var
The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
scenarios.  With Concurrency this was no longer wired up, and the simplistic
value doesn't map to multi-GPU setups.  Users can still set `num_gpu`
to limit memory usage to avoid OOM if we get our predictions wrong.
2024-07-22 09:08:11 -07:00
Daniel Hiltgen
a3c20e3f18 Refine error reporting for subprocess crash
On windows, the exit status winds up being the search term many
users search for and end up piling in on issues that are unrelated.
This refines the reporting so that if we have a more detailed message
we'll suppress the exit status portion of the message.
2024-07-22 08:52:16 -07:00
Jeffrey Morgan
80ee9b5e47
Remove out of space test temporarily (#5825) 2024-07-21 00:22:11 -04:00
Jeffrey Morgan
5534f2cc6a
llm: consider head_dim in llama arch (#5817) v0.2.8-rc1 2024-07-20 21:48:12 -04:00
Daniel Hiltgen
d321297d8a
Merge pull request #5815 from dhiltgen/win_rocm_gfx_features
Adjust windows ROCm discovery
2024-07-20 16:02:55 -07:00
Daniel Hiltgen
06e5d74e34
Merge pull request #5506 from dhiltgen/sched_tests
Refine scheduler unit tests for reliability
2024-07-20 15:48:39 -07:00
Daniel Hiltgen
5d707e6fd5
Merge pull request #5583 from dhiltgen/integration_improvements
Fix context exhaustion integration test for small gpus
2024-07-20 15:48:21 -07:00
Daniel Hiltgen
283948c83b Adjust windows ROCm discovery
The v5 hip library returns unsupported GPUs which wont enumerate at
inference time in the runner so this makes sure we align discovery.  The
gfx906 cards are no longer supported so we shouldn't compile with that
GPU type as it wont enumerate at runtime.
2024-07-20 15:17:50 -07:00
Jeffrey Morgan
1475eab95f
add patch for tekken (#5807) 2024-07-20 13:41:21 -04:00
Jeffrey Morgan
20090f3172
preserve last assistant message (#5802) 2024-07-19 20:19:26 -07:00
Jeffrey Morgan
69a2d4ccff
Fix generate test flakyness (#5804) 2024-07-19 19:11:25 -07:00
Josh
e8b954c646
server: validate template (#5734)
add template validation to modelfile
2024-07-19 15:24:29 -07:00
royjhan
c57317cbf0
OpenAI: Function Based Testing (#5752)
* distinguish error forwarding

* more coverage

* rm comment
2024-07-19 11:37:12 -07:00