Michael Yang
75a07dd8f7
integrate mllama.cpp to server.cpp
2024-10-10 09:56:45 -07:00
Michael Yang
cb1118c842
draft: mllama vision encoder
2024-10-10 09:56:45 -07:00
Patrick Devine
af613bab33
fix prompting
2024-10-09 14:24:36 -07:00
Patrick Devine
6cb0abf6d8
add compositing for pngs
2024-10-08 18:46:58 -07:00
Patrick Devine
3a1c8da5e4
only allow a single image to be passed
2024-10-08 18:30:07 -07:00
Patrick Devine
03cf7627ec
change resize algorithm
2024-10-06 17:15:44 -07:00
jmorganca
71e76f8c90
server.cpp: cleanup cross attention state
2024-09-30 11:17:54 -07:00
jmorganca
7d5e0ff80e
add server.cpp and patches
2024-09-30 11:17:50 -07:00
Patrick Devine
5486c57364
fix template / imageproc issues
2024-09-26 22:39:45 -07:00
Patrick Devine
a2d33ee390
linter feeding
2024-09-26 02:15:17 -07:00
Patrick Devine
96a8b2f7d8
fix prompt for non-mllama multimodal
2024-09-26 01:31:53 -07:00
Patrick Devine
c48e2cfc0d
more fixes for mllama
2024-09-26 01:16:41 -07:00
Patrick Devine
5da1043680
feed the linter
2024-09-25 13:08:08 -07:00
Patrick Devine
f8ed545cbb
image processing for llama3.2
2024-09-25 11:54:43 -07:00
Jeffrey Morgan
55ea963c9e
update default model to llama3.2 ( #6959 )
2024-09-25 11:11:22 -07:00
Daniel Hiltgen
e9e9bdb8d9
CI: Fix win arm version defect ( #6940 )
...
write-host in powershell writes directly to the console and will not be picked
up by a pipe. Echo, or write-output will.
v0.3.12
2024-09-24 15:18:10 -07:00
Alex Yang
35bb6d32b3
readme: update llamaindex links ( #6939 )
2024-09-24 12:15:43 -07:00
Deep Lakhani
98701b58b3
readme: add LLMChat to community integrations ( #6919 )
2024-09-23 17:49:46 -07:00
Mahesh Sathiamoorthy
ad935f45ac
examples: use punkt_tab instead of punkt ( #6907 )
...
This was causing an error since we depend on punkt_tab.
v0.3.12-rc5
2024-09-21 18:55:28 -07:00
Daniel Hiltgen
dbba73469d
runner: Set windows above normal priority ( #6905 )
...
When running the subprocess as a background service windows may
throttle, which can lead to thrashing and very poor token rate.
2024-09-21 16:54:49 -07:00
Daniel Hiltgen
6c2eb73a70
Fix missing dep path on windows CPU runners ( #6884 )
...
GPUs handled the dependency path properly, but CPU runners didn't which
results in missing vc redist libraries on systems where the user didn't
already have it installed from some other app.
2024-09-21 16:28:29 -07:00
Daniel Hiltgen
2a038c1d7e
CI: win arm artifact dist dir ( #6900 )
...
The upload artifact is missing the dist prefix since all
payloads are in the same directory, so restore the prefix
on download.
v0.3.12-rc4
2024-09-20 19:16:18 -07:00
Daniel Hiltgen
616c5eafee
CI: win arm adjustments ( #6898 )
v0.3.12-rc3
2024-09-20 16:58:56 -07:00
Daniel Hiltgen
f5ff917b1d
CI: adjust step ordering for win arm to match x64 ( #6895 )
v0.3.12-rc2
2024-09-20 14:20:57 -07:00
Daniel Hiltgen
d632e23fba
Add Windows arm64 support to official builds ( #5712 )
...
* Unified arm/x86 windows installer
This adjusts the installer payloads to be architecture aware so we can cary
both amd64 and arm64 binaries in the installer, and install only the applicable
architecture at install time.
* Include arm64 in official windows build
* Harden schedule test for slow windows timers
This test seems to be a bit flaky on windows, so give it more time to converge
v0.3.12-rc1
2024-09-20 13:09:38 -07:00
Patrick Devine
5804cf1723
documentation for stopping a model ( #6766 )
2024-09-18 16:26:42 -07:00
Ryan Marten
bf7ee0f4d4
examples: add python examples for bespoke-minicheck
( #6841 )
2024-09-18 09:35:25 -07:00
Michael Yang
504a410f02
llm: add solar pro (preview) ( #6846 )
v0.3.11
2024-09-17 18:11:26 -07:00
Jeffrey Morgan
d05da29912
server: add tool parsing support for nemotron-mini ( #6849 )
2024-09-17 18:06:16 -07:00
Michael Yang
72962c6e08
Merge pull request #6833 from ollama/mxyng/git-am
...
make patches git am-able
2024-09-17 16:33:23 -07:00
Michael Yang
7bd7b02712
make patches git am-able
...
raw diffs can be applied using `git apply` but not with `git am`. git
patches, e.g. through `git format-patch` are both apply-able and am-able
2024-09-17 15:26:40 -07:00
Daniel Hiltgen
8f9ab5e14d
CI: dist directories no longer present ( #6834 )
...
The new buildx based build no longer leaves the dist/linux-* directories
around, so we don't have to clean them up before uploading.
v0.3.11-rc4
2024-09-16 17:31:37 -07:00
Daniel Hiltgen
7717bb6a84
CI: clean up naming, fix tagging latest ( #6832 )
...
The rocm CI step for RCs was incorrectly tagging them as the latest rocm build.
The multiarch manifest was incorrectly tagged twice (with and without the
prefix "v"). Static windows artifacts weren't being carried between build
jobs. This also fixes the latest tagging script.
v0.3.11-rc3
2024-09-16 16:18:41 -07:00
Daniel Hiltgen
0ec2915ea7
CI: set platform build build_linux script to keep buildx happy ( #6829 )
...
The runners don't have emulation set up so the default multi-platform build
wont work.
v0.3.11-rc2
2024-09-16 14:07:29 -07:00
Michael Yang
c9a7541b9c
readme: add Agents-Flex to community integrations ( #6788 )
v0.3.11-rc1
2024-09-16 13:42:52 -07:00
Patrick Devine
d81cfd7d6f
fix typo in import docs ( #6828 )
2024-09-16 11:48:14 -07:00
Pepo
b330c830d3
readme: add vim-intelligence-bridge to Terminal section ( #6818 )
2024-09-15 21:20:36 -04:00
Edward Cui
d889c6fd07
readme: add Obsidian Quiz Generator plugin to community integrations ( #6789 )
2024-09-14 23:52:37 -04:00
Daniel Hiltgen
56b9af336a
Fix incremental builds on linux ( #6780 )
...
scripts: fix incremental builds on linux or similar
2024-09-13 08:24:08 -07:00
Daniel Hiltgen
fda0d3be52
Use GOARCH for build dirs ( #6779 )
...
Corrects x86_64 vs amd64 discrepancy
2024-09-12 16:38:05 -07:00
Daniel Hiltgen
cd5c8f6471
Optimize container images for startup ( #6547 )
...
* Optimize container images for startup
This change adjusts how to handle runner payloads to support
container builds where we keep them extracted in the filesystem.
This makes it easier to optimize the cpu/cuda vs cpu/rocm images for
size, and should result in faster startup times for container images.
* Refactor payload logic and add buildx support for faster builds
* Move payloads around
* Review comments
* Converge to buildx based helper scripts
* Use docker buildx action for release
2024-09-12 12:10:30 -07:00
dcasota
fef257c5c5
examples: updated requirements.txt for privategpt example
2024-09-11 18:56:56 -07:00
Adrian Cole
d066d9b8e0
examples: polish loganalyzer example ( #6744 )
2024-09-11 18:37:37 -07:00
RAPID ARCHITECT
5a00dc9fc9
readme: add ollama_moe to community integrations ( #6752 )
2024-09-11 18:36:26 -07:00
Jesse Gross
c354e87809
Merge pull request #6767 from ollama/jessegross/bug_6707
...
runner: Flush pending responses before returning
2024-09-11 17:20:22 -07:00
Jesse Gross
93ac3760cb
runner: Flush pending responses before returning
...
If there are any pending reponses (such as from potential stop
tokens) then we should send them back before ending the sequence.
Otherwise, we can be missing tokens at the end of a response.
Fixes #6707
2024-09-11 16:39:32 -07:00
Patrick Devine
abed273de3
add "stop" command ( #6739 )
2024-09-11 16:36:21 -07:00
Michael Yang
034392624c
Merge pull request #6762 from ollama/mxyng/show-output
...
refactor show ouput
2024-09-11 14:58:40 -07:00
Michael Yang
ecab6f1cc5
refactor show ouput
...
fixes line wrapping on long texts
2024-09-11 14:23:09 -07:00
Petr Mironychev
7d6900827d
readme: add QodeAssist to community integrations ( #6754 )
2024-09-11 13:19:49 -07:00