Ollama

An easy, fast runtime for large language models, powered by llama.cpp.

Note: this project is a work in progress. Certain models that can be run with ollama are intended for research and/or non-commercial use only.

Install

Using pip:

pip install ollama

Using docker:

docker run ollama/ollama

Quickstart

To run a model, use ollama run:

ollama run orca-mini-3b

You can also run models from Hugging Face:

ollama run huggingface.co/TheBloke/orca_mini_3B-GGML

Or run a downloaded model file directly:

ollama run ~/Downloads/orca-mini-13b.ggmlv3.q4_0.bin
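
When pointing ollama at a local file, it can help to confirm the file exists before starting the run. The following is a minimal sketch, not part of ollama itself: the function name run_local_model and the OLLAMA_BIN variable are hypothetical, introduced only for this example.

```shell
#!/bin/sh
# Hypothetical helper: run a local model file only if it exists.
# OLLAMA_BIN is an assumption made for this sketch (not an ollama setting);
# it defaults to the `ollama` command used in this README.
run_local_model() {
    model_path="$1"
    if [ ! -f "$model_path" ]; then
        echo "model file not found: $model_path" >&2
        return 1
    fi
    "${OLLAMA_BIN:-ollama}" run "$model_path"
}
```

After sourcing the snippet, a call such as run_local_model "$HOME/Downloads/orca-mini-13b.ggmlv3.q4_0.bin" would behave like the command above, but fail early with a clear message if the path is wrong.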

Building

go generate ./...
go build .
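
The two build commands above assume a working Go toolchain and that they are run from the repository root. As a rough sketch, they could be wrapped with a couple of sanity checks. The function name build_ollama is hypothetical, and the go.mod check assumes the repository is a Go module with go.mod at its root.

```shell
#!/bin/sh
# Hypothetical wrapper around the two build commands above.
build_ollama() {
    # Stop early if the Go toolchain is not on PATH.
    if ! command -v go >/dev/null 2>&1; then
        echo "go toolchain not found on PATH" >&2
        return 1
    fi
    # Assumption: the checkout root contains go.mod.
    if [ ! -f go.mod ]; then
        echo "go.mod not found; run this from the repository root" >&2
        return 1
    fi
    # Run code generation first, then compile; stop at the first failure.
    go generate ./... && go build .
}
```

Chaining the commands with && means go build only runs if go generate succeeded.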

Documentation
