third-party-mirrors/ollama

Bruce MacDonald 8ea5e5e147 separate routes

2023-07-06 16:34:44 -04:00

api               client updates                                2023-07-06 16:34:44 -04:00
app               auto updater for macos                        2023-07-06 00:04:06 -04:00
cmd               client updates                                2023-07-06 16:34:44 -04:00
docs              Move python docs to separate file             2023-07-01 17:54:29 -04:00
llama             client updates                                2023-07-06 16:34:44 -04:00
server            separate routes                               2023-07-06 16:34:44 -04:00
templates         move prompt templates out of python bindings  2023-07-06 16:34:44 -04:00
web               fix auto update route                         2023-07-06 16:18:40 -04:00
.dockerignore     update Dockerfile                             2023-07-06 16:34:44 -04:00
.gitignore        add templates to prompt command               2023-06-26 13:41:16 -04:00
.prettierrc.json  move .prettierrc.json to root                 2023-07-02 17:34:46 -04:00
Dockerfile        update Dockerfile                             2023-07-06 16:34:44 -04:00
go.mod            client updates                                2023-07-06 16:34:44 -04:00
go.sum            client updates                                2023-07-06 16:34:44 -04:00
LICENSE           proto -> ollama                               2023-06-26 15:57:13 -04:00
main.go           add llama.cpp go bindings                     2023-07-06 16:34:44 -04:00
models.json       format models.json                            2023-07-02 20:33:23 -04:00
README.md         add llama.cpp go bindings                     2023-07-06 16:34:44 -04:00


Ollama

An easy, fast runtime for large language models, powered by llama.cpp.

Note: this project is a work in progress. Certain models that can be run with ollama are intended for research and/or non-commercial use only.

Install

Using pip:

pip install ollama

Using docker:

docker run ollama/ollama

Quickstart

To run a model, use ollama run:

ollama run orca-mini-3b

You can also run models from Hugging Face:

ollama run huggingface.co/TheBloke/orca_mini_3B-GGML

Or directly via downloaded model files:

ollama run ~/Downloads/orca-mini-13b.ggmlv3.q4_0.bin

Building

go generate ./...
go build .

Documentation

Languages

Go 90.5%

C 3.4%

Shell 1.7%

TypeScript 1.3%

Makefile 0.9%

Other 2.1%