Llama Server

A Mac menu bar app for controlling the llama.cpp server.

Under the hood, this is simply a set of bash scripts wrapped in Platypus. It serves as a convenient shortcut to:

Switch between models using a dropdown menu
Configure different server flags for different models
Sandbox llama.cpp with sandbox-exec

Usage

Download it from Releases and drag it into your Applications directory.

The app is currently unsigned, so you'll need to enable it from "System Settings > Privacy & Security".

Click the menu bar icon to create your configuration file (~/.config/llamaserver/options.sh). This defines the locations of your llama.cpp server script and models, as well as a list of CLI flags.
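As a rough illustration of the kind of file described above, the sketch below shows what an options.sh could look like. Every variable name here is a hypothetical placeholder rather than the app's actual schema, and the paths are made up; check the generated file for the real names. The --host and --port flags are ordinary llama.cpp server flags used only as examples.

```bash
# Hypothetical ~/.config/llamaserver/options.sh.
# All variable names below are illustrative assumptions, not the app's schema.

# Path to the llama.cpp server binary or launch script (made-up location).
LLAMA_SERVER_BIN="$HOME/llama.cpp/server"

# Directory scanned for *.gguf and *.model.sh files (made-up location).
LLAMA_SERVER_MODELS_DIR="$HOME/models"

# CLI flags passed to the server regardless of which model is loaded.
LLAMA_SERVER_DEFAULT_OPTIONS=(
  --host 127.0.0.1
  --port 8080
)
```

Because the file is plain bash, a wrapper script can read it with `source` and then use the variables directly.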

Use "Load model" to choose from a list of *.gguf and *.model.sh files in your configured models directory.
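The listing step can be pictured with a small bash helper. This is a minimal sketch, not the app's actual code: list_models is a hypothetical function name, and it only looks at files directly inside the directory it is given.

```bash
# Print the entries a "Load model" menu could offer: *.gguf weights first,
# then *.model.sh override files, one file name per line.
# list_models is a hypothetical helper name.
list_models() {
  local dir="$1" file
  for file in "$dir"/*.gguf "$dir"/*.model.sh; do
    # An unmatched glob stays as literal text, so skip anything that is
    # not an existing regular file.
    [ -f "$file" ] || continue
    basename "$file"
  done
}
```

Calling `list_models "$HOME/models"` prints nothing when the directory holds no matching files, which lets a caller show an empty menu rather than a literal glob pattern.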

While a running llama.cpp/server process is detected, the menu shows the last 5 lines of server output. Click "Stop server" to unload the model.
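One plausible way to build that status view in bash is to pair a process check with `tail`. The sketch below assumes the server output is redirected to a log file, which the text above does not confirm; the function names and the pgrep pattern are likewise assumptions.

```bash
# Succeed if some process command line matches the pattern.
# The pattern is an assumption about how the server binary is invoked.
server_is_running() {
  pgrep -f "llama.cpp/server" >/dev/null 2>&1
}

# Print the last 5 lines of a log file, or nothing if it does not exist yet.
# The log file location is whatever the caller passes in (hypothetical).
recent_server_output() {
  local log_file="$1"
  [ -f "$log_file" ] || return 0
  tail -n 5 "$log_file"
}
```

A menu script could then run `server_is_running && recent_server_output "$log_file"` each time the icon is clicked, which matches the click-to-refresh behavior described here.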

Due to a limitation of Platypus, the menu bar icon doesn't update in the background. You need to click the icon again to get fresh logs.

Model-Specific Overrides

Create a *.model.sh file in your configured models directory to set server flags that only apply to that model. For example:

LLAMA_SERVER_MODEL_OPTIONS=(
  --model ~/path/to/gguf
  --ctx-size 16384
  --chat-template chatml
)
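Since a *.model.sh file is plain bash, a launcher can source it and append the resulting array to its default flags. The following is a minimal sketch of that idea, not the app's actual implementation: build_server_args, LLAMA_SERVER_DEFAULT_OPTIONS, and SERVER_ARGS are hypothetical names, and only LLAMA_SERVER_MODEL_OPTIONS comes from the example above.

```bash
# Default flags applied to every model (hypothetical name and value).
LLAMA_SERVER_DEFAULT_OPTIONS=(--ctx-size 4096)

# Fill SERVER_ARGS for the selected menu entry.
build_server_args() {
  local selection="$1"
  LLAMA_SERVER_MODEL_OPTIONS=()
  case "$selection" in
    *.model.sh)
      # Sourcing the override file populates LLAMA_SERVER_MODEL_OPTIONS.
      source "$selection"
      ;;
    *)
      # A bare weights file only needs --model.
      LLAMA_SERVER_MODEL_OPTIONS=(--model "$selection")
      ;;
  esac
  # Model-specific flags are appended after the defaults.
  SERVER_ARGS=("${LLAMA_SERVER_DEFAULT_OPTIONS[@]}" "${LLAMA_SERVER_MODEL_OPTIONS[@]}")
}
```

Placing the model flags last is a design choice that relies on the server honoring the later of two repeated flags, which is an assumption worth verifying against your llama.cpp build.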

Sandboxing

macOS ships with an officially deprecated, poorly documented, yet heavily depended-upon sandboxing utility called sandbox-exec. Llama Server can optionally run llama.cpp with this utility for (plausibly) more security.

You can use the included example.sandbox.sb as a starting point, and uncomment the sandbox path to enable this feature.
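To make the optional behavior concrete, the sketch below prefixes a command with `sandbox-exec -f <profile>` only when a profile path is set. The helper name build_launch_command and the LAUNCH_COMMAND array are hypothetical, and the server path and flags in the usage line are placeholders; the sketch only assembles the command and does not run it.

```bash
# Assemble the launch command, wrapping it in sandbox-exec when a profile
# path is given. An empty first argument means "no sandbox".
# build_launch_command and LAUNCH_COMMAND are hypothetical names.
build_launch_command() {
  local profile="$1"
  shift
  if [ -n "$profile" ]; then
    # sandbox-exec -f reads the sandbox rules from the given profile file.
    LAUNCH_COMMAND=(sandbox-exec -f "$profile" "$@")
  else
    LAUNCH_COMMAND=("$@")
  fi
}
```

After `build_launch_command "$profile" ./server --port 8080`, the command can be started with `"${LAUNCH_COMMAND[@]}"`. Keeping it in an array preserves arguments that contain spaces.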

Roadmap

Set per-model server settings (ex: ctx-size)
Include sandbox-exec script
Support multiple model folders
Cache the server script PID for better pkill precision
Find a way to deliver realtime menu bar icon updates
Notify when server is ready

Credits

llama.cpp for the excellent foundation.
Platypus for making wrapper apps easy.
Draw Things for the spiffy icon.

kaizau/llama-server-osx