Secure LLM with Pomerium

This project is based off the instructions in Pomerium's Self-Hosted LLM Behind Pomerium guide.

What's do you get?

Open WebUI with your local LLMs, care of Ollama, secured by Pomerium.

Prerequisites

Pomerium Zero account
Docker and Docker Compose or Orbstack
Port 443 open on your router or wherever you host this project.

Configure Pomerium Zero

When you create a Pomerium Zero account, you will be taken to the Pomerium Zero console. Here you can create a new route for Open WebUI. There will already be one for the verify route which is just a page to show your claims in your JWT when logged in.

You'll also have one policy out of the box which will allow access to only you via the email you used to sign up with.

Follow the steps in the docs to add a route for Open WebUI and a policy to allow access to it.

Environment Configuration

To configure the environment for this project, you need to set up the .env file with the following variables:

POMERIUM_ZERO_TOKEN=replace-with-your-pomerium-zero-token
WEBUI_URL=https://llm.yourdomain.pomerium.app # or whatever you called your route for Open WebUI in the Pomerium Zero console.
VERIFY_URL=https://verify.yourdomain.pomerium.app
GPU_DEVICE=/dev/dri/your-device # e.g. /dev/dri/renderD128

Make sure to replace the placeholder values with your actual configuration details. This setup is crucial for the proper functioning of the Pomerium Zero integration and GPU acceleration if applicable.

GPU Acceleration (Optional)

If your machine has a GPU (e.g. AMD, Intel, or NVIDIA), you can enable GPU access for Open WebUI to accelerate model inference.To do so, uncomment the devices and group_add lines in the docker-compose.yaml and configure your GPU device in your .env file.

For example, on an AMD GPU (using /dev/dri/renderD128):

GPU_DEVICE=/dev/dri/renderD128

Make sure your user has access to the GPU device (usually by being in the video group):

getent group video

Note: GPU acceleration requires that Open WebUI and Ollama support your GPU type. Currently, support for AMD and Intel GPUs may be limited or experimental.

Adding Local Models

To add local models within the container, follow these steps:

Access the Container:
- First, make sure your container is running. You can access it using:
```
docker exec -it open-webui /bin/bash
```
Download Local Models:
- Once inside the container, you can download the desired local models using:
```
ollama pull <model_name>
```
- Replace <model_name> with the specific model you want to pull.
Verify the Model:
- After pulling, verify that the model has been successfully added by listing the available models:
```
ollama list
```

Ensure that your container has internet access to pull models successfully. If you encounter any issues, check your container's network settings or consult the Ollama documentation for troubleshooting tips.

Custom Domain

You can also use your own domain with Pomerium Zero. For more information, see the Pomerium Zero documentation for custom domains.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
compose.yaml		compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Secure LLM with Pomerium

What's do you get?

Prerequisites

Configure Pomerium Zero

Environment Configuration

GPU Acceleration (Optional)

Adding Local Models

Custom Domain

About

Uh oh!

Releases

Packages

Uh oh!

nickytonline/secure-llm-pomerium

Folders and files

Latest commit

History

Repository files navigation

Secure LLM with Pomerium

What's do you get?

Prerequisites

Configure Pomerium Zero

Environment Configuration

GPU Acceleration (Optional)

Adding Local Models

Custom Domain

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages