CLI (v0.53.0)

Running models locally

Learn how to run models locally with the WriftAI CLI

Overview

wriftai model start runs a model locally, and wriftai model serve runs a model server for local development. Both commands currently rely on Docker.

Requirements

  • Docker, version 29.6.0 or later
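One way to check this requirement before running either command is sketched below. It is a rough helper, not part of wriftai: the function names version_ge and check_docker are hypothetical, and comparing against the Docker daemon (server) version rather than the client version is an assumption.

```shell
#!/bin/sh
# Minimum Docker version, taken from the requirement listed above.
REQUIRED="29.6.0"

# version_ge A B: succeeds when version A is greater than or equal to B.
# Relies on `sort -V` (version sort), as provided by GNU coreutils.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# check_docker (hypothetical helper): report whether the local Docker
# daemon meets REQUIRED. Assumes the daemon version is the relevant one.
check_docker() {
    if ! command -v docker >/dev/null 2>&1; then
        echo "docker not found on PATH"
        return 1
    fi
    # Ask the daemon for its version string only.
    installed="$(docker version --format '{{.Server.Version}}' 2>/dev/null)"
    if [ -z "$installed" ]; then
        echo "could not query the Docker daemon (is it running?)"
        return 1
    fi
    if version_ge "$installed" "$REQUIRED"; then
        echo "Docker $installed meets the $REQUIRED requirement"
    else
        echo "Docker $installed is older than $REQUIRED"
        return 1
    fi
}
```

Source the file and call check_docker to run the check. Pre-release suffixes in the version string are not handled specially.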

GPU support

Both commands can additionally expose GPUs to a model. This depends on Docker's NVIDIA GPU integration.

Prerequisites

To run a model locally with GPU acceleration, you also need:

If any of these are missing, the model runs on CPU instead of failing outright.

To confirm that your setup works without involving wriftai, run:

docker run -it --rm --gpus all ubuntu nvidia-smi

If this prints a GPU table, your host is ready. If it errors, the issue is in your Docker/NVIDIA setup rather than in wriftai.
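If you want to run the same check from a script, a minimal sketch might look like the following. The gpu_ready function and the DOCKER_BIN variable are hypothetical names introduced for illustration, not part of wriftai or Docker. The -it flags are dropped because a script usually has no interactive terminal to attach.

```shell
#!/bin/sh
# gpu_ready (hypothetical helper): succeeds if a throwaway container can
# run nvidia-smi with all GPUs exposed.
# DOCKER_BIN is a hypothetical override for dry runs; it defaults to docker.
gpu_ready() {
    if "${DOCKER_BIN:-docker}" run --rm --gpus all ubuntu nvidia-smi >/dev/null 2>&1; then
        echo "GPU visible inside containers"
    else
        # A failure here points at the Docker/NVIDIA setup, not at wriftai.
        echo "GPU check failed: inspect the Docker/NVIDIA setup"
        return 1
    fi
}
```

Calling gpu_ready returns a non-zero status on failure, so it can gate a later step, for example gpu_ready && echo "ready for GPU runs".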

Choosing which GPUs to expose

Both wriftai model start and wriftai model serve accept a --gpus flag. All available GPUs are exposed by default.

For the full list of accepted --gpus values and usage examples, see the model start and model serve reference pages.