Local Models

Local Models is the cross-provider model management screen inside RealTimeX.

Open it from Settings > AI Providers > Local Model Management.

What this page is for

Use this page when you want one place to inspect and manage models across local backends.

The current product supports:

Ollama
LocalAI
LM Studio
llama.cpp
RealTimeX Local

This page is broader than RealTimeX Local, which focuses only on the managed built-in GGUF runtime.

Provider status

Each provider tab can appear as:

online
offline
unconfigured
loading

If a provider is unconfigured, RealTimeX expects you to finish its base URL or provider setup first in Settings > AI Providers > LLM, then return here.

What changes by provider

Not every provider supports the same actions.

In the current product:

Ollama and LocalAI are pull-oriented model registries
LM Studio mainly reflects models you already loaded inside LM Studio
llama.cpp reflects the running server and local GGUF files
RealTimeX Local uses the managed local models directory and built-in runtime

That means this page is best understood as one shell over several different local model ecosystems.

Common actions

Depending on the provider, the current UI can let you:

add or pull a new model
review installed model counts
inspect size, modified time, digests, and IDs
copy model identifiers
load a model
open model details
edit per-model settings
refresh or update metadata
repair incomplete models
delete models from local storage

Some actions are provider-specific, so a tab may expose fewer controls than another.

RealTimeX Local inside Local Models

The RealTimeX Local tab is where you manage the built-in GGUF inventory across the managed local models directory.

The current flow supports:

recommended model downloads
Hugging Face search
manual repository or file entry
default-model selection
load and warmup behavior
repair for incomplete artifacts

Use this tab when you want deeper inventory management than the simpler RealTimeX Local settings page.

Ollama and LocalAI

For Ollama and LocalAI, the current UI is oriented around pulling models by name or gallery ID.

Use these tabs when:

you already run one of those providers
you want RealTimeX to attach to that local model inventory
you prefer provider-native model management instead of the managed built-in runtime

LM Studio

LM Studio behaves differently from pull-based providers.

In the current app, the usual expectation is:

load a model inside LM Studio first
then return to Local Models to inspect it from RealTimeX

If the tab looks empty, it usually means LM Studio is not currently serving a loaded model to the instance.

llama.cpp

The llama.cpp tab is tied to the running llama.cpp server and its model directory.

The current UI can help you:

inspect the active model
review GGUF files on disk
download GGUF files from Hugging Face
understand when the model directory or server configuration is incomplete

If the server connection fails or no model directory is available, fix the llama.cpp runtime setup first, then return to the tab.

Model settings

Some providers expose a model settings modal with runtime tuning controls such as:

keep-alive time
temperature
context window size
max tokens
advanced sampling controls

These settings are useful when you want a model to stay warm longer or behave differently without changing the whole provider configuration.

Incomplete and repairable models

The current UI can flag unhealthy local models, including cases like:

missing multimodal projector files
missing split GGUF parts
partially downloaded artifacts

When a model is marked incomplete, repair it or remove and re-download it before using it in production workflows.

Typical workflow

Configure the provider in Settings > AI Providers > LLM if needed.
Open Settings > AI Providers > Local Model Management.
Choose the provider tab you want to work with.
Pull or download a model if none exist yet.
Inspect details or adjust per-model settings if needed.
Load the model where that provider supports explicit loading.
Set the related provider as your active LLM in Settings > AI Providers > LLM.

When to use Local Models

Use it when you manage more than one local provider.
Use it when you need visibility into model files, digests, health, and provider-specific operations.
Use it when you want to troubleshoot why a local provider looks empty, offline, or incomplete.

For the managed built-in local path, see RealTimeX Local. For provider selection at the instance level, see Large Language Models.

RealTimeX Local Text Splitting & Chunking