Local Models
Local Models is the cross-provider model management screen inside RealTimeX.
Open it from Settings > AI Providers > Local Model Management.
What this page is for
Use this page when you want one place to inspect and manage models across local backends.
The current product supports:
OllamaLocalAILM Studiollama.cppRealTimeX Local
This page is broader than RealTimeX Local, which focuses only on the managed built-in GGUF runtime.
Provider status
Each provider tab can appear as:
- online
- offline
- unconfigured
- loading
If a provider is unconfigured, RealTimeX expects you to finish its base URL or provider setup first in Settings > AI Providers > LLM, then return here.
What changes by provider
Not every provider supports the same actions.
In the current product:
OllamaandLocalAIare pull-oriented model registriesLM Studiomainly reflects models you already loaded inside LM Studiollama.cppreflects the running server and local GGUF filesRealTimeX Localuses the managed local models directory and built-in runtime
That means this page is best understood as one shell over several different local model ecosystems.
Common actions
Depending on the provider, the current UI can let you:
- add or pull a new model
- review installed model counts
- inspect size, modified time, digests, and IDs
- copy model identifiers
- load a model
- open model details
- edit per-model settings
- refresh or update metadata
- repair incomplete models
- delete models from local storage
Some actions are provider-specific, so a tab may expose fewer controls than another.
RealTimeX Local inside Local Models
The RealTimeX Local tab is where you manage the built-in GGUF inventory across the managed local models directory.
The current flow supports:
- recommended model downloads
- Hugging Face search
- manual repository or file entry
- default-model selection
- load and warmup behavior
- repair for incomplete artifacts
Use this tab when you want deeper inventory management than the simpler RealTimeX Local settings page.
Ollama and LocalAI
For Ollama and LocalAI, the current UI is oriented around pulling models by name or gallery ID.
Use these tabs when:
- you already run one of those providers
- you want RealTimeX to attach to that local model inventory
- you prefer provider-native model management instead of the managed built-in runtime
LM Studio
LM Studio behaves differently from pull-based providers.
In the current app, the usual expectation is:
- load a model inside LM Studio first
- then return to
Local Modelsto inspect it from RealTimeX
If the tab looks empty, it usually means LM Studio is not currently serving a loaded model to the instance.
llama.cpp
The llama.cpp tab is tied to the running llama.cpp server and its model directory.
The current UI can help you:
- inspect the active model
- review GGUF files on disk
- download GGUF files from Hugging Face
- understand when the model directory or server configuration is incomplete
If the server connection fails or no model directory is available, fix the llama.cpp runtime setup first, then return to the tab.
Model settings
Some providers expose a model settings modal with runtime tuning controls such as:
- keep-alive time
- temperature
- context window size
- max tokens
- advanced sampling controls
These settings are useful when you want a model to stay warm longer or behave differently without changing the whole provider configuration.
Incomplete and repairable models
The current UI can flag unhealthy local models, including cases like:
- missing multimodal projector files
- missing split GGUF parts
- partially downloaded artifacts
When a model is marked incomplete, repair it or remove and re-download it before using it in production workflows.
Typical workflow
- Configure the provider in
Settings > AI Providers > LLMif needed. - Open
Settings > AI Providers > Local Model Management. - Choose the provider tab you want to work with.
- Pull or download a model if none exist yet.
- Inspect details or adjust per-model settings if needed.
- Load the model where that provider supports explicit loading.
- Set the related provider as your active LLM in
Settings > AI Providers > LLM.
When to use Local Models
- Use it when you manage more than one local provider.
- Use it when you need visibility into model files, digests, health, and provider-specific operations.
- Use it when you want to troubleshoot why a local provider looks empty, offline, or incomplete.
For the managed built-in local path, see RealTimeX Local. For provider selection at the instance level, see Large Language Models.