🤖 Models

Wrap any model with custom instructions, tools, and knowledge to build specialized agents.

The Models workspace lets you create configuration presets that sit on top of any base model. Pick GPT-4o, Claude, Llama 3, or anything else connected to Open WebUI, then bind a system prompt, knowledge bases, tools, skills, and parameter overrides to it. The result is a purpose-built agent that behaves exactly the way you need without modifying the underlying model.

A "Python Tutor" that always uses your style guide. A "Meeting Summarizer" with your company's template. A "Code Reviewer" with your linting rules baked in. Every agent is a thin wrapper: pick a base model, configure it, and share it with your team.

Why Models?

One base model, many personas

The same GPT-4o can power a coding assistant, a customer support bot, and a creative writer. Each preset has its own system prompt, tools, and knowledge, so the model behaves differently depending on which preset is selected.

Knowledge and tools come pre-attached

Instead of manually attaching documents and enabling tools in every chat, bind them once to the model preset. Users get a fully configured agent out of the box.

Granular access control

Restrict models to specific users or groups. A finance team sees their models; engineering sees theirs. Admins control what's available instance-wide.

Dynamic system prompts

Use Jinja2-style variables like `{{ USER_NAME }}` and `{{ CURRENT_DATE }}` so the system prompt adapts to each user and session automatically.

Key Features


Feature	Description
🧩 Model presets	System prompt, tools, knowledge, skills, and parameters in one package
🏷️ Dynamic variables	`{{ USER_NAME }}`, `{{ CURRENT_DATE }}`, `{{ CURRENT_TIME }}` injected automatically
🔧 Bound tools	Force-enable specific tools per model
📚 Attached knowledge	Knowledge bases and files always available via RAG or full context
🎭 Skills	Bind markdown instruction sets loaded on-demand via `view_skill`
👥 Access control	Restrict to specific users or groups
📊 Global defaults	Set baseline capabilities and parameters for all models at once
🔊 Per-model TTS voice	Give each persona its own voice

Creating a Model

Click + New Model in Workspace > Models, or click the ellipsis (...) on an existing model and select Edit.

Core configuration

Field	Description
Avatar	Upload a custom image. Animated GIF and WebP are supported
Name and ID	Display name and unique identifier
Base Model	The actual model that powers this agent
Description	Short summary shown in the model selector
Tags	Organize models in the dropdown
Visibility	Private (specific users/groups) or public

System prompt and variables

The system prompt defines the behavior and persona. Use dynamic variables for context-aware instructions:

Variable	Output example
`{{ CURRENT_DATE }}`	`2024-10-27`
`{{ CURRENT_TIME }}`	`14:30:05`
`{{ USER_NAME }}`	`Admin`
`{{ USER_GROUPS }}`	`Engineering, Beta Testers` (comma-separated; empty if the user is in no groups)

You are a helpful assistant for {{ USER_NAME }}.
The current date is {{ CURRENT_DATE }}.
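
To make the substitution concrete, here is a minimal Python sketch of how placeholders like these could be resolved. It is an illustration only, not Open WebUI's actual implementation: the helper name `render_system_prompt` and the regular expression are assumptions, and the date and time formats simply mirror the output examples in the table above.

```python
import re
from datetime import datetime

# Hypothetical helper for illustration; not Open WebUI's real code.
PLACEHOLDER = re.compile(r"\{\{\s*([A-Z_]+)\s*\}\}")


def render_system_prompt(template: str, user_name: str, now: datetime) -> str:
    """Replace known {{ VARIABLE }} placeholders; leave unknown ones untouched."""
    values = {
        "USER_NAME": user_name,
        "CURRENT_DATE": now.strftime("%Y-%m-%d"),
        "CURRENT_TIME": now.strftime("%H:%M:%S"),
    }
    # Unknown variable names fall back to the original placeholder text.
    return PLACEHOLDER.sub(lambda m: values.get(m.group(1), m.group(0)), template)


template = (
    "You are a helpful assistant for {{ USER_NAME }}.\n"
    "The current date is {{ CURRENT_DATE }}."
)
rendered = render_system_prompt(template, "Admin", datetime(2024, 10, 27, 14, 30, 5))
print(rendered)
# You are a helpful assistant for Admin.
# The current date is 2024-10-27.
```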

Group-aware system prompts

`{{ USER_GROUPS }}` lets a single shared model adapt its behavior to the caller's RBAC groups. For example: "You may discuss internal roadmap items only when {{ USER_GROUPS }} contains 'Engineering'." The placeholder is resolved server-side at chat time, and the database lookup for the user's groups runs only when the template actually references the variable.
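
The "lookup only when referenced" behavior can be sketched as follows. This is a hedged illustration rather than the real server code: `render_user_groups` and `fake_lookup` are hypothetical names, the lookup callable stands in for the database query, and the sketch assumes the placeholder is written exactly as `{{ USER_GROUPS }}`.

```python
from typing import Callable


def render_user_groups(template: str, fetch_groups: Callable[[], list[str]]) -> str:
    """Resolve {{ USER_GROUPS }}, calling fetch_groups only if the placeholder is present."""
    placeholder = "{{ USER_GROUPS }}"
    if placeholder not in template:
        return template  # variable not referenced, so no lookup happens
    # Comma-separated list; an empty list yields an empty string.
    return template.replace(placeholder, ", ".join(fetch_groups()))


calls = []


def fake_lookup() -> list[str]:
    """Stand-in for the group lookup; records each invocation."""
    calls.append(1)
    return ["Engineering", "Beta Testers"]


plain = render_user_groups("You are a helpful assistant.", fake_lookup)
grouped = render_user_groups("Caller groups: {{ USER_GROUPS }}.", fake_lookup)
print(grouped)     # Caller groups: Engineering, Beta Testers.
print(len(calls))  # 1
```

Only the second template triggers the lookup, which is why templates that never mention the variable pay no extra cost.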

Capabilities and bindings

Toggle what the model can do and bind resources:

Setting	What it controls
Knowledge	Bind collections or files. Click attached items to toggle between Focused Retrieval and Full Context. See Retrieval Modes
Tools	Force-enable specific tools (e.g., Calculator for a Math Bot)
Skills	Bind Skills so their manifests are always injected
Filters	Attach pipeline filters (e.g., PII redaction)
Actions	Attach action scripts (e.g., "Add to Memories")
Vision	Enable image analysis (requires a vision-capable base model)
Web Search	Enable the configured search provider
Code Interpreter	Enable Python code execution
Image Generation	Enable image generation
Builtin Tools	Control which tool categories are available: Time, Memory, Chats, Notes, Knowledge, Channels, Task Management, Automations
File Context	When enabled, attached files are processed via RAG. When disabled, no file content is extracted
TTS Voice	Set a specific voice for this model's responses

Advanced parameters

Stop Sequences: Force-stop generation on specific strings (e.g., `<|end_of_text|>`, `User:`). Press Enter after each.
Temperature, Top P, etc.: Adjust the trade-off between creative (more random) and deterministic output.
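
The effect of stop sequences can be illustrated with a short Python sketch. In practice the inference backend enforces them during generation; the hypothetical function `apply_stop_sequences` below only mimics the visible result on a finished string.

```python
def apply_stop_sequences(text: str, stops: list[str]) -> str:
    """Cut text at the earliest occurrence of any stop sequence."""
    cut = len(text)
    for stop in stops:
        if not stop:
            continue  # ignore empty entries
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


raw = "The answer is 4.\nUser: thanks<|end_of_text|>"
trimmed = apply_stop_sequences(raw, ["<|end_of_text|>", "User:"])
print(repr(trimmed))  # 'The answer is 4.\n'
```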

Prompt suggestions

Clickable starter chips that appear when a user opens a fresh chat with this model. Add phrases like "Explain this code step-by-step" or "Summarize this document" to guide users.

Model Management

From the model list, click the ellipsis (...) on any model:

Action	Description
Edit	Open the configuration panel
Hide	Remove from the model selector without deleting
Clone	Create a copy (appends `-clone`)
Copy Link	Copy a direct URL to the model settings
Export	Download the configuration as `.json`
Share	Share to the Open WebUI community
Delete	Permanently remove the preset

Import and export

Import: From .json files or Open WebUI community links
Export: Download all custom model configurations as a single .json
Discover: Browse community presets at the bottom of the page

Downloading base models

To download new base models, go to Settings > Connections > Ollama or type `ollama run hf.co/{username}/{repository}:{quantization}` in the model selector.

Global Model Defaults (Admin)

Administrators can set baseline capabilities and parameters that apply to all models via Admin Panel > Settings > Models > ⚙️ (gear icon).

Default Model Metadata (`DEFAULT_MODEL_METADATA`): Baseline capabilities (vision, web search, file context, code interpreter, builtin tools). Per-model overrides always win on conflicts.
Default Model Params (`DEFAULT_MODEL_PARAMS`): Baseline inference parameters (`temperature`, `top_p`, `max_tokens`, `function_calling`). Per-model values take precedence when explicitly set. This value is loaded from the environment as JSON; invalid JSON is ignored and falls back to `{}`.
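
The "invalid JSON falls back to `{}`" rule can be sketched in Python as below. The helper `load_json_env` is hypothetical and not taken from the Open WebUI codebase; treating valid JSON that is not an object as `{}` is an additional assumption of this sketch.

```python
import json
import os


def load_json_env(name: str) -> dict:
    """Read a JSON object from an environment variable, falling back to {}."""
    raw = os.environ.get(name, "")
    try:
        value = json.loads(raw) if raw else {}
    except json.JSONDecodeError:
        return {}  # invalid JSON is ignored
    # Assumption: anything other than a JSON object is also treated as {}.
    return value if isinstance(value, dict) else {}


os.environ["DEFAULT_MODEL_PARAMS"] = '{"temperature": 0.7, "function_calling": "native"}'
params = load_json_env("DEFAULT_MODEL_PARAMS")
print(params)  # {'temperature': 0.7, 'function_calling': 'native'}

os.environ["DEFAULT_MODEL_PARAMS"] = "{temperature: 0.7}"  # unquoted key: invalid JSON
broken = load_json_env("DEFAULT_MODEL_PARAMS")
print(broken)  # {}
```

Because a malformed value fails silently, it is worth validating the JSON before deploying it.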

Merge behavior

Setting type	Strategy	Example
Capabilities	Deep merge	Global sets `file_context: false`, model sets `vision: true` → model gets both
Other metadata	Fill-only	Global sets description, model has none → model gets the global value
Parameters	Simple merge	Global sets `temperature: 0.7`, model sets `0.3` → model gets `0.3`
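
The three strategies in the table can be sketched together in Python. This is an illustrative approximation, not the actual merge code: the function name `merge_with_defaults` and the dictionary layout are assumptions, and "model has none" is interpreted here as the key being missing, `None`, or an empty string.

```python
def merge_with_defaults(global_meta: dict, global_params: dict,
                        model_meta: dict, model_params: dict) -> tuple[dict, dict]:
    """Apply global defaults to one model using the three strategies above."""
    # Capabilities: deep merge, key by key; the model wins on conflicts.
    capabilities = {
        **global_meta.get("capabilities", {}),
        **model_meta.get("capabilities", {}),
    }

    # Other metadata: fill-only; a global value is used only where the model has none.
    meta = dict(model_meta)
    for key, value in global_meta.items():
        if key != "capabilities" and meta.get(key) in (None, ""):
            meta[key] = value
    meta["capabilities"] = capabilities

    # Parameters: simple merge; explicitly set model values take precedence.
    params = {**global_params, **model_params}
    return meta, params


meta, params = merge_with_defaults(
    global_meta={"capabilities": {"file_context": False}, "description": "Company default"},
    global_params={"temperature": 0.7},
    model_meta={"capabilities": {"vision": True}},
    model_params={"temperature": 0.3},
)
print(meta["capabilities"])  # {'file_context': False, 'vision': True}
print(meta["description"])   # Company default
print(params)                # {'temperature': 0.3}
```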

Knowledge base + function calling interaction

Setting `function_calling: native` in global params changes how all models handle attached knowledge bases. In native mode, model-attached KBs are not auto-injected into the prompt; the model must call builtin tools to retrieve knowledge. If your knowledge bases suddenly stop working, check global defaults first.
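
When diagnosing this, it helps to reason about the effective parameter after global and per-model values are merged. The Python sketch below is a hypothetical check, not part of Open WebUI: it assumes the simple-merge rule for parameters and that any value other than `native` leaves auto-injection on.

```python
def kb_auto_injected(global_params: dict, model_params: dict) -> bool:
    """Return True if attached knowledge bases would be auto-injected."""
    effective = {**global_params, **model_params}  # per-model values win
    return effective.get("function_calling") != "native"


# Global default switches every model without its own value to native mode.
print(kb_auto_injected({"function_calling": "native"}, {}))  # False
# No native setting anywhere: knowledge bases are injected automatically.
print(kb_auto_injected({}, {}))  # True
```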

See Knowledge Base troubleshooting.

Bulk management

Filter the admin model list by status (Enabled, Disabled, Visible, Hidden) and use Bulk Actions to enable or disable all models in the current view at once. Useful when external providers expose hundreds of models.

Model Switching in Chat

Switch models mid-conversation without losing context. Select up to two models simultaneously to compare responses side-by-side, using the arrow buttons to navigate between them.

Use Cases

Team-specific agents

Create a "Sales Assistant" with your CRM knowledge base, objection-handling prompts, and email drafting tools. Share it with the sales group. Engineering never sees it.

Onboarding new users

Build models with descriptive prompt suggestions ("Ask me about our company policies", "Help me set up my development environment") so new team members know exactly what to ask.

Enforcing organizational standards

Set global defaults to disable code interpreter across all models, enforce a consistent temperature, or require function calling. Individual models can override when needed.

Curated-Interface Deployments

A common deployment pattern is to present regular users with a curated model — a preconfigured agent with a specific name, icon, system prompt, and tools — while keeping the underlying base model visible only to power users or admins who need direct access.

The recommended pattern: two base model entries

The correct way to achieve differential visibility is to create two separate base model entries that point to the same underlying LLM:

Entry	Access	Hidden	Who sees it	Purpose
Base model (e.g. "GPT-4o")	Restricted to power users	No	Power users only	Direct exploration and testing
Curated model (e.g. "Company Assistant")	Public	No	Everyone	The sanctioned product for regular users

The curated model is a first-class base model entry — not a workspace model wrapping the restricted one. Configure it with its own name, avatar, system prompt, knowledge bases, tools, and parameter overrides. It connects to the same upstream LLM but is an independent configuration entry.

Step-by-step setup:

1. In Admin Panel > Models, locate your base model (e.g. "GPT-4o").
2. Set its access control to Private and grant access only to your power users / admin group.
3. Click the ellipsis (...) on the base model and select Clone. This creates a copy with all settings.
4. Rename the clone to your curated product name (e.g. "Company Assistant"). Update the avatar, system prompt, knowledge, and tools as needed.
5. Set the curated model's access to Public (or restrict it to the groups that should see it).

Now power users see and use the original base model directly, while regular users see only the curated model. Both entries point to the same upstream LLM but are configured independently.

Upgrading the upstream model

When you switch to a newer LLM (e.g. Qwen 3 → Qwen 3.5), update the base model selection on both entries. You can also use Export and Import to keep settings synchronized across entries.

Why not a workspace model on a restricted base?

Workspace models inherit the access requirements of their base model. If a user does not have access to the base model, they cannot use any workspace model built on top of it — even if the workspace model itself is shared with them.

This is by design. Without this requirement, anyone could bypass base model access restrictions by creating a workspace model on a restricted base and sharing it publicly. That would be broken access control.
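
The inheritance rule can be expressed as a small access check. The Python sketch below is a simplified model for illustration only: the `Model` dataclass, its fields, and the function names are hypothetical and do not reflect Open WebUI's real data model.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Model:
    id: str
    public: bool = False
    allowed_groups: set[str] = field(default_factory=set)
    base: Optional["Model"] = None  # None for a base model entry


def has_direct_access(model: Model, user_groups: set[str]) -> bool:
    """Access granted by the entry's own settings, ignoring its base."""
    return model.public or bool(model.allowed_groups & user_groups)


def can_use(model: Model, user_groups: set[str]) -> bool:
    """A workspace model is usable only if its base model is usable too."""
    if not has_direct_access(model, user_groups):
        return False
    return model.base is None or can_use(model.base, user_groups)


restricted_base = Model("gpt-4o", allowed_groups={"power-users"})
wrapper = Model("company-assistant", public=True, base=restricted_base)
curated_entry = Model("company-assistant-entry", public=True)  # independent base entry

print(can_use(wrapper, {"sales"}))        # False: base model is restricted
print(can_use(wrapper, {"power-users"}))  # True
print(can_use(curated_entry, {"sales"}))  # True: no restricted base in the chain
```

The last line shows why the two-base-model pattern works: the curated entry has no restricted base to inherit from.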

Warning: If you previously relied on workspace models to give users access to base models they couldn't see directly, that pattern depended on an access-control gap that has been patched. The two-base-model pattern described above achieves the same outcome without the security issue.

Alternative: hidden base model

If you don't need differential visibility — meaning no group needs to see the raw base model in the picker — you can use a simpler approach:

1. Set the base model to Public (so everyone has access).
2. Hide the base model (ellipsis > Hide) so it doesn't appear in the model selector.
3. Create a workspace model on top of the (now hidden) base model and share it with your users.

Users see only the workspace model. The hidden base model is accessible under the hood but invisible in the UI. Admins can still access hidden models via direct URL parameters.

This approach works when every user should have the same experience. It does not work when some groups need direct access to the base model in their picker.

Limitations

Preset, not fine-tune

Model presets configure behavior through system prompts and tool bindings. They do not modify the underlying model weights. For deep behavioral changes, you need actual fine-tuning.

Fallback requires configuration

If a base model becomes unavailable, the preset will fail unless `ENABLE_CUSTOM_MODEL_FALLBACK` is set to `True` and a default model is configured in Admin Panel > Settings > Models.
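
The fallback rule can be sketched as a small resolution function. This Python example is an assumption-laden illustration, not the actual implementation: `resolve_base_model` and its arguments are hypothetical, with `fallback_enabled` standing in for `ENABLE_CUSTOM_MODEL_FALLBACK` and `default_model` for the default configured in the admin settings.

```python
from typing import Optional


def resolve_base_model(requested: str, available: set[str],
                       fallback_enabled: bool, default_model: Optional[str]) -> str:
    """Pick the model that serves a preset, falling back only when configured."""
    if requested in available:
        return requested
    # Fallback needs both the flag and a configured default model.
    if fallback_enabled and default_model:
        return default_model
    raise LookupError(f"base model {requested!r} is unavailable")


available = {"llama3"}
print(resolve_base_model("llama3", available, False, None))     # llama3
print(resolve_base_model("gpt-4o", available, True, "llama3"))  # llama3
```

With the flag off, or with no default configured, the same call raises `LookupError`, which corresponds to the preset failing.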

This content is for informational purposes only and does not constitute a warranty, guarantee, or contractual commitment. Open WebUI is provided "as is." See your license for applicable terms.
