Olium Agent

olium is the in-process AI agent runtime that powers every agentic feature in Vigolium. It ships as both:

A user-facing command — vigolium agent olium (aliases: vigolium olium, vigolium ol) — for interactive chat in a TUI or scripted one-shot prompts.
A library — pkg/olium/ — that the autopilot, swarm, query, archon-prep, and source-analysis paths all dispatch through. There are no subprocess SDK backends; every AI call in vigolium goes through this engine.

For a higher-level comparison against the other agent subcommands, see Agent Mode.

What it is

A turn-based, tool-using LLM agent written in Go. Components:

Layer	Lives in	Responsibility
Engine	`pkg/olium/engine/`	Multi-turn loop: provider stream → tool dispatch → history append → repeat
Provider	`pkg/olium/provider/`	LLM backend — five drivers (codex-oauth, anthropic-api-key, claude-oauth, openai-api-key, claude-code-cli)
Tools	`pkg/olium/tool/`	The eight built-in primitives the model can call (bash, file ops, search, web fetch)
Skills	`pkg/olium/skill/`	SKILL.md workflow files (agentskills.io format) discovered from project, user, and embedded scopes
TUI	`pkg/olium/tui/`	Bubble Tea front-end (inline scrollback, slash commands, live tool cards)
Headless	`pkg/olium/headless.go`	Non-interactive single-prompt runner for scripts and smoke tests
Autopilot	`pkg/olium/autopilot/`	Long-running autonomous scan loop on top of the engine, with budgets, halt signal, and `report_finding`
Vigolium tools	`pkg/olium/vigtool/`	Scanner-aware extensions: `run_scan`, `run_extension`, `list_sessions`, `list_findings`, `auth_session_lookup`, etc.
Auth	`pkg/olium/auth/`	Codex OAuth credential loading and refresh (handles `~/.codex/auth.json`)

What it does

Each invocation runs one multi-turn loop:

Append the user prompt to history.
Stream a single provider response (text deltas, thinking deltas, tool calls).
Append the assistant turn to history; emit EventTurnDone with token usage.
If there are no tool calls → emit EventRunDone and exit.
Otherwise dispatch the tool calls. If all calls are read-only the engine fans them out in parallel (cap = 8); otherwise it runs them strictly serially so writes can’t race reads. Tool results are appended to history in the model’s original order regardless.
Loop back to step 2 — capped by MaxTurns (default 32 for chat / headless, 200 for autopilot).

Surrounding behavior:

Tool result truncation / spill — results larger than MaxToolResultBytes (default 16 KiB) get head+tail truncation with an elision marker. If SpillDir is set (autopilot does this), the full payload spills to <SpillDir>/tool-results/ and the model gets a head excerpt plus an on-disk path it can read_file.
Per-tool timeout — each tool invocation gets its own deadline (default 5 minutes). A runaway bash curl can’t hang the whole session.
Prompt caching — opt-in via EnablePromptCache. Autopilot turns it on; the Anthropic provider then writes cache_control: ephemeral markers on the system prompt and tool list, cutting repeated-prefix tokens by ~90 % across long runs. Providers without caching ignore the flag.
Skills — when a registry is loaded the engine injects an <available_skills> block into the system prompt at construction, and registers a load_skill tool the model can call to fetch a skill body on demand.

Modes

Interactive TUI (default)

vigolium olium                     # chat
vigolium ol                        # alias
vigolium agent olium               # full path
vigolium ol "audit this repo"      # auto-submitted first prompt
echo "summarise" | vigolium ol     # stdin auto-detected when piped

Bubble Tea inline mode (no alt-screen — output appends to scrollback as it streams). Live partial line, fenced code-block highlighting via chroma, and a one-line “tool exec” card while a tool runs. Slash chooser opens on /:

/clear — clear conversation history.
/skill:<name> [args] — inline expansion of a loaded skill; the body is pasted into the prompt so the model doesn’t have to spend a tool call to load_skill.

The model id, provider, and reasoning effort are shown in the banner header.

One-shot non-interactive

Passing -p / --prompt runs a single prompt non-interactively and streams to stdout — the TUI is skipped automatically.

vigolium ol -p "list every route in this repo"

Prints assistant text to stdout; thinking deltas, tool start/end cards, and per-turn [turn done in= out= cached=] summaries go to stderr. Exits non-zero on engine error.

Library use (autopilot, swarm, query)

pkg/agent/olium_adapter.go is the single dispatch path every other agent feature funnels through:

runOliumPrompt(ctx, cfg, prompt, streamWriter, sourcePath) — fresh engine per call.
runOliumOnEngine(ctx, cfg, eng, prompt, streamWriter) — reuses an engine so the conversation prefix stays warm (used by source-analysis to fork an explore phase into 3 parallel format calls).
acquireProviderSlot(ctx, cfg) — global semaphore (size = agent.olium.max_concurrent, default 4) that bounds in-flight provider calls process-wide so swarm phase fan-out can’t trigger 429s on tier-1 plans.
EffectiveCallTimeout() — default 10 min per provider call; 0 → default, negative → no timeout.

Providers

Five drivers in pkg/olium/provider/. The provider name spells out the auth mechanism so it’s obvious which credential field applies:

Provider	Auth	Default model	Source of credential
`codex-oauth` (default)	OAuth credential file	`gpt-5.5`	`--oauth-cred` → `agent.olium.oauth_cred_path` → `~/.codex/auth.json` (produced by `codex login`)
`anthropic-api-key`	`x-api-key` header	`claude-opus-4-7`	`--llm-api-key` → `agent.olium.llm_api_key` → `$ANTHROPIC_API_KEY`
`claude-oauth`	Bearer token (Claude Code OAuth)	`claude-opus-4-7`	`--oauth-token` → `agent.olium.oauth_token` → `$ANTHROPIC_API_KEY` (produced by `claude setup-token`)
`openai-api-key`	`x-api-key` header	`gpt-5.5`	`--llm-api-key` → `agent.olium.llm_api_key` → `$OPENAI_API_KEY`
`claude-code-cli`	(none — subprocess)	`claude-opus-4-7`	`--claude-bin` (default `claude` on `$PATH`)

With no --provider flag and no YAML override, vigolium auto-detects to codex-oauth. The claude-oauth provider also prepends a Claude Code preamble to the system prompt and adds the oauth-2025-04-20 beta header so it’s accepted on the same endpoint as anthropic-api-key. Codex auth refreshes itself: it parses the JWT, checks expiry with a 60 s skew, and posts to /oauth/token with the stored refresh token, rewriting ~/.codex/auth.json (mode 0o600).

Note: the REST API does not mirror these per-invocation flags. The server resolves the provider once from agent.olium.* in vigolium-configs.yaml and reuses it across requests so warm caches stay stable. To switch providers server-side, edit the YAML and reload.

Tools

Built-in tool registry — eight tools registered in this order:

Name	Read-only?	What it does
`bash`	no	`bash -lc <cmd>` with hard-rejects for catastrophic patterns (`rm -rf /`, `dd` to block devices, fork bombs, `mkfs` against real devices). Default timeout = engine `ToolTimeout` (5 min).
`read_file`	yes	Read file with line-number prefix. Params: `path`, `offset`, `limit` (default 2000).
`write_file`	no	Create or overwrite a file.
`edit_file`	no	Find-and-replace edit on a file.
`ls`	yes	List a directory.
`grep`	yes	Regex search — uses ripgrep when available, else native Go regex. Params: `pattern`, `path`, `glob`, `max_matches` (200), `ignore_case`.
`glob`	yes	Glob pattern → paths.
`web_fetch`	yes	Fetch a URL. Two modes: `http` (default, fast) and `browser` (delegates to `agent-browser` for SPA / JS-heavy pages). Params: `url`, `method`, `headers`, `body`, `max_bytes`, `mode`, `wait_selector`, `wait_ms`.

The IsReadOnly() flag is what the engine uses to decide whether to fan out a turn’s tool calls in parallel. bash runs without an approval prompt (yolo mode) — only the catastrophic-pattern guard prevents disasters.

Autopilot adds more

When the engine runs under vigolium agent autopilot, the registry also gets:

halt_scan — model-driven exit. Sets a halt signal; the run loop exits after the current turn.
report_finding — persists a finding to the database (title, severity, description, remediation, CWE, evidence, confidence, status). Soft-warns at 50 calls, hard-caps at 200.
load_skill — fetch a skill body by name (registered whenever the skill registry is non-empty).
Vigtool — run_scan, run_extension, list_sessions, get_session, list_findings, list_auth_sessions, auth_session_lookup (registered when Repo is non-nil).

Skills

Skills are Markdown workflow files with YAML frontmatter, following the agentskills.io convention so files written for Claude Code or pi work in olium verbatim. Format:

---
name: triage-finding
description: Walk a candidate finding from suspicious response → root cause → PoC.
license: optional
allowed-tools: optional list
---

# Body
Instructional prose the model reads after calling load_skill.

name must match [a-z0-9-]+ (≤64 chars); description ≤1024 chars.

Discovery

The skill registry walks four scopes — first-found-by-name wins:

Project — .agent/skills/ and .claude/skills/ in the working directory and every ancestor, closest first.
User — ~/.vigolium/skills/ (only when IncludeUserSkills=true).
Embedded — shipped in the binary under public/presets/skills/ via go:embed.

Two on-disk layouts are accepted: <root>/<name>/SKILL.md (directory skill, the agentskills.io standard) or <root>/<name>.md (single-file shorthand; frontmatter name must match the filename stem). Generic chat (vigolium agent olium, headless) loads scopes 1 + 3 only. Autopilot and swarm load all three so security-specific workflows in ~/.vigolium/skills/ don’t pollute casual chat.

Use

The engine writes an <available_skills> block into the system prompt listing every skill’s name + description + location. The model fetches bodies on demand via the load_skill tool — progressive disclosure, so unused skills don’t burn tokens. In the TUI, type /skill:<name> [args] to inline-expand a skill body into your prompt directly, no tool call needed.

CLI flags

--provider          codex-oauth | anthropic-api-key | claude-oauth | openai-api-key | claude-code-cli
--model             provider-specific (empty = provider default)
--oauth-cred        OAuth credential file (codex-oauth; default ~/.codex/auth.json)
--oauth-token       Claude Code OAuth bearer token (claude-oauth)
--llm-api-key       API key for anthropic-api-key / openai-api-key
--claude-bin        Path to the `claude` binary (claude-code-cli)
--system            Override the built-in system prompt
-p, --prompt        Initial prompt (alternative to a positional arg). Forces non-interactive mode.
--stdin             Force reading the prompt from stdin

Precedence for the initial prompt: positional args → -p/--prompt → stdin (auto-detected when piped, or forced with --stdin). Values flow CLI → YAML → env: every CLI flag falls back to its agent.olium.* YAML field, which in turn falls back to the documented default or env var.

Configuration

The full agent.olium block:

agent:
  olium:
    provider: codex-oauth          # codex-oauth | anthropic-api-key | claude-oauth | openai-api-key | claude-code-cli
    model: gpt-5.5                 # empty = provider default
    oauth_cred_path: ~/.codex/auth.json
    oauth_token: ""                # claude-oauth; supports ${ENV_VAR}; falls back to $ANTHROPIC_API_KEY
    llm_api_key: ""                # supports ${ENV_VAR}; falls back to $ANTHROPIC_API_KEY / $OPENAI_API_KEY
    reasoning_effort: medium       # minimal|low|medium|high|xhigh (codex)
    system_prompt: ""              # empty = built-in olium prompt
    max_tokens: 1000000
    temperature: 0.0
    max_turns: 32
    cache_size: 1024               # LRU; 0 disables
    max_concurrent: 4              # global cap on simultaneous provider calls; 0 = unbounded
    call_timeout_sec: 600          # per-call deadline; negative = no timeout (parent ctx only)

Adjacent config blocks worth knowing:

agent.sessions_dir — where per-run session directories go. Default ~/.vigolium/agent-sessions/.
agent.browser — toggles agent-browser integration (the binary web_fetch shells out to in mode: browser).
agent.archon — controls the optional archon-audit prep step that autopilot/swarm can stack ahead of the olium loop.

Sessions and on-disk state

Every agent run gets a session directory under agent.sessions_dir (default ~/.vigolium/agent-sessions/<run-uuid>/). Bare vigolium agent olium chat doesn’t write a session — it’s only autopilot/swarm/query that materialise one. Inside a session dir you may find:

runtime.log — per-turn event log (text deltas, tool start/end, turn-done summaries).
tool-results/<tool>-<call-id>.txt — spilled oversized tool outputs (when the engine’s SpillDir is set).
session-config.json — run metadata (project / scan UUIDs, options).
swarm-plan.json, master-output.md, audit-stream.jsonl, checkpoint.json — produced by the higher-level modes that wrap olium (swarm, archon, autopilot).

Browse past runs with vigolium agent session list / --full / --tail.

Stream events

The engine emits a unified Event channel regardless of provider:

Event	Carries
`EventTextDelta`	`Delta` — assistant text increment
`EventThinkingDelta`	`Delta` — reasoning content (Anthropic thinking, codex reasoning)
`EventToolCallStart`	`ToolName`, `ToolArgs` — the model decided to call a tool
`EventToolExecStart` / `EventToolExecProgress` / `EventToolExecEnd`	tool invocation lifecycle, `ToolResult`, `ToolIsErr`
`EventTurnDone`	`StopReason`, `Usage` (input / output / cache-read / cache-write tokens)
`EventRunDone`	terminal usage
`EventError`	`Err` — provider failure, ctx cancellation, max-turns exceeded

Token counts on EventTurnDone are accumulated by every higher-level caller (autopilot for budget enforcement, the adapter for agenttypes.TokenUsage, the swarm for cost reporting).

When to use what

You want to…	Use
Chat / debug / explore interactively	`vigolium ol`
Run one prompt from a script and parse stdout	`vigolium ol -p "..."`
Hand the agent the wheel for an autonomous pentest	`vigolium agent autopilot` (uses olium under the hood with budgets + report_finding)
AI-direct the native scanner (plan → modules → triage)	`vigolium agent swarm`
Single-shot template-driven prompt with structured output	`vigolium agent query`

Olium itself is the general-purpose chat / dev surface and the engine every other mode reuses — it is not a security scan on its own.

Getting Started

Agentic Scan

Native Scan

Native Scanning Phases

Server Mode

Guides

Customization

Others

What it is

What it does

Modes

Interactive TUI (default)

One-shot non-interactive

Library use (autopilot, swarm, query)

Providers

Tools

Autopilot adds more

Skills

Discovery

Use

CLI flags

Configuration

Sessions and on-disk state

Stream events

When to use what

See also

Getting Started

Agentic Scan

Native Scan

Native Scanning Phases

Server Mode

Guides

Customization

Others

Documentation Index

​What it is

​What it does

​Modes

​Interactive TUI (default)

​One-shot non-interactive

​Library use (autopilot, swarm, query)

​Providers

​Tools

​Autopilot adds more

​Skills

​Discovery

​Use

​CLI flags

​Configuration

​Sessions and on-disk state

​Stream events

​When to use what

​See also

What it is

What it does

Modes

Interactive TUI (default)

One-shot non-interactive

Library use (autopilot, swarm, query)

Providers

Tools

Autopilot adds more

Skills

Discovery

Use

CLI flags

Configuration

Sessions and on-disk state

Stream events

When to use what

See also