documentation

Everything about Ava Supernova

Clean, accurate, kept in sync with the agent itself.

What is Ava Supernova?

Ava Supernova is an open-source agentic coding assistant. She writes code with you, teaches you anything you want to learn, audits your projects for security issues, and can control your desktop when you need her to. She remembers what you told her last week, respects the conventions of your project, and runs on your machine by default.

Three surfaces, one agent. Use whichever fits your workflow — they share the same brain.

The three surfaces

VS Code extension — Ava lives next to your editor. Chat panel, unified dashboard, inline diffs.
Desktop IDE — standalone native app for when you want the whole thing in one window. Built with Tauri.
Companion — a mobile-friendly web app for when you are away from your desk. Tasks, journal, memory, quick chats.

What makes her different

Local-first. Your data stays on your machine unless you opt in to cloud sync. No telemetry. Ever.
Open source. Every line is public — extension, IDE, companion, CLI. Fork it, audit it, verify our claims.
Free for everyone. 3M tokens every month on the free tier. No credit card, no account required with your own keys.
Teaching is free forever. Education should not have a price tag.

If you are new to coding, start with Teach mode and ask her to explain something. If you are a seasoned engineer, drop her into Work mode and let the full persona team loose on your codebase. Same agent — different mindset.

Never written code? Start here

If you have never written a line of code, you are exactly who Ava is for. You do not need to learn any jargon to start — you talk to her in plain English, the way you would ask a knowledgeable friend. This page is your whole on-ramp.

What you can do today, with no experience

Ask her anything, in normal words — "what does this file do?", "why is my page blank?", "what even is a database?" She answers in plain language and remembers the conversation.
Learn something from scratch — say "teach me Python from zero" and she builds you a proper course with lessons and quizzes. This is always free.
Get something built — describe what you want ("a button that downloads my notes") and she writes it, showing you every change before it happens.

You don't need to understand everything

Ava has a lot of machinery under the hood — modes, models, specialists, routing. You will see those words around the app and in these docs. Here is the secret: you can ignore almost all of it. Ava picks the right tools and the right helpers for you automatically. The technical pages are there for when you get curious, not because you need them to start.

A gentle first path

Install Ava (next page) and open the chat.
Type a real question you have. Anything. See how she answers.
Try learning mode: type "teach me" and a topic. Follow along.
When you want her to actually make or change something, just ask — she will always show you what she is about to do and wait for your "yes".

Tip: Hit a word you do not recognise? The Plain-English glossary (further down this section) explains every term with no assumed knowledge.

Install

Pick the surface that matches how you work. You can use more than one — Ava syncs her memory across them when you sign in.

VS Code extension

Open the Extensions panel in VS Code (Ctrl+Shift+X / Cmd+Shift+X).
Search for "Ava Supernova" and click Install.
Press Ctrl+Shift+A (Cmd+Shift+A on macOS) to open the chat panel.
Follow the setup wizard — pick a model, set your permission mode, done.

Desktop IDE

Download the installer for your platform from the releases page.
Run it. The IDE launches with a welcome flow the first time.
Sign in for the free platform tokens, or paste your own API key to run fully local.

CLI

npm install -g @ava/cli (or pnpm / yarn — your choice).
Run ava in any project directory.
First run prompts you for a provider and a model. Pick and go.

Tip: No account, no problem. With your own API key (BYOK) you never need to sign in. The free tier is for the platform-managed models.

Your first five minutes

Forget the documentation for a moment. The fastest way to learn Ava is to use her.

Open the chat. Whatever surface you are on.
Type a question about a file in your project. Try: "Explain what this file does." — or pick a bug and say "Find what is wrong with this function."
Ava will read the file (tool call #1). She will ask you to approve the first read. Say yes.
Watch her stream an answer. If she wants to make a change, she will show you the diff and wait for you to approve.
Switch modes. Type >> to enter Work mode, then ?? to try Teach mode with the same question. Feel the difference.

That is it. You have used 4 of 60 tools and 2 of 6 modes. The rest is progressive — you learn what exists when you need it.

Note: Prefer structure? Jump to Core concepts for a proper tour. Prefer learning by doing? Keep going — Ava will ask before she does anything risky.

Local vs cloud, in one paragraph

Everything Ava does is local unless you explicitly opt in. Memory, tasks, journal, personality, settings — all stored on your machine in ~/.ava/ and .ava/ (per project). She talks to model providers over HTTPS to run your request, then returns home. She does not phone home. She does not collect telemetry. She does not train on your code.

If you sign in, you get 3M free tokens per month on platform-managed models (Qwen, MiniMax) and optional cloud sync for memory and settings across machines. Sync is per-feature, revocable anytime. Bring your own API key and Ava works fully without an account.

Tip: The rule: local is sacred. Cloud is additive.

Choosing your routing mode

You do not pick a model — you pick a routing strategy. Ava drives the right model for each subtask under the hood. Three strategies cover every workload, available on every plan.

Maestro — the default

One conductor (Qwen 3.6 Plus) handles every persona, every step. Production-tuned, proven, predictable cost. Pick this if you want Ava to "just work" without thinking about routing. Live on every plan.

Supernova — the polyglot ensemble

A frontier coordinator (DeepSeek V4 Pro, 1.6T parameters / 49B active, 1M context) hands off to specialists per subtask. V4 Flash for high-volume builds, Qwen 3.6 Plus as fallback, Qwen Omni when vision is in play. Best for heavy multi-step work where each step deserves its own specialist.

Aurora — the European stack

Mistral-only routing in three tiers — Mistral Large 3 coordinator + heavy specialists, Mistral Medium 3.5 (the merged flagship released April 2026 — 128B dense, 256K context, vision encoder from scratch, modified-MIT open weights, 77.6% SWE-Bench Verified) for Builder + mid-tier specialists + vision + long-form, Mistral Small 4 at the intent gate. Open weights end to end, never leaves EU infrastructure. Built for GDPR-strict deployments, public-sector, healthcare, anyone with a sovereignty mandate.

Tip: Switch routing mode any time from the model picker — top of the chat panel on every surface. Nothing is sticky.

Want to drive a specific model yourself?

BYOK (bring your own key) gives you both — the three orchestration modes plus the ability to pick a single model and skip routing entirely. Useful when you have a strong preference, a strict budget, or you are testing a specific model. Paste your provider key in settings, pick the model from the picker, done. Full provider matrix lives in the Reference section.

Qwen (Alibaba Cloud)

Qwen 3.6 Plus coordinates Auto Mode; 3.5 Flash and 3.5 Omni Flash are the fast-path options. All models available on every plan.

Model	Context	Input / Output per 1M	Capabilities
Qwen 3.6 Plus	1M	$0.29 / $1.70	toolsvisionthinkingstreaming
Qwen 3.5 Omni Plus	256K	$0.26 / $1.56	toolsvisionthinkingstreaming
Qwen 3.5 Omni Flash	256K	$0.07 / $0.26	toolsvisionstreaming
Qwen 3.5 Plus	1M	$0.20 / $1.20	toolsvisionthinkingstreaming
Qwen 3.5 Flash	256K	$0.05 / $0.40	toolsstreaming

MiniMax

Powers Creative Studio — image, video, music, voice.

Model	Context	Input / Output per 1M	Capabilities
MiniMax M2.7	205K	$0.30 / $1.20	toolsthinkingstreaming
MiniMax M2.5	1M	$0.15 / $1.20	toolsthinkingstreaming
MiniMax M2	1M	$0.26 / $1.00	toolsthinkingstreaming

DeepSeek (Supernova orchestration)

Powers Supernova routing mode. V4 Pro coordinates the persona pipeline; V4 Flash handles high-volume builds and review. Both open-weight MIT, 1M context, dual thinking/non-thinking modes.

Model	Context	Input / Output per 1M	Capabilities
DeepSeek V4 Pro	1M	$1.74 / $3.48	toolsthinkingstreaming
DeepSeek V4 Flash	1M	$0.14 / $0.28	toolsthinkingstreaming

Mistral AI (Aurora orchestration)

Powers Aurora routing mode. EU-based, Apache-2.0 open weights, never leaves European infrastructure. Large 3 coordinates; Small 4 handles specialists with vision support.

Model	Context	Input / Output per 1M	Capabilities
Mistral Large 3	262K	$0.50 / $1.50	toolsthinkingstreaming
Mistral Small 4	262K	$0.15 / $0.60	toolsvisionthinkingstreaming

Plain-English glossary

Every word you might bump into around Ava, explained with no assumed knowledge. You do not need to memorise any of this — flip back whenever a term trips you up.

Word	What it actually means
Agent	An AI that can take actions for you — read a file, run a command, search the web — not just chat. Ava is an agent: she does things, with your say-so.
Model	The actual AI brain doing the thinking (Qwen, DeepSeek, Mistral, and others). Different models have different strengths. You normally never pick one — Ava does.
Mode	The mindset Ava is in. Like a colleague switching hats: builder, teacher, planner, friend. You pick the mode; it changes how she behaves. There are six.
Persona / specialist	Helper roles Ava runs behind the scenes for harder jobs — an explorer, a planner, a fact-checker. Think of a small expert team. You never talk to them directly; Ava coordinates them.
Routing	Ava deciding which model to use for each step so you get good answers without overpaying. It happens automatically — "routing mode" just picks the overall strategy.
Tool	A specific action Ava can take — read a file, run a search, send an email. A "tool call" is her using one. She asks permission before anything risky.
Token	How AI measures text — roughly ¾ of a word. Models are priced per million tokens. Mostly something you can ignore.
Credit	Ava's simpler unit for what an action costs, so you are not doing token maths. Plans come with a monthly bundle.
API key	A private password that lets software use a paid service (like an AI model) on your account. You only need one if you want to bring your own.
BYOK	"Bring Your Own Key." Using your own API key instead of Ava's managed access. Optional — for people who already pay a provider directly.
Local-first	Your data lives on your own computer by default. Nothing is uploaded unless you switch on sync. "Local is sacred."
Permission mode	How cautious Ava is before doing things — ask every time, ask for risky things only, or just go. You set the level.
Context window	How much text a model can hold in mind at once — its short-term memory. "1M context" means about a million words. Bigger = it can read more before forgetting.
Diff	A side-by-side view of exactly what will change in a file — old on one side, new on the other — shown before Ava changes anything so you can approve it.
Repository (repo)	A project folder tracked for changes, usually with Git. If that means nothing to you yet, it is just "the folder my project lives in."
Prompt	What you type to the AI — your question or instruction.

Note: Still stuck on a word? Ask Ava in Chat mode — "what does X mean?" — and she will explain it for your level.

Routing — how Ava picks models

Here is the short version: you never have to pick which AI model to use. Ava does it for you, choosing the best tool for each part of a job — the way a good manager hands the right task to the right person. The rest of this page explains how that works under the hood. You can ignore almost all of it and just enjoy that it happens.

Ava is a thinking system, not a model menu. You pick one of three orchestration strategies (an "orchestration strategy" is just a named recipe for which AI models get used) — Maestro, Supernova, or Aurora — and Ava routes each subtask to the model best suited for it. ("Routing" means sending each piece of work to the model that handles it best.) Coordinator-tier reasoning where it matters, flash-tier specialists for high-volume work. (The "coordinator" is the lead model that thinks and plans; "specialists" are faster, cheaper models that do the bulk grunt work.) Same persona pipeline runs on all three; what changes is the underlying fleet.

Tip: You don't need to read the table below to use Ava — it's here for the curious. The one thing worth knowing: Maestro is live on every plan today, and it is a good default.

Side-by-side

	Maestro	Supernova	Aurora
Coordinator	Qwen 3.6 Plus	DeepSeek V4 Pro (1.6T / 49B active, 1M ctx)	Mistral Large 3 (675B / 41B active, 262K ctx)
Builder	Qwen 3.6 Plus (reuses coordinator)	Qwen 3.6 Plus	Mistral Medium 3.5 (128B dense, 256K ctx)
Mid-tier specialists	—	DeepSeek V4 Flash (code reviewer, fact checker, recon, scanner, security verifier, etc.)	Mistral Medium 3.5 (Architect, Verifier, Sequencer, Tutor, Reporter, etc.)
Light tier / intent gate	Qwen 3.5 Flash	Qwen 3.5 Flash (Scout, Verifier, Sequencer, Challenger, Integrator)	Mistral Small 4
Heavy specialists	—	—	Mistral Large 3 (Researcher, Challenger, CVE Researcher, Ideator, Fact Checker)
Chat	Qwen 3.5 Flash	DeepSeek V4 Flash	Mistral Small 4
Image-gen orchestration	Qwen 3.5 Flash	Qwen 3.5 Omni Flash	Mistral Small 4
Vision	Qwen 3.5 Omni Plus	Qwen 3.5 Omni Plus	Mistral Medium 3.5 (vision encoder trained from scratch)
Long-form writing	Qwen 3.5 Plus	Qwen 3.5 Plus	Mistral Medium 3.5
Data residency	Mixed (Alibaba Cloud)	Mixed (DeepSeek + Alibaba Cloud)	EU only · open weights
Status	Live · all plans	Coming soon	Coming soon
Best for	Daily work, predictable cost	Heavy multi-step work, frontier coordinator on every plan	GDPR-strict, public-sector, sovereign EU stacks

Note: The next three sections name specific AI models and their sizes (the "B" numbers are billions of parameters — roughly, how big the model's brain is; bigger usually means smarter but slower and pricier). This is power-user detail. You can skip straight to "Why orchestration?" if you just want the gist.

Maestro

Tier-differentiated Qwen routing, light footprint. Qwen 3.6 Plus owns the coordinator, Builder, planning, security, brainstorm, and long-context work — the slots where its 1M-context hybrid linear-attention pays off. Qwen 3.5 Flash handles chat, image-gen orchestration, and the upstream intent gate, where its $0.07/$0.26 pricing and faster TTFT win on the bulk of low-depth token volume. Qwen 3.5 Omni Plus owns vision input (Qwen 3.6 Plus has no native vision). Qwen 3.5 Plus carries cost-sensitive long-form work. Falls through the priority ladder Qwen 3.6 Plus → 3.5 Plus → 3.5 Flash if the primary coordinator is unavailable.

Supernova

Polyglot ensemble. DeepSeek V4 Pro coordinates and dispatches each subtask to its best-suited specialist: Qwen 3.6 Plus runs every Builder spawn, V4 Flash handles chat plus the mid-tier review/audit personas, Qwen 3.5 Flash absorbs the light-tier classifier work, Qwen 3.5 Omni Plus owns vision input, Qwen 3.5 Omni Flash orchestrates image-gen tool calls, Qwen 3.5 Plus carries the cost-sensitive long-form Content Writer persona. Frontier reasoning where it matters, flash-tier economics on the bulk of token volume.

Aurora

European AI stack — sovereign by design. Mistral-only routing in three tiers: Mistral Large 3 (675B/41B-active sparse MoE) handles the coordinator role plus the heavy specialists that need depth — Researcher, Challenger, CVE Researcher, Ideator, Fact Checker, Security Verifier — and the long-context grunt where its sparse-MoE efficiency wins. Mistral Medium 3.5 — the merged flagship released April 2026, 128B dense, 256K context, vision encoder trained from scratch, modified-MIT open weights, 77.6% SWE-Bench Verified — runs the actual working tier: Builder spawns, mid-tier specialists (Architect, Verifier, Sequencer, Tutor, Reporter), vision input, and long-form writing. Mistral Small 4 expanded from a single role at the intent gate to also handling chat and image-gen orchestration — its $0.15/$0.60 pricing and configurable reasoning earn their keep on the high-volume low-depth routes. Open weights end to end, never leaves EU infrastructure. Aurora deliberately does not silently cross-route to a non-Mistral model when one is unavailable; the router surfaces an error instead — that is what makes it "Aurora" rather than a Mistral-flavoured Maestro.

Why orchestration?

Routing each subtask to the right specialist costs less than running every step on a frontier model — and produces better results, because each model is used for what it is best at. Coordinator tokens get spent on reasoning and planning; build-tier tokens flow to flash-tier models. Cost stays predictable, quality stays high, and you do not need to know which model to pick — Ava picks for each step.

Note: All three modes are universally available — on platform credits and on your own keys. BYOK adds the option to bypass orchestration and drive a single model directly when you want that level of control.

Ava Credits — how billing works

Credits are simply how Ava counts what you use — like minutes on a phone plan. Every plan, including the free one, comes with a pool of credits, and each thing Ava does (answering you, making an image) costs a small number of them. That's the whole idea. The detail below is for when you want to know exactly what costs what.

Ava bills in credits, not raw tokens. (A "token" is the tiny chunk of text — roughly a few letters — that AI models read and write in; most services charge you per token, which is hard to predict.) One credit covers one unit of work — a chat turn, a persona spawn (one helper doing a piece of the task), an image generation. Decoupling from token-level metering means you do not need to know which model is running to know what an action will cost. The same chat turn costs the same whether Maestro routes it to Qwen 3.6 Plus or Supernova routes it to V4 Pro.

Plans

Every plan has access to every feature — model access, Creative Studio, all six modes, the full persona orchestration. Higher tiers buy more credits and a higher rate-limit ceiling, not unlocked features.

Free — $0/month. 300 credits. 20 requests/minute rate limit. No card required.
Pro — $19/month. 5,000 credits. 60 requests/minute.
Ultra — $39/month. 10,000 credits. 120 requests/minute.
Enterprise — $79/month. 20,000 credits. 200 requests/minute.

Note: Unused credits roll over while your subscription is active — there is no forced clawback. Top-ups stack on the subscription pool: 750 credits for $3, 2,000 for $8, 4,000 for $15.

What things cost

Per-action cost table — what gets deducted from your credit pool when each action runs. (A "cache hit" is when Ava reuses work it already did instead of redoing it; when that happens you pay less — the charge drops by 70%, or 50% on output-heavy models like DeepSeek V4 Pro.)

Chat turn — 2 credits. A single back-and-forth in any mode.
Light call — 1 credit. Intent gate, classification, single-shot read.
Heavy persona — 3 credits. Architect, Researcher, CVE Researcher, Ideator (depth ≥ 4).
Light persona — 1 credit. Scout, Verifier, Sequencer, Challenger (depth ≤ 2).
Orchestration — 10 credits. Full persona pipeline spawn (Conductor + multi-persona task).
Image generation — 12 credits. Hailuo image-01 via Creative Studio.
Video generation — 150 credits. Hailuo 02 Pro 1080p 6s clip.
Voice generation — 10 credits. Speech 2.8 HD synthesis.
Music generation — 50 credits. MiniMax Music 2.5/2.6.
Background removal — 2 credits.

Per-model multipliers

This bit is for the curious — you don't need it to use Ava. A multiplier just scales the credit cost up or down depending on how expensive the model behind an action is, so heavier models cost a bit more. ("Net margin" is the slice the project keeps after paying its own bills — we aim low so the value flows back to you.)

Heavier models multiply the action cost so credits track real spend instead of a flat bracket. Calibrated for ~55% net margin.

0.6× — Mistral Small 4 (Aurora's specialist seat). Sub-1× because Mistral Small 4 is cheaper than the Qwen 3.6 Plus anchor.
1.0× — Qwen 3.5 Flash, Qwen Omni Flash, DeepSeek V4 Flash, MiniMax. Base rate.
1.2× — Qwen 3.5 Plus, Qwen 3.5 Omni Plus.
1.4× — Mistral Large 3 (Aurora coordinator). About par with the Qwen 3.6 Plus anchor.
1.5× — Qwen 3.6 Plus (Maestro coordinator).
6.0× — DeepSeek V4 Pro (Supernova coordinator + heavy specialists). Recalibrated from 5.0× to restore margin parity at the published V4 Pro rate.

Note: Anthropic models (Claude Opus / Sonnet / Haiku) skip credit billing entirely — they are BYOK only and you pay Anthropic directly. Same for any other BYOK provider when used with your own key.

BYOK and credits

BYOK ("bring your own key" — you supply your own account with an AI provider and a key, which is a private password that lets software use that paid account) is for people who already pay an AI provider directly. If that is not you, you can ignore this section entirely.

Bring-your-own-key requests bypass platform infrastructure entirely — the call goes direct from Ava to the provider, you pay the provider, no credits are consumed. The only exception is when a BYOK session spawns a managed-tier persona for a sub-task; the credit deduction still applies because Ava's orchestration surface is being used.

Where to check your balance

Extension — Settings → Account, or the credit pill in the chat header.
IDE — Account dashboard tile shows current balance and burn rate.
Web — ava-supernova.com/dashboard shows balance, history, and renewal date.

Tip: Local-first: Ava works without an account on the Custom Model card (Ollama, LM Studio, vLLM, or any OpenAI-compatible endpoint). No credits, no platform tokens — your model, your machine, your bill of zero.

The six modes

A mode is the mindset Ava is in — like a colleague swapping hats. Same person, same memory of you, but a builder thinks differently from a teacher. You pick the mode and it changes how she behaves and which helpers she brings. Switch any time with the mode selector, or type a two-character shortcut.

Prefix	Mode	Mindset	Summary
`>>`	Work	Builder — full tool access, executes plans with precision.	Write code, run tests, ship changes. Simple tasks run directly; complex multi-file or architecture work triggers the persona conductor.
`::`	Plan	Architect — evidence-based proposals, no code changes.	Read-only. Researches, designs approaches, challenges assumptions, and presents a plan for your approval before any execution.
`..`	Chat	Friend — personal conversation, off the clock.	Open conversation with memory, search, journal, weather, and news. No personas, no orchestration — just Ava.
`??`	Teach	Tutor — Socratic guidance, adaptive to pace.	Build curriculums, deliver lessons, run quizzes, track progress. Five specialists run the teaching loop.
`!!`	Security	Auditor — systematic vulnerability scanning, paranoia as a virtue.	Five-persona security team: recon, scan, CVE research, verify, report. OWASP and dependency CVE coverage end-to-end.
`**`	Brainstorm	Ideator — grounded ideas, actionable paths.	Mines your context, researches the market, generates specific ideas, challenges them, and refines the survivors into next steps.

Which one do you want?

Start from what you are trying to do — the mode follows from that:

I want something built or changed → Work (>>). You describe it, she builds it, showing you every change first. The full helper team kicks in for bigger jobs.
I want a plan before anything changes → Plan (::). She thinks it through and hands you a plan to approve or edit. She will not touch your files in this mode.
I just want to talk or ask → Chat (..). A thinking partner with memory, news, and weather. No file changes, no code — just conversation.
I want to learn something → Teach (??). A real course built for you: lessons, quizzes, the lot. Free on every plan, forever. Perfect if you are new.
I want my project checked for security holes → Security (!!). A team works through the standard list of common web vulnerabilities (the "OWASP" categories) and known reported flaws (called "CVEs"), then hands you a report sorted by how serious each issue is. This one is aimed at developers — skip it if that is not you.
I want ideas → Brainstorm (**). She generates options grounded in your actual situation, pokes holes in the weak ones, and refines the rest into next steps.

Note: The "helper team" (personas) runs entirely behind the scenes — you never talk to them directly. You just get a more thorough answer. You do not need to know their names.

Tip: You can switch modes mid-conversation. Your memory carries across; the specialists reset to suit the new mode.

Memory — how Ava remembers you

In plain terms: Ava remembers you. Tell her something once — what you like, a decision you made — and she keeps it, so you don't have to repeat yourself in the next chat or next week. That's the headline. The layers below just explain how she keeps that memory tidy over time.

Ava has a persistent memory ("persistent" means it sticks around after you close the app) that survives across conversations, across projects, and across machines if you enable sync (sync copies your memory between your devices). It is not a chat log. It is a structured, searchable understanding of who you are, what you like, and what you have decided.

Five layers

Extract — key facts captured from every message in real time.
Reflect — deeper analysis runs at the end of meaningful sessions. Patterns, themes, contradictions.
Accumulate — corrections and preferences compound over weeks and months.
Analyse — the graph engine links related memories and flags contradictions when new information conflicts with old.
Consolidate — similar memories merge; stale ones prune. Memory gets sharper over time, not noisier.

Note: The five layers above run on their own — you never have to manage them. They're here so you can see the memory genuinely improves over time rather than turning into a pile of clutter.

Recall

When you ask a question, Ava searches memory by meaning ("semantic search" — it understands what you mean, so it finds the right memory even if you phrase it differently), not just by matching exact words. Mode-specific filters run alongside — Chat mode pulls personal context, Work mode pulls project decisions and code patterns.

Note: Memory is scoped per-user and per-project (your memories are yours, kept separate by person and by project). Secrets and credentials (passwords, keys) are blocked at the save boundary — they can never land in memory.

Tasks and journal

Ava keeps three kinds of notes for you, and it's easy to mix them up. The quick way to remember: memory is what she knows about you, tasks are what you still need to do, and the journal is a diary of how things went. Here is when to use which.

Three things that sound similar and are not: memory, tasks, and journal. Here is when to use which.

Memory — persistent facts. Preferences, decisions, patterns. Asked about repeatedly, rarely created manually.
Tasks — things to do. Action items with due dates and status. You tick them off when done.
Journal — reflection. Daily log, dual entries (yours and Ava observations about the session). Useful for mood tracking, session reviews, context for tomorrow.

Ava writes to all three when it makes sense. You can write to any of them directly from the dashboard. Nothing is locked to a single place you use Ava — a task you create in one (say the command-line tool) shows up everywhere else (like the desktop app), automatically.

Permissions — three modes, ten categories

Permissions are your safety dial. They decide when Ava asks your okay before doing something on your computer, versus just getting on with it. You can set her to ask about everything, ask about the risky stuff only, or trust her to run — and you can change it any time. If you're not sure, leave it on the default and you'll be fine.

Ava has 60 tools ("tools" are the actions she can take — reading a file, searching the web, and so on). Every tool belongs to a category (file operations, shell, git, web, media, database, system, documents, memory, learning). Every tool call passes through the permission gate (a checkpoint that decides whether to ask you first) before it runs. You control what gets auto-approved, what asks once, and what always asks.

Category	Strict	Balanced	Autonomous
File operations	First time only	Auto-allow	Auto-allow
Shell	Always ask	Always ask	Auto-allow
Git	Always ask	First time only	Auto-allow
Web	Always ask	First time only	Auto-allow
Media	First time only	Auto-allow	Auto-allow
Database	Always ask	Always ask	Auto-allow
System	Always ask	Always ask	Auto-allow
Documents	First time only	Auto-allow	Auto-allow
Memory	Auto-allow	Auto-allow	Auto-allow
Learning	Auto-allow	Auto-allow	Auto-allow
	Every write and dangerous tool asks first. Maximum control.	Writes auto-allowed, dangerous tools still confirm. The default.	Everything auto-allowed. Plans and user questions still pause.

Balanced is the default. Strict for when you are auditing her behaviour (watching closely to check each step). Autonomous for when you want her to just go.

Warning: Autonomous does not mean unchecked. Plans, clarifying questions, and explicit ask-user prompts still pause for your input. Irreversible operations (git force-push, database drops) always confirm regardless of mode.

Interjection and hard-stop

You are always in charge. While Ava is busy working, you can either nudge her in a new direction without breaking her stride, or stop her cold. Two buttons, two outcomes — that's all this page is about.

Ava runs until her plan completes or you stop her. You have two ways to step in (the word for this is "intervene") while she is running.

Interject — steer without cancelling

Type a new message while Ava is working. She finishes her current tool call, reads your message, and adjusts her plan. No context is lost. Use this when you want to redirect, add information, or change priorities mid-run.

Hard-stop — cancel and clear

Press Escape (CLI), click Stop (extension, IDE), or type a stop command. The current run cancels, pending tool calls drop, and the conversation is ready for a fresh turn.

Tip: Interject first. Hard-stop is for when something is genuinely wrong. Interjecting keeps the thread alive.

Desktop personas

Note: Available in the Ava Supernova IDE only. The VS Code extension and CLI/companion don't ship desktop-automation tools — open the IDE to use this feature.

This page is for the curious and for power users. "Desktop automation" (sometimes called "computer use") means Ava can actually click around your screen and use apps for you — open Gmail, fill a form, send a message — like a person at the keyboard. To do that safely, she splits the job across five small helpers, each with one narrow role. You never deal with them directly; this just shows you who does what.

Every step in a desktop trajectory (one run of Ava driving your screen toward a goal) runs a five-persona wave. Each persona has one job and only one job. The separation keeps prompts tight, outputs structured, and reasoning skeptical by design.

Scout

Reports what is visible on screen right now. Never invents elements. Never plans. Returns a ScreenState JSON with element IDs, names, bboxes, confidence, and a grounding source (UIA, Playwright DOM, or OmniParser vision).

Planner

Decides the single next action. One action per step — never batches. Prefers reversible paths. Classifies risk per the safety ontology. Outputs a ProposedAction JSON with kind, target, params, riskClass, reasoning, and an expectedPostState prediction.

Actor

Executes exactly what Planner proposed — after the approval gate has cleared. No improvisation. No retry. Failure is reported faithfully and Verifier handles the next move.

Verifier

Checks whether the action landed. Compares the fresh ScreenState against Planner's prediction. Returns verified, deviated, or rollback_needed. Also runs loop detection — repeating an action that already succeeded is a deviation, not a verification.

Narrator

The only voice the user hears during a trajectory. One past-tense sentence per step ("Opened Gmail.", "Clicked Compose."). Also writes the structured audit log. Says when something went wrong — never hides a problem.

Desktop safety ontology

Note: Available in the Ava Supernova IDE only — describes the safety model that gates every desktop automation action before Actor runs.

In short: before Ava clicks anything on your screen, she sorts the action by how risky it is. Reading the screen? Harmless. Sending money or deleting something? She always stops and asks you first — every single time, no exceptions. The five labels below are just the rungs on that risk ladder. This is power-user detail, but the takeaway is simple: the dangerous stuff always needs your yes.

Every proposed action is classified into one of five risk classes. The classification combines three signals — what the on-screen target is, a plain-language description of it ("semantic caption" — Ava's own words for what she's looking at), and the action's settings — and escalates up (treats it as riskier) when they disagree. A false positive (asking when it didn't need to) is just a nag. A false negative (not asking when it should have) is a product-ending incident.

The five classes

Observational — read UIA tree, screenshot, hover. Always auto-allowed.
Navigational — scroll, move focus, open a menu. Auto-allowed in Drive mode.
Mutative-reversible — type into a field, paste, open an app, navigate a URL. Confirms in Ask mode; auto in Drive within the whitelist.
Mutative-irreversible — send, submit, pay, delete, confirm, publish. Always confirms, regardless of permission level. Approval never caches.
Privileged — UAC elevation, sudo, registry edits, credential prompts. Forbidden by default. Single-use opt-in required per session.

Never-cached approvals

Every competitor caches approvals (remembers your "yes" and reuses it next time). That is the largest foothold for prompt injection — an attack where a booby-trapped web page sneaks hidden instructions to the AI to make it act against you. If approvals are cached, a compromised page can ride on a "yes" you gave earlier. Ava never caches. Each mutative-irreversible action asks fresh.

Secrets handling

Passwords, API keys, and 2FA codes (the one-time codes a second app or text gives you when you log in) are handled via opaque handles — placeholders that stand in for the real value. Ava sees "{secret}", not the actual password. The real value never enters her memory, her activity log, or what she "reads" — a separate low-level layer types it in at the last moment, at the keypress, so the AI part never touches it.

Desktop grounding hierarchy (Preview)

Warning: Preview — how Scout decides which grounding source to use each step.

Before Ava can click a button, she has to actually understand what's on your screen and where things are — that's "grounding" (figuring out what each thing on screen is and its exact spot). She has three ways to do it and picks the most reliable one that works for the app you're in. This is deep power-user detail; the point is just that she prefers solid, accurate methods over guessing from a picture.

Three tiers, evaluated in order. Cheapest reliable source wins.

1. Browser window → Playwright DOM

If the active window is a web browser, Ava reads the page's own structure (the "DOM" — the underlying list of what's on a page) instead of guessing from the picture. Reading the structure is about 10× more reliable than eyeballing pixels — she knows exactly where each button is and when the page has finished loading. This is the main path for anything on the web, not a backup.

2. Native app with UIA → Windows UI Automation

For ordinary desktop apps on Windows, Ava reads a built-in accessibility map ("UIA", the same data screen-readers use to describe a window) — it lists each control, its name, and where it sits. If that map returns at least five usable, named items, she trusts it and uses it.

3. UIA empty or junk → OmniParser vision

When that map is missing or useless (some apps, games, and remote screens don't provide one), Ava falls back to actually looking at a screenshot. A vision tool (OmniParser) spots the clickable regions and labels what each one is, turning the picture back into a list she can act on — combined with whatever little the accessibility map did give.

Note: OmniParser is opt-in the first time Ava needs it, with a clear privacy prompt. BYOK users can point at their own local OmniParser deployment.

Desktop kill switches (Preview)

Warning: Preview — three independent ways to abort a running trajectory. Any one of them wins.

A "kill switch" is exactly what it sounds like — an emergency way to stop Ava instantly when she's controlling your screen. The reassuring part: there are three of them, each built separately, so if one fails another still works. You don't have to learn all three. Just know that when something looks wrong, you can always stop it, and the strongest stop (the panic kill below) doesn't rely on Ava cooperating.

After reading every public autonomy incident (real cases where AI agents ignored stop commands, deleted email archives, wiped drives), we built three kill switches in three different layers. None of them depend on the AI model itself — if Ava is stuck or unresponsive, you can still stop her.

Pause — space or the Pause button

Freezes on the next step boundary. Resumable. Use this when you want to intervene without losing the trajectory.

Stop — Escape or the Stop button

Clean abort. Narrator summarises what was done so far. Not resumable. Audit log commits.

Panic kill — triple-Escape or Ctrl+Shift+K

The hardest stop there is. A small, separate watchdog program force-quits the part of Ava that is driving your screen. It does not wait for the AI to respond, does not need the app window to be working, and does not care what state things are in — it just kills it. This is the one for when something is genuinely wrong.

All three work from the companion app too. Either driver can end it.

Tool reference

This is the full lookup of every tool Ava has — a "tool" just being a specific action she can take, like reading a file or running a search. You do not need to read this page. Ava picks the right tool herself every time. It is here for when you are curious about exactly what she can do.

The full built-in toolbox. Every tool here is registered in the core runtime (the engine underneath all three surfaces) and available to Ava when the mode and permission mode allow it. Filter by category or risk.

No need to memorise any of this — the table below is just a reference card.

File operations

Tool	Risk	What it does
`analyze_architecture`	safe	Analyse project architecture — dependency graph, circular imports, coupling hotspots, file metrics.
`browse_library`	safe	Browse the project creative asset library — images, videos, audio, documents.
`docs_lookup`	safe	Search Ava documentation to help users with features, setup, and troubleshooting.
`file_edit`	write	Replace an exact string in a file with new content.
`file_read`	safe	Read the contents of a file with line numbers.
`file_write`	write	Create or overwrite a file with the given content.
`find_symbol`	safe	Find where functions, classes, types, and other symbols are defined or referenced.
`glob`	safe	Find files matching a glob pattern.
`grep`	safe	Search file contents using regex patterns.
`list_directory`	safe	List the contents of a directory with file types and sizes.
`project_index`	safe	Scan, refresh, or display the project structure index.
`self_inspect`	safe	Read Ava own source code when the actual implementation needs to be quoted or examined.

Documents

Tool	Risk	What it does
`apply_plan`	write	Apply a batch of file edits atomically with a git checkpoint for safe rollback.
`doc_generate`	write	Generate project documentation (README, API docs, architecture overview) from source code.
`document_manage`	write	Create, read, edit, and export documents (Word, Excel, PDF, CSV, Markdown).
`email_draft`	safe	Draft an email with structured content, tone control, and .docx file output.
`journal_write`	safe	Write to the dual journal — Ava session observations and the user reflection log.
`present_plan`	write	Present a structured plan for the user to review and approve before execution.
`report_generate`	write	Generate a structured .docx report from tasks, journal, memory, and project data.
`switch_mode`	safe	Offer to transition to a different mode when the current mode work is complete.
`task_manage`	safe	Manage the personal task list — list, create, complete, update, delete.
`todo_write`	safe	Create or update a visual task list for tracking progress in the current session.

System

Tool	Risk	What it does
`ask_user`	safe	Ask the user a question and wait for their response.
`audit_dependencies`	safe	Run a security audit on project dependencies and report vulnerabilities.
`curator`	safe	Consult the Curator specialist for a design or taste decision.
`detect_language`	safe	Detect the human language of a text snippet from 20+ supported languages.
`env_write`	write	Write a granted secret to the project env file. Hard-fails if the env file is not in .gitignore.
`get_datetime`	safe	Get the current date, time, and timezone from the host system.
`propose_tool`	write	Propose a new tool to the development team when a capability gap is hit.
`secret_request`	safe	Request access to a secret from the user vault. Returns an opaque handle the host substitutes at execution time.
`support_request`	write	Submit a support ticket to the Ava development team.

Shell

Tool	Risk	What it does
`bash`	dangerous	Execute shell commands with output capture, error handling, and safety sandboxing.
`benchmark`	safe	Run a command and measure execution time, optionally comparing against a stored baseline.
`debug_logs`	safe	Read and filter log files or command output for debugging.
`test_generate`	write	Generate test scaffolding for a source file using the detected test framework.
`test_run`	safe	Detect the project test framework and run tests (full suite or specific file).

Web

Tool	Risk	What it does
`browser`	dangerous	Automate browser interactions — navigate, click, fill, screenshot, extract text, run JS.
`http_request`	dangerous	Make HTTP requests (GET, POST, etc.) with automatic retries and response parsing.
`news`	safe	Fetch curated tech and AI news, filterable by category or keyword.
`release_notes`	safe	Fetch published release notes for Ava Supernova.
`weather`	safe	Get current weather and 3-day forecast. Auto-detects location if none specified.
`web_search`	safe	Search the web for current information.

Database

Tool	Risk	What it does
`database_query`	dangerous	Execute read-only database queries and visualise results in tables.

Media

Tool	Risk	What it does
`generate_image`	write	Generate AI images from a text prompt with quality verification via a vision model.
`generate_music`	write	Generate AI music from a text prompt. Supports instrumental and vocal tracks with lyrics.
`generate_video`	write	Generate a short AI video from a text prompt using MiniMax Hailuo.
`generate_voice`	write	Generate AI voice/speech from text using MiniMax TTS.
`remove_background`	write	Remove the background from an image, making it transparent.

Git

Tool	Risk	What it does
`git_commit`	write	Stage and commit changes with an auto-generated or custom commit message.
`git_create_pr`	write	Create a GitHub pull request from the current branch with an auto-generated title and description.
`git_diff`	safe	Show formatted git diffs with structured modes (staged, unstaged, all, branch).
`git_status`	safe	Run read-only git commands (status, diff, log, branch, show).
`rollback`	dangerous	Restore, discard, or check the status of a git checkpoint.

Learning

Tool	Risk	What it does
`learning_create`	write	Create a structured learning curriculum with modules and lessons for the user.
`learning_progress`	safe	View learning progress, list curriculums, find next lesson, check review schedule, or search learning content.
`learning_teach`	write	Deliver a lesson, write teaching content, give feedback, run a quiz, or trigger a review.

Memory

Tool	Risk	What it does
`memory_delete`	safe	Delete a specific memory entry by ID.
`memory_recall`	safe	Search persistent memory for information from past conversations using semantic queries.
`memory_save`	safe	Save information to persistent memory that survives across conversations.
`memory_update`	safe	Update an existing memory entry by ID.

Risk levels

Safe — reads things without changing anything. file_read, glob (find files by name pattern), grep (search inside files for text), git_status (Git is the system that tracks changes to your project).
Write — changes files, saves a checkpoint of your work (a "commit"), or produces something new. file_edit, git_commit, generate_image.
Dangerous — runs commands on your computer (the "shell"), controls the desktop, goes out to the internet, or queries databases. bash, browser, desktop_*.

Note: Persona specialists have their own tool allowlists. Scout can read but not write. Builder has full access. Challenger has no tools at all — its job is to question the plan.

Models

A "model" is the AI brain doing the thinking — Qwen, DeepSeek, Mistral and others. This page lists every model Ava can use. The short version: you do not have to choose one. Ava picks the right model for each step herself. Read on only if you want to understand or override that.

Every model Ava can route to, plus the three orchestration strategies — the rules that decide which model handles each part of a job. Pick a routing mode and let Ava dispatch, or pick a single model and skip routing entirely.

Routing modes

Three orchestrated strategies. The same persona pipeline (the small team of specialist helpers Ava runs behind the scenes) runs on all three; what changes is the underlying fleet of models. Pick a mode in the model selector and Ava handles the rest. Each card below shows the constituent specialists the conductor routes to.

You do not need to memorise the cards below — they are here so you can see what each strategy is made of.

✦ Maestro — single-conductor

One conductor drives the entire persona pipeline (Scout, Architect, Builder, Verifier). A cheap fast model handles the upstream intent gate so the conductor only spins up when orchestration is actually needed. Default for everyone, live on every plan.

Model	Context	Input / Output per 1M	Capabilities
Qwen 3.6 Plus — conductor + every persona	1M	$0.29 / $1.70	toolsvisionthinkingstreaming
Qwen 3.5 Flash — upstream intent gate / classifier	256K	$0.05 / $0.40	toolsstreaming

✦ Supernova — polyglot ensemble

Best-of-breed routing — the coordinator picks the right specialist for each subtask. Frontier reasoning where it matters, flash-tier cost where it does not.

Model	Context	Input / Output per 1M	Capabilities
DeepSeek V4 Pro — coordinator + planning, chat, long-context, security, brainstorm; Researcher, CVE Researcher, Ideator personas	1M	$1.74 / $3.48	toolsthinkingstreaming
Qwen 3.6 Plus — Builder + coding, image-gen; Architect persona	1M	$0.29 / $1.70	toolsvisionthinkingstreaming
DeepSeek V4 Flash — Teach route; Code Reviewer, Fact Checker, Quiz Master, Recon, Scanner, Curriculum Architect, Tutor, Curator, Explorer, Refiner, Security Verifier/Reporter personas	1M	$0.14 / $0.28	toolsthinkingstreaming
Qwen 3.5 Plus — Content Writer persona (cost-sensitive long-output writing)	1M	$0.20 / $1.20	toolsvisionthinkingstreaming
Qwen 3.5 Flash — intent gate; Scout, Verifier, Sequencer, Challenger, Integrator personas (depth ≤ 2)	256K	$0.05 / $0.40	toolsstreaming
Qwen 3.5 Omni Plus — vision route + Design Reviewer persona (only vision + audio capable model in scope)	256K	$0.26 / $1.56	toolsvisionthinkingstreaming

✦ Aurora — European AI stack

Mistral-only routing. Every call lands on a Mistral model — Aurora deployments never leave European infrastructure. For GDPR-strict deployments, AI Act compliance, sovereignty mandates. Apache-2.0 open weights end-to-end. No cross-routing fallback — that is the EU-stack guarantee.

Model	Context	Input / Output per 1M	Capabilities
Mistral Large 3 — coordinator + planning, chat, long-context, security, brainstorm; Researcher, Challenger, Fact Checker, CVE Researcher, Security Verifier, Ideator personas	262K	$0.50 / $1.50	toolsthinkingstreaming
Mistral Small 4 — Builder + coding, vision, image-gen, teach; intent gate; Architect, Verifier, Sequencer, Curriculum Architect, Content Writer, Quiz Master, Tutor, Recon, Scanner, Reporter, Explorer, Refiner personas	262K	$0.15 / $0.60	toolsvisionthinkingstreaming

Note: Aurora is deliberately Mistral-only. If a Mistral model is unavailable the router returns an error rather than cross-routing — that is the EU-stack guarantee. Pick Maestro or Supernova for graceful degradation.

Tip: All three modes are universally available — on platform credits and on your own keys. BYOK adds the option to bypass orchestration and drive a single model directly.

Platform-managed models

Available on every plan, including the free tier. Tokens (the units of text AI is measured in — roughly three-quarters of a word each) count against your plan allowance. The keys (the private passwords that unlock each paid model) are rotated and monitored by the platform — nothing for you to configure.

Qwen (Alibaba Cloud)

Qwen 3.6 Plus coordinates Auto Mode; 3.5 Flash and 3.5 Omni Flash are the fast-path options. All models available on every plan.

Model	Context	Input / Output per 1M	Capabilities
Qwen 3.6 Plus	1M	$0.29 / $1.70	toolsvisionthinkingstreaming
Qwen 3.5 Omni Plus	256K	$0.26 / $1.56	toolsvisionthinkingstreaming
Qwen 3.5 Omni Flash	256K	$0.07 / $0.26	toolsvisionstreaming
Qwen 3.5 Plus	1M	$0.20 / $1.20	toolsvisionthinkingstreaming
Qwen 3.5 Flash	256K	$0.05 / $0.40	toolsstreaming

MiniMax

Powers Creative Studio — image, video, music, voice.

Model	Context	Input / Output per 1M	Capabilities
MiniMax M2.7	205K	$0.30 / $1.20	toolsthinkingstreaming
MiniMax M2.5	1M	$0.15 / $1.20	toolsthinkingstreaming
MiniMax M2	1M	$0.26 / $1.00	toolsthinkingstreaming

DeepSeek (Supernova orchestration)

Powers Supernova routing mode. V4 Pro coordinates the persona pipeline; V4 Flash handles high-volume builds and review. Both open-weight MIT, 1M context, dual thinking/non-thinking modes.

Model	Context	Input / Output per 1M	Capabilities
DeepSeek V4 Pro	1M	$1.74 / $3.48	toolsthinkingstreaming
DeepSeek V4 Flash	1M	$0.14 / $0.28	toolsthinkingstreaming

Mistral AI (Aurora orchestration)

Powers Aurora routing mode. EU-based, Apache-2.0 open weights, never leaves European infrastructure. Large 3 coordinates; Small 4 handles specialists with vision support.

Model	Context	Input / Output per 1M	Capabilities
Mistral Large 3	262K	$0.50 / $1.50	toolsthinkingstreaming
Mistral Small 4	262K	$0.15 / $0.60	toolsvisionthinkingstreaming

Bring your own key

An "API key" is a private password that lets software use a paid service on your own account. "BYOK" just means "bring your own key" — using yours instead of ours. Paste your API key in settings. BYOK requests go direct from Ava to the provider (the company that runs the model) — they do not pass through our infrastructure and do not consume platform tokens. You pay the provider; we do not see the traffic.

Anthropic

Model	Context	Input / Output per 1M	Capabilities
Claude Opus 4.7	200K	$5.00 / $25.00	toolsvisionstreaming
Claude Opus 4.6	200K	$5.00 / $25.00	toolsvisionstreaming
Claude Sonnet 4.6	200K	$3.00 / $15.00	toolsvisionstreaming
Claude Haiku 4.5	200K	$1.00 / $5.00	toolsvisionstreaming

DeepSeek

V4 launched 2026-04-24 — open-weight MIT, 1M context, dual thinking modes. Legacy V3.2 IDs are auto-routed to V4 Flash and retire 2026-07-24.

Model	Context	Input / Output per 1M	Capabilities
DeepSeek V4 Pro	1M	$1.74 / $3.48	toolsthinkingstreaming
DeepSeek V4 Flash	1M	$0.14 / $0.28	toolsthinkingstreaming

Kimi (Moonshot AI)

K2.6 is SoTA on agentic coding — 58.6 on SWE-Bench Pro (beats Opus 4.6), built for 300-sub-agent orchestration. 256K context.

Model	Context	Input / Output per 1M	Capabilities
Kimi K2.6	256K	$0.95 / $4.00	toolsvisionthinkingstreaming
Kimi K2.5	256K	$0.60 / $3.00	toolsvisionthinkingstreaming

Mistral AI

Same models the Aurora routing mode uses — pick them directly with your own Mistral key, or let Aurora orchestrate. EU infrastructure, Apache-2.0 open weights.

Model	Context	Input / Output per 1M	Capabilities
Mistral Large 3	262K	$0.50 / $1.50	toolsthinkingstreaming
Mistral Small 4	262K	$0.15 / $0.60	toolsvisionthinkingstreaming
Mistral Large	262K	$2.00 / $6.00	toolsvisionstreaming
Codestral	256K	$0.30 / $0.90	toolsstreaming
Devstral 2	262K	$0.40 / $2.00	toolsstreaming

Zhipu AI

GLM-5 reports 77.8% on SWE-Bench.

Model	Context	Input / Output per 1M	Capabilities
GLM-5	200K	$1.00 / $3.20	toolsvisionthinkingstreaming
GLM-4.7 Flash	128K	$0.07 / $0.40	toolsstreaming
GLM-4.5 Flash (Free)	128K	Free	toolsstreaming

Xiaomi

MiMo V2.5 — matches Claude Sonnet 4.6 on agentic multimodal, Gemini 3 Pro on Video-MME. Sustains 1,000+ sequential tool calls. Released 2026-04-22.

Model	Context	Input / Output per 1M	Capabilities
MiMo V2.5-Pro	1M	$1.00 / $3.00	toolsvisionthinkingstreaming
MiMo V2.5	1M	$0.40 / $2.00	toolsvisionstreaming

Custom (Ollama / LM Studio / vLLM / BYOM)

Point Ava at any OpenAI-compatible endpoint — local (Ollama, LM Studio, vLLM on your machine) or remote (private vLLM cluster, self-hosted finetune, OpenRouter, Together). Configure via Settings → Custom Model in the extension or IDE. You supply the base URL + model name; capabilities depend entirely on what you have running.

Model	Context	Input / Output per 1M	Capabilities
Your model	32K	Free	toolsstreaming

Tip: For benchmark scores, capability tags, and side-by-side filtering, see the full Models page at ava-supernova.com/models. Pricing shown above is our best read of each provider rate card at publication — always check the provider website before committing to a workload.

Persona orchestration

A "persona" is a helper role Ava plays behind the scenes for harder jobs — an explorer, a planner, a fact-checker. You never talk to them directly and you do not need to understand this page to use Ava. It is here for when you are curious how she breaks a big job into a small expert team.

24 specialists across 5 mode teams. The Conductor (the helper that coordinates the rest) decides which team runs, in what order, and with what context. Chat mode has no personas — it is Ava herself.

How orchestration runs

You send a message in Plan, Teach, Security, or Brainstorm mode (or a complex Work request).
The Conductor reads the request, picks the mode team, and builds an execution plan — some personas run sequentially, some in parallel waves.
Each persona has its own system prompt, a restricted tool allowlist, and a shared context pool of what previous personas found.
The final persona produces a summary, a plan, or a report, depending on mode.
Ava presents the result to you. You approve, tweak, or redirect.

The full roster below is reference only — no need to memorise who does what. Ava assembles the right team for you.

work team

Scout — Maps the codebase and surfaces current state before anyone plans or builds.
Architect — Designs the approach from Scout findings and produces the implementation blueprint.
Verifier — Fact-checks the plan and confirms assumptions are correct before Builder runs.
Sequencer — Breaks the verified plan into ordered, dependency-aware implementation steps.
Challenger — Questions the plan to prevent over-engineering and surfaces simpler alternatives.
Builder — Executes the verified and sequenced plan to deliver the final implementation.

plan team

Researcher — Gathers evidence about competitors, trends, and user needs before strategy decisions.
Architect — Analyses codebase patterns and proposes strategic approaches with trade-offs.
Challenger — Challenges strategic decisions, guards timing, and prevents scope creep.

teach team

Curriculum Architect — Designs learning paths with dependency ordering and balanced progression.
Content Writer — Writes clear explanations with examples and analogies tailored to the learner.
Fact Checker — Verifies lesson content is accurate and flags outdated information or errors.
Quiz Master — Creates assessments that test understanding rather than memorisation.
Tutor — Delivers lessons with Socratic guidance, checks understanding, adapts to pace.

security team

Recon — Maps the attack surface — entry points, tech stack, data flows.
Scanner — Systematically checks each OWASP category against the identified attack surface.
CVE Researcher — Looks up known CVEs in dependencies and assesses real-world impact.
Verifier — Confirms each finding is real and eliminates false positives with proof.
Reporter — Structures verified findings into a severity-sorted, actionable report.

brainstorm team

Explorer — Mines user context and memory to build a profile for personalised ideation.
Researcher — Researches market gaps, trends, and demand signals to ground ideation.
Ideator — Generates specific, actionable ideas tailored to the person and the market.
Challenger — Stress-tests ideas and cuts weak ones ruthlessly against reality.
Refiner — Sharpens surviving ideas into concrete next steps and validation tests.

Tip: Work mode only invokes the full team for complex tasks. Simple edits run Ava directly — no orchestration overhead.

CLI commands

This page is for the CLI — the version of Ava you type to in a terminal window, rather than clicking buttons. If you use the extension or the desktop app, you can skip it entirely. It is here for people who prefer the keyboard.

The CLI is a REPL — a prompt that waits for you to type, runs what you typed, then waits again. These slash commands (instructions that start with "/") work from the prompt.

/help — list commands and current mode.
/mode <name> — switch mode (work, plan, chat, teach, security, brainstorm).
/model <id> — switch model without leaving the session.
/clear — clear the current conversation. Memory is untouched.
/history — list past conversations in this project.
/memory — open the memory browser.
/tasks — list and manage tasks.
/journal — open today journal entry.
/settings — open the settings menu.
/exit — quit the REPL.

Launch flags

Flags are extra options you add when you start Ava, each beginning with "--", to set how she opens.

--mode <name> — start in a specific mode.
--model <id> — start with a specific model.
--permission <mode> — strict / balanced / autonomous.
--project <path> — set the project root explicitly.

Configuration

This page shows where Ava keeps her settings and files on your computer. You almost never need to touch any of it by hand — the app manages it for you. It is here for when you want to see, back up, or share those files yourself.

Ava reads configuration (her saved settings) from two scopes. Global settings — the ones that apply everywhere — live in a folder called ~/.ava/ (the "~" is shorthand for your home folder). Per-project settings live in a .ava/ folder inside the project itself.

Global — ~/.ava/

settings.json — model preferences, permission mode, personality, language.
memory/ — your global memories (preferences, decisions that apply everywhere).
tasks.json — cross-project task list.
journal/ — daily journal entries, date-organised.

Project — .ava/

instructions.md — project-specific instructions Ava always loads. Durable context ("we use pnpm, not npm", "API keys live in 1Password").
context.md — optional freeform project notes.
memory/ — memories scoped to this project.
datasets/config.json — opt-in dataset capture configuration.

Tip: Add .ava/ to your .gitignore (a list of files Git is told to leave out of the shared project) if you do not want to share memories with your team. Keep instructions.md in the repo (the shared project folder) — that is how Ava learns the project conventions.

Keyboard shortcuts

Keyboard shortcuts are optional speed-ups — every one of them is also a button you can click. Look here only if you want to drive Ava faster from the keyboard.

Shortcuts by surface (each of the three places Ava runs: the VS Code extension, the desktop app, and the terminal). Customise them in each surface settings. No need to memorise the list below.

Action	Windows / macOS	Surface	Description
Open Chat	`Ctrl+Shift+A` / `Cmd+Shift+A`	extension	Open or focus the Ava chat panel.
Focus Input	`Ctrl+Escape` / `Ctrl+Escape`	extension	Jump cursor into the chat input field.
Switch to Work	`Ctrl+Shift+1` / `Cmd+Shift+1`	ide	Activate Work mode (>>).
Switch to Plan	`Ctrl+Shift+2` / `Cmd+Shift+2`	ide	Activate Plan mode (::).
Switch to Chat	`Ctrl+Shift+3` / `Cmd+Shift+3`	ide	Activate Chat mode (..).
Switch to Teach	`Ctrl+Shift+4` / `Cmd+Shift+4`	ide	Activate Teach mode (??).
Switch to Security	`Ctrl+Shift+5` / `Cmd+Shift+5`	ide	Activate Security mode (!!).
Switch to Brainstorm	`Ctrl+Shift+6` / `Cmd+Shift+6`	ide	Activate Brainstorm mode (**).
Cancel Run	`Escape` / `Escape`	cli	Cancel an in-progress agent execution.
Cancel Run (hard)	`Ctrl+C` / `Ctrl+C`	cli	Interrupt an in-progress agent execution.
Interject	`Enter` / `Enter`	cli	Send a mid-run message to the agent without cancelling.

Project context

This explains how Ava reads up on your project before she starts — the files she looks at to understand what you are working on. You do not need to set any of this up; she does it on her own. Read it if you want to know where to put a note so she always sees it.

Ava loads several layers of context (background information she holds in mind) when you open a project. Understanding the layers helps you put information in the right place.

Loaded automatically

.ava/instructions.md — treated as durable, high-priority guidance. Always in context.
.ava/context.md — freeform notes. Loaded, lower priority.
README.md — scanned on first run for project summary.
package.json / pyproject.toml / Cargo.toml — the small files that list a project setup; read to detect the language, framework, and test command.
.gitignore — the list of files to leave out of version control; respected by all file tools (glob, grep, project_index).

Loaded on demand

"On demand" means Ava only fetches these when a task actually needs them, rather than up front.

Project index — a structured map of your files and the named pieces inside them (functions, classes — "symbols") and how they connect ("imports"). Built lazily, meaning only when Ava needs it.
Git history — the record of past changes; pulled when git_status, git_diff, or git_create_pr are called (git is the tool that tracks every change to a project; "PR" is a pull request, a proposed batch of changes).
Memory — entries are matched by meaning ("semantically") and injected one turn at a time, not all at once.

Tip: Put conventions in instructions.md. Put notes in context.md. Let memory capture the rest on its own.

Desktop budget caps

Note: Available in the Ava Supernova IDE only. Budget limits for desktop automation trajectories.

When Ava controls your desktop on your behalf — clicking, typing, opening apps — each run is called a "trajectory". This page explains the safety limits that stop any one run from going on too long or costing too much. The takeaway: she always stops and checks in before overrunning. The numbers are here if you want the detail.

Every trajectory runs under three simultaneous caps (hard limits). The first one hit wins. The trajectory never silently continues past a breach — the Narrator (the helper that talks you through what is happening) pauses and asks what to do.

Three hard caps (default)

Step count — 30 steps
Token budget — 500,000 tokens
Wall-clock — 5 minutes

Task cost bands

Simple (search docs, open a page) — 5–10 steps, ~70–135K tokens, ~$0.04–0.08
Medium (log in, check logs, download output) — 15–25 steps, ~205–340K tokens, ~$0.12–0.21
Complex (triage three GitHub issues) — 25–30 steps, ~340–410K tokens, ~$0.21–0.25

Free tier includes 3M tokens monthly — roughly 15 medium tasks or 21 complex ones if you only use desktop automation. Mixed use (chat + code + a few automations) is comfortable.

Tip: You can override caps per-task at start, but breaches always surface a decision point. Budget is never silently burned.

Desktop session whitelist

Note: Available in the Ava Supernova IDE only. Scoped app/site permissions for desktop trajectories.

A "whitelist" is simply the list of apps and websites you have said Ava is allowed to touch while she controls your desktop. This page explains how you grant that permission and why it resets each time. The short version: she can only act where you let her, and she asks before going anywhere new.

At the start of a desktop session Ava asks where she is allowed to act. Plain English works: "Gmail, Cursor, the Azure portal." She reads that and confirms the match back to you ("Gmail web, Cursor the app, portal.azure.com").

How it enforces

Looking-only actions — taking a screenshot, reading the list of buttons and fields on screen (the "UIA tree", Windows own map of what is in a window) — are allowed everywhere, because you need to see what is on screen to decide whether to add an app to the whitelist. Anything that changes something is blocked outside the list; Ava pauses and asks whether to add it.

Session-scoped, deliberately

The whitelist does NOT persist across sessions. There is no "remember this app" option. Every new trajectory starts with a fresh scope. Inconvenient by design — forgetting to reauthorise is a feature, not a bug.

Mid-trajectory additions

During a trajectory you can add apps from the header: "+ Allow app" opens a small input. Tap Add and the new app is live for the rest of the session.

Knowledge packs

In plain terms: hand Ava a folder of your own documents, and she can read from them to answer your questions. A "knowledge pack" is just that bundle of files — your reference material she is allowed to draw on.

Drop a directory of reference material (specs, RFCs, meeting notes, API docs) into a knowledge pack. Ava can pull the relevant pieces into context when a conversation touches the topic.

Packs are indexed locally. Retrieval is semantic — Ava finds the right document by meaning, not filename. ("Semantic" means she matches on what a document is about, so you do not need to remember the exact words or the file name.) Packs can be shared with a team or kept private.

Creative Studio

Want to make an image, a short video, a song, or a spoken voiceover? Just describe it and Ava makes it for you — no design or audio software needed.

Image, video, music, and voice generation — built into the same workflow as your code. Generate an icon, a product mockup, a demo video, a voiceover for a tutorial. Powered by MiniMax.

What you can make

Images — prompt to PNG/JPG. Background removal one call away.
Video — short AI videos from a text prompt (MiniMax Hailuo).
Music — instrumental or vocal tracks with lyrics.
Voice — text-to-speech with selectable voices.

Creative Studio costs deduct from the same token allowance as coding — one pool, one number. BYOK your own MiniMax key and it is free to your plan.

Tip: Creative Studio is free-tier eligible. Not a paid add-on.

Office Suite

Need a report, an email, or a document written up? Ava can draft it and hand you a real Word, Excel, or PDF file you can open and edit like any other.

Reports, emails, and documents — drafted, formatted, and exported to native formats (.docx is Word, .xlsx is Excel, .pdf is a PDF).

report_generate — a structured .docx pulling from tasks, journal, memory, and project state.
email_draft — tone-controlled email to a .docx file.
document_manage — create, read, edit, export across Word, Excel, PDF, CSV, Markdown.

Useful when Ava is writing on your behalf — status reports, kickoff decks, client updates. The .docx lands in your project where you can polish it.

Daily briefing

In plain terms: a short "here is where you left off" note each morning, so you can pick up without rereading everything.

Morning summary: yesterday journal, open tasks, project state, and any reminders Ava has set. Optional — turn it on in settings.

The briefing uses memory to surface relevant context ("last time you worked on the auth flow, you noted you wanted to revisit the refresh token strategy"). Not a news feed — a focused catch-up.

Workflows

In plain terms: teach Ava a routine you do over and over, give it a name, and she can run the whole thing again on demand. A "workflow" is just that saved routine — a set of steps Ava repeats for you.

Save a sequence of steps you find yourself repeating — review PR → run tests → write release notes → draft the email. A workflow is a reusable plan Ava can replay with different inputs.

Workflows are stored per project in a folder named .ava/workflows/ inside your project. Share them with your team by committing the directory (saving it into your shared project so teammates get it too).

Events and notifications

In plain terms: while Ava works you get a tidy step-by-step list of what she is doing, and a ping when a long job finishes so you can step away.

While Ava is running you see structured updates, not a wall of text. Each tool call, each persona handoff, each streamed delta is a discrete event in the UI — expandable for detail, collapsible for overview.

Background runs emit OS-level notifications when they complete. You can walk away from a long build and Ava will tell you when it is done.

Personality Designer

In plain terms: dial in how Ava talks to you — chatty or brief, formal or casual, more jokes or fewer — so she sounds the way you like.

Shape how Ava talks to you. Tone, energy, verbosity (how much she says), formality, humour. Sliders and presets — terse engineer, warm mentor, dry colleague, or your own blend.

Personality affects phrasing, not competence. She still runs the same tools and follows the same permission rules. Her name is locked to Ava — that is non-negotiable.

Desktop Automation

Note: Available in the Ava Supernova IDE only. The VS Code extension and CLI/companion don't ship desktop-automation tools — driving the OS needs a native shell, which is what the Tauri IDE provides. Open the IDE to use this feature.

In plain terms: Ava can take over your mouse and keyboard to do tasks for you — open an app, click buttons, fill in a form — while you watch and approve anything important.

A new mode for Ava, prefix @@. She observes the desktop, decides what to do, and drives UI automation (controlling the apps on your screen for you) to get it done. Runs on your actual machine with your actual credentials — not a cloud browser. Available on every tier once shipped.

Why another one

Every other desktop-automation product is something you start when you're at your desk. Ava is the one that's still there when your error tracker fires at 2am — IDE keeps running, your phone pairs over a secure channel, you approve irreversibles with a tap.

She also remembers how to navigate your apps across sessions. Every other agent relearns them every time. The memory differentiator compounds with use.

How it works

Each step runs a five-persona wave: Scout observes, Planner picks the next action, Actor executes, Verifier checks, Narrator tells you what happened. The sidecar chooses the cheapest reliable grounding — UIA for native apps, Playwright for browsers, OmniParser vision when nothing else has a useful tree.

See the concepts pages for safety ontology, grounding hierarchy, kill switches, and the session whitelist model.

Common errors

Most errors have a short, specific cause. Here are the common ones and how to fix them. Do not worry if the wording sounds technical — each one below tells you what you are actually seeing and exactly what to do.

Model not responding

What you see: Ava goes quiet and does not reply.

Check the status indicator in the chat header. If it says "connecting", your provider (the AI service Ava is talking to) is reachable but slow — give it thirty seconds. If it says "unavailable", Ava auto-fell-back to the next model in the chain (she automatically switched to a backup model). If nothing at all, verify your API key in Settings and test the connection. (An API key is the private password that lets Ava use a paid AI service.)

Tool call failed

What you see: one of Ava's steps (a "tool call" — her reading a file, running a command, and so on) shows a red error instead of finishing.

Click the failed tool call to see the error. Common causes: she tried to touch a file outside your project folder, which is blocked on purpose for your safety (this guard is called "path traversal" protection — it stops her wandering outside the folder you are working in); a command that needs to run in a different folder; or a file that got moved since Ava last looked at it. Tell her what went wrong — she will retry with corrections.

Memory not recalling

What you see: Ava cannot find something you are sure she should remember.

Memory works by meaning, not exact words (this is what "semantic" means). So "What did I decide about auth?" works better than "find my auth decision". If the memory is definitely there but not surfacing, open the memory browser and search — if it is not there, it was never saved in the first place.

Sign-in loops

What you see: you sign in, but Ava keeps asking you to sign in again — a loop that never lets you through.

The fix is to clear the saved sign-in details and start fresh: in the extension run "Ava: Sign Out" from the command palette, in the IDE use Settings → Account → Sign Out, in the CLI delete the file at ~/.ava/auth.json (a small sign-in file in a folder named .ava in your home directory). Then sign in again.

Token exhausted

What you see: a message that you are out of free usage for the month. ("Tokens" are how AI usage is measured — "exhausted" just means you have used up this month's free allowance.)

Free-tier monthly budget resets on the 1st of each month. Until then, switch to a BYOK provider (use your own API key) to keep working. Your account remains active — only the free platform-managed requests are paused.

Where logs live

A "log" is just a diary your computer keeps of what happened — useful when something breaks and you (or support) need to see the details.

When you need the raw picture — what tool was called, what information it got, what the AI service returned — the logs are plain text files in a folder on your computer at ~/.ava/logs/ (a folder named .ava in your home directory, with a logs folder inside it).

~/.ava/logs/session-<date>.log — per-session transcripts including tool calls, streamed responses, and errors.
~/.ava/logs/extension.log — VS Code extension host log. Activation failures, panel errors.
~/.ava/logs/ide.log — desktop IDE log. Sidecar process messages, Rust panics.
~/.ava/logs/cli.log — CLI REPL log.

In the extension, "Ava: Show Logs" in the command palette opens the relevant log in an editor tab. In the IDE, Settings → Diagnostics has a "Open Logs" button. The CLI can tail its own log with ava logs --follow.

Warning: Logs may contain snippets of the project files Ava read during a session. Review before sharing.

Filing a support request

In plain terms: if something is wrong and the fixes above did not help, send us a message about it. The easiest way is to just ask Ava to do it for you.

If a problem is not covered by common errors, file a support ticket (send the support team a request for help). You can do it inside Ava (no form to fill out) or on the web.

From inside Ava

Ask her to file it. "Ava, can you file a support ticket about this?" She calls support_request, attaches the relevant session context (with credentials redacted), and sends it. Fastest path.

From the web

Visit ava-supernova.com/support. Pick a category (bug, feature, account, security, general) and describe what happened. Response within one working day.

What to include

What you were trying to do.
What actually happened.
Which surface (extension, IDE, CLI, companion) and which version.
Any error message or log excerpt. Redact credentials first.

Reporting security issues

In plain terms: if you have spotted a way Ava could be broken into or misused (a "vulnerability" — a security weakness), please tell us privately first so we can fix it before anyone can take advantage. This page is how.

Found a vulnerability? Thank you. We take every report seriously and work with you on the fix before any public disclosure (before the problem is announced publicly).

How to report

Email security@ava-supernova.com with a description, reproduction steps, and your preferred disclosure window.
Or file a support ticket with category "security" — same triage queue.
We acknowledge within 24 hours.

Our commitments

Coordinated disclosure — we agree a timeline with you before anything is public. Default is 90 days.
Credit — you are credited in the fix notes unless you prefer anonymity.
No legal retaliation — good-faith research is welcome. We will not send lawyers after you.

Note: The security page at ava-supernova.com/security has the full posture — architecture guarantees, redactor, permission gates.