what becomes possible

When the agent is built this way

Other AI tools answer questions. Ava holds the thread, dispatches specialists, switches modes mid-conversation, runs on your own model, and remembers what you decided three weeks ago. Seven capability classes nobody else can match — shown as the actual conversations they make possible.

Continuity

She remembers. So you don't have to.

Five layers of memory hold the thread across sessions, devices, and weeks. Pick up exactly where you left off, even if you forgot where that was. Most agents reset on every conversation.

Coming back after three weeks

you › right, where were we

pulling project memory: ava-supernova-platform

last session 2026-04-12: wired Stripe webhooks for checkout.session.completed

left a TODO: still need to verify subscription downgrades

want me to pick up the downgrade flow, or change direction?

A decision from last month resurfaces

you › should I switch to prisma here

memory_recall: 2026-04-08

you decided on Drizzle over Prisma — reason: zero-overhead types, no codegen step

sticking with that, the type situation has not changed

if you want to revisit the decision say so
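One way to picture the recall in the exchange above is a small decision log that is searched by keyword. This is only a sketch: the `Decision` shape, the `DecisionLog` class, and the keyword matching are assumptions made for illustration, not Ava's actual memory layers.

```typescript
// Hypothetical shape of a remembered decision (illustrative, not Ava's schema).
interface Decision {
  date: string; // ISO date, so string order equals chronological order
  keywords: string[];
  choice: string;
  reason: string;
}

class DecisionLog {
  private entries: Decision[] = [];

  save(decision: Decision): void {
    this.entries.push(decision);
  }

  // Return the most recent decision with a keyword that appears in the question.
  recall(question: string): Decision | undefined {
    const q = question.toLowerCase();
    const matches = this.entries.filter((d) =>
      d.keywords.some((k) => q.includes(k.toLowerCase()))
    );
    matches.sort((a, b) => a.date.localeCompare(b.date));
    return matches[matches.length - 1];
  }
}

const log = new DecisionLog();
log.save({
  date: "2026-04-08",
  keywords: ["drizzle", "prisma"],
  choice: "Drizzle over Prisma",
  reason: "zero-overhead types, no codegen step",
});

const hit = log.recall("should I switch to prisma here");
```

Here `hit` holds the 2026-04-08 entry, which is enough for the agent to restate the earlier choice and its reason before offering to revisit it.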

Persona orchestration

24 specialists. One conversation.

Every Work-mode task spawns a team. Scout finds the files, Architect designs the approach, Verifier checks that the tests exist, Sequencer orders the steps, Challenger questions the plan, Builder writes the code. You watch the choreography happen in the stream.

Build a REST endpoint

you › add a /v1/refunds endpoint

Scout — 4 relevant files: routes/, models/, tests/

Architect — POST /v1/refunds, idempotency-key required, mirror /v1/charges

Challenger — what about partial refunds and currency conversion edge cases?

Sequencer — model → migration → route → tests → docs

Builder — writing src/routes/refunds.ts (this is the streaming you watch live)
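The hand-off between specialists can be sketched as a simple sequential pipeline in which each persona sees the notes left by the ones before it. Everything here (`Persona`, `runTeam`, the placeholder note text) is hypothetical and only illustrates the shape of the choreography, not how Ava schedules its team.

```typescript
// Hypothetical persona interface: each one reads earlier notes and adds its own.
interface Persona {
  name: string;
  run(task: string, notes: readonly string[]): string;
}

// Run the team in order and stream every note as soon as it is produced.
function runTeam(
  task: string,
  team: Persona[],
  onNote: (line: string) => void
): string[] {
  const notes: string[] = [];
  for (const persona of team) {
    const note = `${persona.name}: ${persona.run(task, notes)}`;
    notes.push(note);
    onNote(note);
  }
  return notes;
}

// Placeholder bodies; real personas would call a model and tools.
const team: Persona[] = [
  { name: "Scout", run: (task) => `finding files for "${task}"` },
  { name: "Architect", run: () => "designing the approach" },
  { name: "Verifier", run: () => "checking that tests exist" },
  { name: "Sequencer", run: () => "ordering the steps" },
  { name: "Challenger", run: (_task, notes) => `questioning ${notes.length} earlier notes` },
  { name: "Builder", run: () => "writing the code" },
];

const streamed: string[] = [];
const notes = runTeam("add a /v1/refunds endpoint", team, (line) => streamed.push(line));
```

The `onNote` callback stands in for the live stream: each line is emitted the moment its persona finishes, rather than after the whole team is done.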

Security audit

you › !! audit this project

Recon — mapping attack surface across 47 routes

Scanner — running OWASP top-10 + dependency CVE checks

CVE Researcher — flagged 2 issues with exploit context

Verifier — confirmed reproducible in staging

Reporter — generated findings.docx, 11 pages, exec summary on page 1

Cross-mode handoffs

Plan it, build it, audit it, teach it. One thread.

Type a two-character prefix to switch how Ava thinks. Plan (::) researches read-only, Work (>>) builds, Security (!!) audits, Teach (??) explains. Same conversation, same memory, different mindset on demand. The conductor weaves them together.

A whole feature, end to end

you › :: design an OAuth flow for our platform

[Plan mode] Researcher gathering OAuth 2.1 spec, our existing auth surface

Architect proposing PKCE + refresh token rotation

you › >> ok build it

[Work mode] Builder writing /v1/oauth/authorize + callback

you › !! audit what you just built

[Security mode] CVE Researcher — token endpoint missing rate limit

you › ?? explain the PKCE choice for the team

[Teach mode] Tutor — let me walk through why PKCE matters here...

Bug → research → fix in one breath

you › tests are failing on the date logic

[Work] Scout — 3 failing tests in lib/dates.test.ts

you › :: research the issue first

[Plan] Researcher — confirmed Intl.DateTimeFormat behaves differently on Node 22

you › >> apply the fix

[Work] Builder writing the polyfill, all tests green
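The prefix switching shown in these conversations can be sketched as a small parser. The four prefixes come from the transcripts above; the names `Mode`, `PREFIXES`, and `parseTurn`, and the rule that an unprefixed message stays in the current mode, are assumptions for illustration.

```typescript
type Mode = "plan" | "work" | "security" | "teach";

// Prefix table taken from the example conversations.
const PREFIXES = new Map<string, Mode>([
  ["::", "plan"],
  [">>", "work"],
  ["!!", "security"],
  ["??", "teach"],
]);

interface Turn {
  mode: Mode;
  text: string;
}

// Split a raw message into mode and text.
// Assumption: a message without a known prefix keeps the current mode.
function parseTurn(raw: string, current: Mode): Turn {
  const trimmed = raw.trim();
  const mode = PREFIXES.get(trimmed.slice(0, 2));
  if (mode === undefined) {
    return { mode: current, text: trimmed };
  }
  return { mode, text: trimmed.slice(2).trim() };
}

const planTurn = parseTurn(":: research the issue first", "work");
const plainTurn = parseTurn("tests are failing on the date logic", "work");
```

Because the mode is derived per message, the conversation history and memory can stay shared while only the active mindset changes.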

Three orchestrated routing fleets

You pick the strategy. Ava picks the model.

Maestro routes across Qwen tiers for cost efficiency in production. Supernova ensembles DeepSeek, Qwen, and flash-tier specialists. Aurora keeps everything on EU-sovereign Mistral models. Same agent, same conversation, different fleet underneath.

Maestro on a coding session

you › fix the failing tests

[Maestro] coordinator: Qwen 3.7 Plus

Builder spawn: Qwen 3.7 Plus (Terminal-Bench leader)

Verifier: Qwen 3.5 Flash (cheap classifier — depth ≤ 2)

cost: 4 credits this turn

Aurora for an EU-sovereign deployment

you › design the migration path

[Aurora] coordinator: Mistral Large 3 (EU-only routing locked)

Researcher: Mistral Large 3 (long-context synthesis)

Builder: Mistral Medium 3.5 (77.6% SWE-Bench Verified)

no cross-routing fallback: zero non-EU models in the loop
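As a rough sketch, fleet routing can be thought of as a per-fleet lookup from persona role to model, where a missing role falls back only within the same fleet. The model names are the ones shown in the two sessions above; the `FLEETS` table, `pickModel`, and the fall-back-to-coordinator rule are assumptions, and Supernova is left out because the page does not list its specific models.

```typescript
type Fleet = "maestro" | "aurora";
type Role = "coordinator" | "builder" | "verifier" | "researcher";

interface FleetConfig {
  euOnly: boolean;
  models: Partial<Record<Role, string>>;
}

// Model assignments copied from the example sessions; the structure is hypothetical.
const FLEETS: Record<Fleet, FleetConfig> = {
  maestro: {
    euOnly: false,
    models: {
      coordinator: "Qwen 3.7 Plus",
      builder: "Qwen 3.7 Plus",
      verifier: "Qwen 3.5 Flash",
    },
  },
  aurora: {
    euOnly: true,
    models: {
      coordinator: "Mistral Large 3",
      researcher: "Mistral Large 3",
      builder: "Mistral Medium 3.5",
    },
  },
};

// Pick a model for a role. Assumption: an unlisted role uses the fleet's own
// coordinator model, so a request never crosses into another fleet.
function pickModel(fleet: Fleet, role: Role): string {
  const config = FLEETS[fleet];
  const model = config.models[role] ?? config.models.coordinator;
  if (model === undefined) {
    throw new Error(`fleet ${fleet} has no model for ${role}`);
  }
  return model;
}

const auroraVerifier = pickModel("aurora", "verifier");
```

Under these assumptions `auroraVerifier` resolves to a Mistral model even though Aurora lists no verifier, which is the property the EU-only lock is meant to guarantee.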

Sovereign + local

Run her fully offline. On your own model. Free. Forever.

Custom Model card → point at Ollama, LM Studio, vLLM, or any OpenAI-compatible endpoint. Full agent — 60 tools, 24 personas, 5-layer memory — with $0 ongoing cost. The floor is free, not a free trial. No account, no telemetry, no cloud.

Local Llama on a 5-year-old laptop

you › review my pull request

provider: generic (http://localhost:11434/v1)

model: llama-3.3-70b (your local Ollama)

Scout, Architect, Builder all running on local weights

no API calls. no telemetry. nothing leaves the machine.
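For readers wondering what "OpenAI-compatible endpoint" means in practice, the request sent to the local server looks roughly like the one built below. The `/chat/completions` path and the `model` plus `messages` body follow the OpenAI chat completions format that Ollama, LM Studio, and vLLM expose; `buildChatRequest` itself is a hypothetical helper, and the model name must match whatever your local server actually serves.

```typescript
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Build (but do not send) a chat completion request for an OpenAI-compatible server.
function buildChatRequest(
  baseUrl: string,
  model: string,
  messages: ChatMessage[]
): ChatRequest {
  const url = baseUrl.replace(/\/+$/, "") + "/chat/completions";
  return {
    url,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// Base URL and model name taken from the transcript above.
const req = buildChatRequest("http://localhost:11434/v1", "llama-3.3-70b", [
  { role: "user", content: "review my pull request" },
]);
```

Passing `req.url` and `req.init` to `fetch` would send the request to the local server, so the prompt never leaves the machine.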

Air-gapped enterprise environment

you › audit this codebase for vulnerabilities

security mode active — 5 personas spawning

inference: on-prem vLLM cluster, qwen3-coder-32b

memory + journal + tasks: local disk only

output: findings.docx, never leaves your network

One memory, every surface

Capture on the train. Build at the desk. Walk it through on the phone.

IDE, VS Code extension, CLI, and Companion all share the same agent and the same memory. The thought you voice-noted on the train this morning is in the project context when you sit down to code at 9.

Voice on the move → context at the desk

you › [Companion 8:14am, voice] remember to refactor the auth middleware, it has the secret-leak bug we discussed last week

memory_save: project/auth-middleware-refactor

cross-referenced 2026-04-22 conversation about secret_request handles

[IDE 9:32am] task surfaced in daily briefing — top of list

Pick up an IDE conversation in the extension

you › [IDE → extension] same conversation, different surface

conversation history: stored locally, encrypted at rest

sync via your account, end-to-end encrypted

persona team mid-conversation? continues from where you left off

no "let me catch you up" — it never lost context

Free for everyone

Every feature. Every model. Every user. Free.

Plans only scale credits and rate limits. They never gate features. Pro doesn't unlock anything Free can't do. Local stack costs $0 forever. Aurora EU sovereign mode is on every plan. Creative Studio is on every plan. The 24 personas are on every plan. Everything works for everyone.

What Free actually gets

all 6 modes — Work / Plan / Chat / Teach / Security / Brainstorm

all 60 tools, all 24 personas, all 3 routing fleets

Creative Studio — image, music, voice, video

300 credits/month + BYOK + local stack

no card. no account required for BYOK or local. no upsell.

And the deeper move

every conversation you opt in feeds the open dataset

the data closed labs collected for themselves —

we capture it for the next generation of open AI to train on

free now. open forever. that is the moat.

This is what becomes possible.

Free for everyone with a machine. No card. No account. No catch.

Install for VS Code

Try the Companion