NeuroRouter — Context Operating System for Agents

NeuroRouter sits between Claude Code, Codex, and the model API. It keeps the live model window focused on the work that still matters: source transcript to semantic field to target-model context.

It is the answer to a very specific failure mode: one long session becomes stale, expensive, and fragile, but the operator still needs continuity. NeuroRouter preserves the active work field, repairs safe tool-chain breaks before they become upstream 400s, and nudges the agent away from dead loops before the session burns more money than progress.

When teams also use Hiveram, NeuroRouter does not become the shared truth. Hiveram stores the work graph and portable bundles. NeuroRouter decides what slice of that graph enters the live model window right now.

More context is not control. NeuroRouter projects the slice of work the model is allowed to act on now.

The four promises

Architect, recall, rocket

This is the operator workflow the product is growing toward: architect the work once, rehydrate a focused execution session with the right briefing, and launch a rocket-sized task on the cheapest capable surface.

Get The Free Version

The community edition is published at obstalabs/neurorouter. Install it with Homebrew, Scoop, or download the release binaries directly.

Homebrew

macOS or Linux via the public Obsta Labs tap.

brew tap obstalabs/tap
brew install obstalabs/tap/neurorouter

Scoop

Windows via the public Obsta Labs bucket.

scoop bucket add obstalabs https://github.com/obstalabs/scoop-bucket
scoop install obstalabs/neurorouter

Direct Binary

Download tarballs and zip archives from the release page.

https://github.com/obstalabs/neurorouter/releases/latest

Four compiler passes

One product, one job: preserve what the next model call needs, remove what no longer carries work, and prove the result is still trustworthy.

Proof of continuity

The neurorouter integrity command summarizes whether sessions stayed healthy, degraded, or critical using support-safe session evidence. It reports size, RCS, anchor preservation, prevented failures, and integrity downgrades so compiler claims can be checked against real sessions instead of trusted as a demo metric.

How it works

Verified with Claude Code and Codex CLI. Other OpenAI-compatible tools require compatibility testing before support is claimed; generic chat-completions clients such as Qwen Code are not currently advertised as supported. Provider credentials pass through to configured upstream APIs; NeuroRouter does not phone home with request content or store provider keys on disk.

Proof and mechanics

The request log shows what was sent, what was compiled away, and whether decisions, constraints, and rejected approaches survived:

RCS is a request-level continuity score. It is useful, but it is not the whole health model: Session Integrity can invalidate a green RCS when the active objective is stale, the workspace lock conflicts, recovery had to disable major shaping stages, or loop/progress signals say the agent is stuck. Smaller context is not automatically better. Correct context is.

Vector Lock keeps the active constraint set: objective, chosen approach, current state, hard constraints, unresolved blockers, and rejected approaches. Workspace Identity Lock keeps the allowed repo, path, and release target explicit. These are not chat memory, RAG, or learning. They are the minimal local state that lets NeuroRouter compile the next request without losing the work.

What NeuroRouter is not

Security

NeuroRouter runs locally and forwards provider credentials only to the upstream provider you configure. It does not phone home with request content or store provider keys on disk. Detected credentials are redacted or blocked before forwarding according to policy.

This is a structural difference from cloud LLM proxies. A 2026 study found 26 LLM proxy services collecting user credentials. The LiteLLM supply-chain breach (March 2026) compromised thousands of organizations. A local proxy removes the hosted credential database from this path; it is not a promise to catch every encoded, chunked, or transformed secret.

When it pays for itself

NeuroRouter Pro is most useful when AI coding sessions get long, expensive, or fragile. It keeps useful context alive, removes stale transcript drag, protects detected secrets, repairs safe tool-chain continuity breaks before provider 400 errors, and warns when a session is no longer making trustworthy progress.

It is not a replacement for the model, a hosted gateway, memory, RAG, or autonomous agent brain. It is the live-window layer in a wider stack that can also include Hiveram for shared truth and portable handoff.

Pricing

Free

$0 — AGPL v3, self-hosted

Context hygiene. Deterministic shaping removes stale reads, repeated reminders, and detected secrets — locally, zero setup. The foundation.

Detected secret blocking, context shaping metrics, and local audit output
One local process per live session — Claude or Codex, not both
Use freely under AGPL terms

Pro

$29 / month

Deterministic context engineering. Vector Lock, Session Integrity, anchor preservation, graduated nudges, and local recovery. Every transformation is pattern-matched — no LLM calls, no network dependency, no non-determinism. The proxy is a mirror, not an oracle.

Everything in Free
Vector Lock for active decisions, constraints, blockers, and rejected approaches
Session Integrity and Workspace Identity Lock for stale-objective, wrong-repo, loop, and recovery downgrades
Support-safe continuity evidence with RCS and anchor preservation
Claude Code and Codex in one local process; other tool surfaces require explicit compatibility validation
Continuity repair or local blocking for malformed tool chains, oversized requests, and retry spirals
Cache-aware shaping — shapes past the provider cache boundary, not through it
No AGPL obligations

Team

$49 / seat / month

LLM-augmented context intelligence. Everything in Pro plus: a small model (Haiku/Mini) runs parallel to extract objectives, constraints, and approaches that pattern matching cannot reach. Shared policy, session evidence analysis, and org-wide enforcement.

Everything in Pro (deterministic layer stays unchanged)
LLM-powered vector state extraction — understands intent, not just patterns
Shared routing rules and context shaping policy
Session evidence analysis — mine sessions for cost, security, and workflow patterns
Aggregate insights (patterns, not people)
LLM calls are flag-gated, transparent, and cost-capped ($0.05/session)

Enterprise

Custom pricing

Control AI usage at scale without losing speed. Org-wide policies, secure routing, and protection against data leaks, runaway cost, and workflow breakdowns.

Everything in Team
Path obfuscation — usernames, internal URLs, and org paths are obfuscated before forwarding when enabled
Compliance pipeline — export to Splunk, Datadog, Elastic
Hosted gateway option — zero ops
Up to 1M context window support
Dedicated support

Install Pro or Team

After checkout (or starting the 14-day trial), install the Pro build with Homebrew and activate your license key. Pro replaces the free neurorouter binary with the same command name — you do not run both at once.

1. Install

macOS or Linux via the public Obsta Labs tap.

brew tap obstalabs/tap
brew install obstalabs/tap/neurorouter-pro

2. Activate

Use the license key emailed at checkout or shown on the trial start page.

nr activate <your-license-key>

3. Launch

Start the local proxy and prepare your coding agent in one command.

nr launch claude
nr launch codex

Direct download: latest Pro release. Team and Enterprise tiers use the same binary — license keys carry the seat entitlements. Lost your key? Email hello@obstalabs.dev.