Vortic team · 20 min read

Why we run on free OpenRouter models (and what to watch out for)

Llama 3.3 70B, DeepSeek V3, Qwen 2.5, Gemini 2.0 — the contrarian case for free, and the engineering you need around them.

Executive summary

Vortic defaults to a curated free-tier OpenRouter model roster so we can meter underwriting work per action without pricing every customer into enterprise-seat economics. Free inference is not free operationally: rate limits, flaky JSON, silent model churn, and uneven hallucination rates become your COGS in engineering hours. This article explains the model mix, the five reliability hazards we engineered around, three operational use cases (scenario / features / outcomes / benefits), and how token economics feed buyer-facing credit pricing.

The model roster and why each slot exists

  • Fast band — Llama 3.3 70B Instruct: triage, routing, lightweight transforms.
  • Balanced band — DeepSeek V3: most specialist agents (cost/latency sweet spot).
  • Heavy synthesis — Qwen 2.5 72B Instruct: memo merge where coherence matters most.
  • Multimodal — Gemini 2.0 Flash: messy schedules, scans, mixed layouts.

Assigning models per agent isolates failures: if extraction quality slips, you tune one subgraph—not the entire stack.

Economic leverage for underwriting SaaS

Approximate token economics taught us an uncomfortable truth: premium endpoints make sense for demos, but per-run margins collapse if every specialist fires an expensive model sequentially. Free tiers shift marginal inference toward zero so long as retry/fallback discipline holds—letting us expose transparent credit metering (full pipeline ≈ 18 credits at $0.10/credit during standard programmes) instead of hiding model subsidies inside opaque seat fees.
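The credit arithmetic above is simple enough to sketch. The constants below mirror the illustrative figures in this post, not a published rate card:

```typescript
// Illustrative credit maths only — figures match the example in the text,
// not an actual Vortic price list.
const CREDIT_PRICE_USD = 0.1; // assumed standard-programme rate
const FULL_PIPELINE_CREDITS = 18; // assumed full-pipeline metering

function runCostUsd(
  credits: number,
  pricePerCredit: number = CREDIT_PRICE_USD,
): number {
  return credits * pricePerCredit;
}

// A full pipeline run meters at roughly $1.80 under these assumptions.
console.log(runCostUsd(FULL_PIPELINE_CREDITS));
```

Because the meter is per action rather than per seat, a buyer can forecast cost directly from submission throughput.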

Buyers care about predictable unit costs at burst, not our provider religion.

The five operational hazards (and mitigations)

1. Aggressive rate limits. OpenRouter free tiers impose tight request budgets; parallel fan-out can hit HTTP 429 under concurrent desks.

Mitigation: streamWithRetry() with exponential backoff on 429/503; optional paid fallback via OPENROUTER_PAID_FALLBACK_MODEL (for example anthropic/claude-haiku-4.5) so operators never hard-fail mid-bind prep.
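A minimal sketch of that retry/fallback shape, assuming the real streamWithRetry() streams tokens rather than returning a string; callModel here is a stand-in for the OpenRouter request:

```typescript
// Hedged sketch — not the production streamWithRetry(). `callModel` stands
// in for an OpenRouter call that throws an error carrying an HTTP status.
type ModelCall = (model: string) => Promise<string>;

const RETRYABLE_STATUSES = new Set([429, 503]);

async function streamWithRetry(
  callModel: ModelCall,
  model: string,
  maxAttempts = 4,
  baseDelayMs = 500,
  // Assumed env var name from the text; test code can pass undefined.
  fallbackModel: string | undefined = (globalThis as any).process?.env
    ?.OPENROUTER_PAID_FALLBACK_MODEL,
): Promise<string> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await callModel(model);
    } catch (err: any) {
      if (!RETRYABLE_STATUSES.has(err?.status)) throw err;
      if (attempt === maxAttempts - 1) {
        // Backoff bucket exhausted: try the paid fallback once, if configured.
        if (fallbackModel) return callModel(fallbackModel);
        throw err;
      }
      // Exponential backoff with jitter: ~500ms, ~1s, ~2s, ...
      const delay = baseDelayMs * 2 ** attempt * (0.5 + Math.random());
      await new Promise((resolve) => setTimeout(resolve, delay));
    }
  }
  throw new Error("unreachable");
}
```

The key design choice is that the fallback fires only after the free-tier backoff bucket is exhausted, so paid spend stays an exception path rather than a default.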

2. Prose-before-JSON. Models often prepend conversational wrappers ("Sure! Here is the JSON…") that break a naive JSON.parse.

Mitigation: Strip wrappers; scan for the first balanced {...} block (a regex alone cannot match nested braces); validate schema; degrade to envelope types instead of crashing routes.
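One way to implement that extraction, as a sketch: a small string-aware scanner that finds the first balanced object and parses it, returning null on failure rather than throwing.

```typescript
// Minimal sketch of the prose-stripping step. String-aware so braces
// inside JSON string values don't confuse the depth counter.
function extractFirstJsonObject(raw: string): unknown | null {
  const start = raw.indexOf("{");
  if (start === -1) return null;
  let depth = 0;
  let inString = false;
  let escaped = false;
  for (let i = start; i < raw.length; i++) {
    const ch = raw[i];
    if (inString) {
      if (escaped) escaped = false;
      else if (ch === "\\") escaped = true;
      else if (ch === '"') inString = false;
      continue;
    }
    if (ch === '"') inString = true;
    else if (ch === "{") depth++;
    else if (ch === "}") {
      depth--;
      if (depth === 0) {
        try {
          return JSON.parse(raw.slice(start, i + 1));
        } catch {
          return null; // degrade instead of crashing the route
        }
      }
    }
  }
  return null; // unbalanced output — treat as a failed extraction
}
```

Anything that survives this scanner still goes through schema validation before it reaches downstream agents.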

3. Model churn. Free model IDs disappear without ceremony.

Mitigation: Central registry (lib/llm/models.ts)—swap IDs per agent in one commit; pin versions in traces for replay.
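A registry in the spirit of lib/llm/models.ts might look like the following; the agent names and :free model slugs below are illustrative assumptions, not the actual file contents:

```typescript
// Illustrative registry shape — agent names and OpenRouter-style slugs
// are assumptions, not the real lib/llm/models.ts.
type AgentName = "triage" | "extraction" | "memoMerge" | "documentVision";

const MODEL_REGISTRY: Record<AgentName, string> = {
  triage: "meta-llama/llama-3.3-70b-instruct:free",
  extraction: "deepseek/deepseek-chat:free",
  memoMerge: "qwen/qwen-2.5-72b-instruct:free",
  documentVision: "google/gemini-2.0-flash-exp:free",
};

function modelFor(agent: AgentName): string {
  return MODEL_REGISTRY[agent];
}
```

When a free ID is retired upstream, the fix is a one-line change in this map; traces pin the ID that actually ran, so old runs stay replayable against the model they used.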

4. Quality variance. Specialist extraction F-scores shift when you switch model families.

Mitigation: Golden-set regression per agent; promote a model only when memo completeness and citation density both hold.

5. No native tool-schema enforcement. Unlike hosted tool-use APIs, free routes rely on prompt contracts.

Mitigation: Strict JSON schema prompts + validator gates + partial refunds on failed specialist nodes where billing fairness matters.
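A validator gate for one hypothetical specialist output might look like this; the field names are invented for illustration, and production schemas would be richer:

```typescript
// Hedged sketch of a prompt-contract validator gate. With no native
// tool-schema enforcement on free routes, every specialist's output is
// checked field-by-field before it can flow downstream (or be billed).
interface SpecialistOutput {
  riskSummary: string;
  citations: string[];
}

type Gate<T> = { ok: true; value: T } | { ok: false; reason: string };

function validateSpecialistOutput(candidate: unknown): Gate<SpecialistOutput> {
  if (typeof candidate !== "object" || candidate === null) {
    return { ok: false, reason: "not an object" };
  }
  const c = candidate as Record<string, unknown>;
  if (typeof c.riskSummary !== "string" || c.riskSummary.length === 0) {
    return { ok: false, reason: "missing riskSummary" };
  }
  if (
    !Array.isArray(c.citations) ||
    !c.citations.every((x) => typeof x === "string")
  ) {
    return { ok: false, reason: "invalid citations" };
  }
  return {
    ok: true,
    value: { riskSummary: c.riskSummary, citations: c.citations as string[] },
  };
}
```

A failed gate is what triggers the partial credit refund for that specialist node, so billing fairness falls out of the same check that protects data quality.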

Use case 1 — Specialty MGA scaling burst submissions post-marketing campaign

Scenario: Thirty-day inbound spike doubles concurrent pipelines; finance insists marginal AI spend stays bounded.

Key features

  • Retry-aware streaming; optional paid fallback only on exhausted backoff buckets.

Outcomes

  • Stable P95 latency versus brittle demos collapsing under parallel fan-out.

Benefits

  • Growth marketing tolerated without emergency procurement negotiations mid-quarter.

Use case 2 — Carrier innovation sandbox benchmarking Opus vs free roster honestly

Scenario: Architecture committee demands apples-to-apples ROI proof before mandating premium models globally.

Key features

  • Trace exports comparing hallucination rates, JSON validity, and wall-clock per golden submission.

Outcomes

  • Evidence-backed decision to allocate premium spend only on synthesis steps genuinely gaining measurable lift.

Benefits

  • Avoids a blanket Opus tax that unintentionally starves smaller delegated-authority partners.

Use case 3 — Finance controlling AI run-rate with departmental chargebacks

Scenario: Underwriting, claims analytics, and marketing pilots share one AI gateway—finance fears opaque overrun.

Key features

  • Per-agent credit ledger translating engineering topology into invoice lines that commercial stakeholders can read.

Outcomes

  • Forecast variance narrows; spikes are attributable to campaigns or renewal cron jobs, not mysterious usage blobs.

Benefits

  • A sustainable governance culture that funds continued automation waves responsibly.

Pricing narrative tying engineering to buyer maths

Illustrative comparison only—your tokens vary:

  • Premium-only stacks might approach $0.30–$0.50 of raw inference per deep pipeline run when chained conservatively.
  • Optimised free-first stacks approach $0 marginal inference when retries succeed, shifting gross margin toward servicing, support, and fabric integrations instead of hidden GPU subsidies.

Customers reward transparent metering aligned with submission throughput—not mystery subsidies collapsing at renewal.

Closing

Pick models like you pick peril datasets: deliberately, per workflow, with receipts. Free routers unlock economics; engineering discipline unlocks trust.

Tags: engineering, llm, openrouter