Billing and Credits

Free trial, request-based credit costs, wallet top-ups, auto top-up, and BYOK billing bypass.

10 min · Updated 2026-02-15

Quick Start
  1. Set top-up and minimum credit policy.
  2. Enable per-user and per-key rate limits.
  3. Test 429 + retry behavior in staging.
  4. Check in Usage that charged credits match the expected per-request costs.
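
Step 3 can be rehearsed with a small retry wrapper. The following is a minimal sketch, not LLMWise client code: `send` is a hypothetical callable that performs one request and returns a status code and body, and the attempt count and backoff delays are illustrative assumptions.

```python
import time
from typing import Callable, Tuple


def call_with_retry(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Retry a request while it returns HTTP 429, using exponential backoff.

    `send` is a hypothetical callable returning (status_code, body).
    Any non-429 response is returned immediately.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    status, body = send()
    for attempt in range(1, max_attempts):
        if status != 429:
            break
        # Wait base_delay, then 2x, then 4x, ... before the next attempt.
        sleep(base_delay * 2 ** (attempt - 1))
        status, body = send()
    return status, body
```

Passing a fake `send` and a recording `sleep` in staging tests makes it possible to assert on the retry schedule without real waiting.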

Billing principle

Users are billed in credits, not raw provider token costs. One dollar buys 100 credits.

  • Mode-level default charge is fixed per request (reserved upfront)
  • After the request completes, a settlement step reconciles actual execution usage
  • Wallet balance is shown in /credits
  • Paid credits never expire

Free trial

Every new account receives 20 free credits on signup. Free credits never expire — use them at your own pace. Purchase additional credit packs anytime to add more credits to your wallet.

Free-tier access policy

Free-tier accounts can use models marked is_free=true from the open catalog.
Each request still has a minimum 1-credit charge (unless BYOK).
Paid-model access unlocks after your first top-up (or via BYOK for direct provider billing).
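
The access rules above can be summarized as a pair of predicates. This is a sketch of the policy as described here, not the actual server-side check; the function and parameter names are hypothetical.

```python
def can_use_model(model_is_free: bool, has_topped_up: bool, uses_byok: bool) -> bool:
    """Free models are always usable; paid models need a top-up or BYOK."""
    return model_is_free or has_topped_up or uses_byok


def minimum_charge(uses_byok: bool) -> int:
    """Every request costs at least 1 credit unless it is billed via BYOK."""
    return 0 if uses_byok else 1
```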

Default charges

Mode      Default charge   Typical use
Chat      1 credit         Single-model or auto chat
Compare   2 credits        Parallel model comparison
Blend     4 credits        Synthesis workflow
Judge     5 credits        Contest + judge scoring
Mesh      1 credit         Failover routing
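
To estimate how many credits a batch of requests will reserve upfront, the default charges can be kept in a lookup table. This is a sketch for budgeting only: the lowercase mode keys and the function name are assumptions, and settlement may later adjust the final amount.

```python
from typing import Dict

# Default per-request charges from the table above, in credits.
DEFAULT_CHARGES: Dict[str, int] = {
    "chat": 1,
    "compare": 2,
    "blend": 4,
    "judge": 5,
    "mesh": 1,
}


def reserved_credits(requests_by_mode: Dict[str, int]) -> int:
    """Sum the default charges reserved for the given request counts."""
    total = 0
    for mode, count in requests_by_mode.items():
        if mode not in DEFAULT_CHARGES:
            raise ValueError(f"unknown mode: {mode}")
        total += DEFAULT_CHARGES[mode] * count
    return total
```

For example, `reserved_credits({"chat": 10, "judge": 2})` adds 10 credits for chat and 10 for judge.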

How settlement works

Credits are reserved before the request starts, then settled after execution:

Reserve → Execute → Settle

  1. Reserve: deduct default credits upfront
  2. Execute: send request to model provider
  3. Settle: compare actual cost to reserved amount
  4. Adjust: charge more or refund the difference

If actual usage exceeds the reserved credits, the difference is charged. If usage is lower, unused credits are refunded. All adjustments appear as separate transactions in your history.
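
The reserve-and-settle behavior can be modeled with a small in-memory wallet. This is an illustrative sketch only: the class, the transaction labels, and the idea that actual usage arrives as a whole number of credits are assumptions, and the real ledger may handle an insufficient balance at settlement differently.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Wallet:
    balance: int
    history: List[Tuple[str, int]] = field(default_factory=list)

    def _record(self, kind: str, delta: int) -> None:
        # Each adjustment is stored as its own transaction.
        self.balance += delta
        self.history.append((kind, delta))

    def reserve(self, credits: int) -> None:
        """Deduct the default charge before the request starts."""
        if credits > self.balance:
            raise ValueError("insufficient credits")
        self._record("reserve", -credits)

    def settle(self, reserved: int, actual: int) -> None:
        """Charge the shortfall or refund the unused part after execution."""
        diff = actual - reserved
        if diff > 0:
            self._record("settlement_charge", -diff)
        elif diff < 0:
            self._record("settlement_refund", -diff)
```

A Blend request that reserves 4 credits and ends up using 6 would produce a `reserve` entry of -4 followed by a `settlement_charge` entry of -2.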

Top-up flow

Minimum top-up is $3. Maximum single top-up is $10,000.
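
A client can validate a top-up amount before creating a checkout. The sketch below assumes the limits are inclusive and that the 100-credits-per-dollar rate applies uniformly to any amount; the function name is hypothetical and fractional credits are truncated for simplicity.

```python
from decimal import Decimal

MIN_TOP_UP_USD = Decimal("3")
MAX_TOP_UP_USD = Decimal("10000")
CREDITS_PER_USD = 100


def credits_for_top_up(amount_usd) -> int:
    """Validate a top-up amount in USD and convert it to credits."""
    amount = Decimal(str(amount_usd))
    if not (MIN_TOP_UP_USD <= amount <= MAX_TOP_UP_USD):
        raise ValueError("top-up must be between $3 and $10,000")
    return int(amount * CREDITS_PER_USD)
```

Under these assumptions, the minimum $3 top-up yields 300 credits.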

Checkout to wallet crediting

  1. Create checkout: POST /api/v1/credits/purchase
  2. Pay in Stripe: customer completes checkout
  3. Webhook settle: POST /api/webhooks/stripe
  4. Refresh balance: GET /api/v1/credits/balance
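
From the client side, only the checkout and balance calls are issued directly; the webhook is sent by Stripe. The sketch below builds those two requests with the standard library. The endpoint paths come from the flow above, but the host, the `amount_usd` body field, and the Bearer authorization header are assumptions to be replaced with the values from the API reference.

```python
import json
import urllib.request

BASE_URL = "https://llmwise.example"  # placeholder host (assumption)


def build_purchase_request(api_key: str, amount_usd: int) -> urllib.request.Request:
    """Build the checkout request; the body schema is hypothetical."""
    body = json.dumps({"amount_usd": amount_usd}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + "/api/v1/credits/purchase",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # auth scheme is an assumption
            "Content-Type": "application/json",
        },
    )


def build_balance_request(api_key: str) -> urllib.request.Request:
    """Build the balance request issued after the webhook has settled."""
    return urllib.request.Request(
        BASE_URL + "/api/v1/credits/balance",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )
```

Each request can then be sent with `urllib.request.urlopen(request)` once the placeholder values have been replaced.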

Auto top-up

Enable automatic refills so requests do not fail because of a low balance, up to your monthly spending cap:

  1. Complete one Stripe checkout to save a payment method
  2. Enable auto top-up in /settings and set your preferred amount
  3. Set a balance threshold — when credits drop below it, a top-up is triggered
  4. Set a monthly spending cap to control costs

Auto top-ups are processed as off-session Stripe PaymentIntents using your saved payment method. Monthly spending is tracked and capped to prevent runaway charges.
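
The trigger logic can be pictured as a single decision function. This is a sketch under assumptions: the names are hypothetical, and skipping a top-up that would exceed the cap (rather than charging a partial amount) is a guess at the behavior, not a documented rule.

```python
def auto_top_up_amount(
    balance: int,
    threshold: int,
    top_up_usd: int,
    spent_this_month_usd: int,
    monthly_cap_usd: int,
) -> int:
    """Return the USD amount to charge, or 0 if no top-up should run."""
    if balance >= threshold:
        return 0  # balance has not dropped below the threshold
    if spent_this_month_usd + top_up_usd > monthly_cap_usd:
        return 0  # assumed: a top-up that would exceed the cap is skipped
    return top_up_usd
```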

BYOK (Bring Your Own Key)

When a BYOK provider key is configured, requests route directly to the provider using your key. BYOK requests skip credit charges entirely — you pay the provider directly. This is useful when customer contracts require provider-direct billing.

Open catalog models

Some models in the catalog are marked is_free=true (provider-side free tier).
LLMWise still bills these requests at the normal minimum charge of 1 credit per request (unless BYOK).

Purpose of open catalog models

Provider-free models are best used for:

  1. Prompt and UX prototyping before spending paid credits
  2. Fallback paths for non-critical traffic during provider spikes
  3. A/B checks against paid models so you only pay where quality difference matters

Catalog updates are synced from OpenRouter, so available is_free=true models can change over time.

You can always fetch the current live list from:

GET /api/v1/models

Filter rows where is_free=true.
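
The filter step might look like the following sketch. It assumes the response decodes to a list of objects that each carry an `is_free` boolean; the `id` field used in the usage example is hypothetical.

```python
from typing import Any, Dict, List


def free_models(rows: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep only catalog rows explicitly marked is_free=true."""
    return [row for row in rows if row.get("is_free") is True]


# Usage with a hand-written sample payload (not real catalog data).
sample = [
    {"id": "model-a", "is_free": True},
    {"id": "model-b", "is_free": False},
    {"id": "model-c"},
]
free_ids = [row["id"] for row in free_models(sample)]
```

Rows without an `is_free` field are treated as paid, which is the safer default when the catalog changes.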
