Billing

ModelSwitch uses pay-as-you-go billing. You are charged only for the tokens you use — there are no monthly minimums, no seat fees, and no subscription required to access the API.

How costs are calculated

Every request consumes prompt tokens (your input) and completion tokens (the model’s output). These are billed separately at the rates shown for each model.

Cost = (prompt_tokens / 1,000,000) × prompt_price_per_1M
     + (completion_tokens / 1,000,000) × completion_price_per_1M

Example: a GPT-4o request with 500 prompt tokens and 200 completion tokens:

	Tokens	Price per 1M	Cost
Prompt	500	$2.50	$0.00125
Completion	200	$10.00	$0.00200
Total			$0.00325

Use GET /v1/models to fetch live prices for every model. Prices may vary as providers adjust their rates.

Markup

ModelSwitch adds a small markup over provider prices to cover infrastructure and support. The prices shown via GET /v1/models already include this markup — there are no hidden fees.

Balance

Your balance must be positive to make requests. When your balance reaches zero, requests return an error until you top up.

Build low-balance monitoring into your application or enable email notifications in your dashboard settings. A depleted balance will cause requests to fail immediately.

Your balance is stored in USD internally. It may be displayed in your local currency in the dashboard.

How billing works under the hood

ModelSwitch uses pessimistic holds to prevent overdraft, particularly under heavy concurrent usage.

Hold placed

When your request starts, an estimated cost is reserved (held) from your balance. Your available balance is reduced immediately.

Request forwarded

The request is forwarded to the upstream provider. If the request fails before completing, the hold is fully released and your balance is restored.

Hold settled

When the response completes, the hold is adjusted to match the actual token usage. If the real cost is less than the estimate, the difference is returned to your balance.

This means you will never spend more than your balance, even if multiple requests are running at the same time.

Managing your balance

Top up

Add funds to your balance via card payment

Auto top-up

Set up recurring top-ups so you never run out

Documents

Download invoices, acts, and UPD documents

API Keys

Switch models

⌘I

How costs are calculated
Markup
Balance
How billing works under the hood
Managing your balance

Get Started

Core Concepts

Guides

Billing & Payments

Dashboard

How costs are calculated

Markup

Balance

How billing works under the hood

Managing your balance

Top up

Auto top-up

Documents

Get Started

Core Concepts

Guides

Billing & Payments

Dashboard

​How costs are calculated

​Markup

​Balance

​How billing works under the hood

​Managing your balance

Top up

Auto top-up

Documents

How costs are calculated

Markup

Balance

How billing works under the hood

Managing your balance