Billing overview

ModelSwitch uses a prepaid, pay-as-you-go model. You add funds to your balance, and the cost of each request is deducted automatically. There are no monthly fees, no minimum commitments, and no charges for idle time. Prices are listed in USD. Top-ups are made in Russian rubles (RUB).

How a request is charged

Hold placed

When you send a request, ModelSwitch places a hold on your balance for the estimated cost. This prevents your balance from going negative under concurrent requests.

Request forwarded

The request is forwarded to the model and processed.

Cost settled

Once the provider returns the actual token counts, the hold is adjusted to match the real cost.

Failure refund

If the request fails at any point, the full hold is released and your balance is restored.
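The four steps above can be sketched as a small in-memory ledger. This is an illustrative model only, not ModelSwitch's implementation: the `Account` class, its method names, and the `InsufficientBalance` exception are hypothetical, and the sketch assumes a hold is rejected whenever the estimate exceeds the funds not already held.

```python
from decimal import Decimal


class InsufficientBalance(Exception):
    """Raised when a hold cannot be placed (hypothetical name)."""


class Account:
    """Toy in-memory balance with holds; not the real ledger."""

    def __init__(self, balance: Decimal) -> None:
        self.balance = balance        # total funds on the account
        self.held = Decimal("0")      # funds reserved by in-flight requests

    @property
    def available(self) -> Decimal:
        # Funds that new requests may still reserve.
        return self.balance - self.held

    def place_hold(self, estimate: Decimal) -> Decimal:
        # Hold placed: reserve the estimated cost before forwarding.
        if estimate > self.available:
            raise InsufficientBalance("hold exceeds available balance")
        self.held += estimate
        return estimate

    def settle(self, hold: Decimal, actual: Decimal) -> None:
        # Cost settled: drop the hold and charge the actual cost.
        self.held -= hold
        self.balance -= actual

    def release(self, hold: Decimal) -> None:
        # Failure refund: the full hold is returned, nothing is charged.
        self.held -= hold


account = Account(Decimal("1.00"))
hold = account.place_hold(Decimal("0.01"))
account.settle(hold, Decimal("0.00325"))
```

Because every in-flight request reserves its estimate first, two concurrent requests cannot both spend the same funds, which is the property the hold is described as providing.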

Cost formula

The cost of a request is calculated separately for prompt (input) and completion (output) tokens:

Cost = (prompt_tokens / 1,000,000) × prompt_price_per_1M
     + (completion_tokens / 1,000,000) × completion_price_per_1M

Example: GPT-4o

GPT-4o is priced at $2.50 per 1M input tokens and $10.00 per 1M output tokens. For a request with 500 prompt tokens and 200 completion tokens:

Component	Calculation	Cost
Prompt	(500 / 1,000,000) × $2.50	$0.00125
Completion	(200 / 1,000,000) × $10.00	$0.00200
Total		$0.00325
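
The formula and the worked example can be checked with a few lines of Python. This is a minimal sketch: the function name `request_cost` and its parameter names are illustrative, and `Decimal` is used here only to avoid floating-point rounding in the arithmetic.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(
    prompt_tokens: int,
    completion_tokens: int,
    prompt_price_per_1m: str,
    completion_price_per_1m: str,
) -> Decimal:
    """Return the cost in USD; prices are passed as decimal strings."""
    prompt_cost = Decimal(prompt_tokens) / TOKENS_PER_MILLION * Decimal(prompt_price_per_1m)
    completion_cost = (
        Decimal(completion_tokens) / TOKENS_PER_MILLION * Decimal(completion_price_per_1m)
    )
    return prompt_cost + completion_cost


# The GPT-4o example: 500 prompt tokens and 200 completion tokens.
total = request_cost(500, 200, "2.50", "10.00")
assert total == Decimal("0.00325")
```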

Markup

ModelSwitch adds a small markup over provider prices to cover infrastructure costs: servers, network, monitoring, and platform development. The markup is a fixed percentage already included in the prices shown in the model catalog.

No hidden fees, surcharges, or inactivity charges.
Live prices are always available via GET /v1/models.
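
As a rough sketch, a request to the price endpoint could be built as shown below. Only the `GET /v1/models` path comes from this page. The host `https://api.example.com` is a placeholder, and the `Authorization: Bearer` header is an assumption about how the API authenticates; check your dashboard for the actual base URL and key format.

```python
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder host, not the real one


def build_models_request(api_key: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a GET /v1/models request."""
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/models",
        headers={"Authorization": f"Bearer {api_key}"},  # assumed auth scheme
        method="GET",
    )
```

Passing the result to `urllib.request.urlopen` sends the request. The shape of the response body is not described on this page, so the sketch stops short of parsing it.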

When your balance runs out

If your balance reaches zero, API requests return 403 Forbidden. Top up your balance to resume.

Requests in flight when your balance is exhausted may also be rejected if the hold cannot be placed. Keep a buffer above zero for uninterrupted usage.
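
A client can surface the exhausted-balance case explicitly instead of treating it as a generic error. The sketch below is one way to do that with the standard library. `BalanceExhausted` and `send` are hypothetical names, and the code assumes that a 403 response means the balance is exhausted, although a 403 could in principle have other causes.

```python
import urllib.error
import urllib.request


class BalanceExhausted(Exception):
    """Raised when the API answers 403 Forbidden (hypothetical name)."""


def send(request: urllib.request.Request, opener=urllib.request.urlopen) -> bytes:
    """Send a request and return the body, mapping 403 to BalanceExhausted."""
    try:
        with opener(request) as response:
            return response.read()
    except urllib.error.HTTPError as err:
        if err.code == 403:
            # Assumed meaning: balance is zero or the hold could not be placed.
            raise BalanceExhausted("top up your balance to resume") from err
        raise  # any other HTTP error is passed through unchanged
```

The `opener` parameter exists only so the function can be exercised without network access; in normal use the default is enough.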

Next steps

Top up balance

Add funds by card or invoice

Auto top-up

Set up recurring top-ups

Documents

Invoices and accounting docs for legal entities
