Start free — no credit card

$50 in free credits, plus $20 for every teammate who joins.

Every new project starts with $50 in free credits, and you get $20 more for each collaborator you bring in. On an open-weight model — where the same work costs a fraction of the OLUs — that's enough for thousands of simple tasks before you pay anything.

Starter

$0.14 per OLU (introductory; standard rate $0.20)

Introductory at-cost pricing — no markup; $0.14/OLU is 1× the token cost.

An OLU is a normalized unit of tokens across token types and models.

Every OLU is all-in: the price ($0.14 introductory, $0.20 standard) covers tokens, infra, and sandbox hosting.

No minimums, pay as you go — prepaid balance, auto-pauses at zero, per-user quotas.

Enterprise

Custom pricing

Everything in Starter, plus:

Advanced security and fine-grained permissions

Data-access audit trails — see what data was used in which AI thread, by which user

Dedicated VPC or bring your own cloud (BYOC)

Private networking and VPC peering

Single Sign-On (SSO)

Bring your own LLMs

Forward-deployed engineering support

Model → OLU multiplier

Your model choice changes your mileage.

Frontier models like Claude Opus do the most per token but cost the most OLUs; lean, open-weight models (DeepSeek, Kimi K2, GLM) do the same work for a fraction of the OLUs. Switch to different models for different threads — or even mid-thread.

Open-weight models · served via Fireworks

| Model | Provider | Type | OLU multiplier (vs Opus 4.6) | Input tokens / OLU | Output tokens / OLU | Cached tokens / OLU | Cache creation tokens / OLU | Notes |
|---|---|---|---|---|---|---|---|---|
| DeepSeek V4 Flash (Official API) | DeepSeek (official) | Open-weight | 0.018x (57x cheaper) | 952,400 | 595,200 | 59.5M | 1.2M | First-party DeepSeek API pricing. |
| DeepSeek V4 Flash (Fireworks) | Fireworks | Open-weight | 0.024x (41x cheaper) | 952,400 | 595,200 | 6.0M | 1.2M | Served via Fireworks. Low-latency/volume workhorse. |
| Llama 3.3 70B | Fireworks | Open-weight | 0.036x (28x cheaper) | 493,800 | 617,300 | 6.2M | 617,300 | Meta; served via Fireworks. |
| DeepSeek V4 Pro (Official API) | DeepSeek (official) | Open-weight | 0.053x (19x cheaper) | 306,500 | 191,600 | 46.0M | 383,100 | First-party DeepSeek API pricing. |
| Qwen 3.7 Plus | Fireworks | Open-weight | 0.071x (14x cheaper) | 333,300 | 104,200 | 4.2M | 416,700 | Alibaba, vision-capable; served via Fireworks. |
| Mistral Large 3 | Fireworks | Open-weight | 0.077x (13x cheaper) | 333,300 | 83,300 | 4.2M | 416,700 | Mistral AI; served via Fireworks. |
| Kimi K2.7 | Fireworks | Open-weight | 0.17x (5.8x cheaper) | 140,400 | 41,700 | 1.8M | 175,400 | Moonshot AI 1T-param MoE; served via Fireworks. |
| GLM-5.2 | Fireworks | Open-weight | 0.23x (4.3x cheaper) | 95,200 | 37,900 | 1.2M | 119,000 | Z.ai (Zhipu), MIT-licensed; served via Fireworks. |
| DeepSeek V4 Pro (Fireworks) | Fireworks | Open-weight | 0.25x (4x cheaper) | 76,600 | 47,900 | 1.1M | 95,800 | Served via Fireworks (what PromptQL uses for open-weight). |

Proprietary models

| Model | Provider | Type | OLU multiplier (vs Opus 4.6) | Input tokens / OLU | Output tokens / OLU | Cached tokens / OLU | Cache creation tokens / OLU | Notes |
|---|---|---|---|---|---|---|---|---|
| Claude Haiku 4.5 | Anthropic | Proprietary | 0.2x (5x cheaper) | 133,300 | 33,300 | 1.7M | 133,300 | Fast tier. |
| GPT-5 | OpenAI | Proprietary | 0.3x (3.4x cheaper) | 106,700 | 16,700 | 1.3M | 133,300 | GPT-5 base. |
| GPT-5.1 | OpenAI | Proprietary | 0.3x (3.4x cheaper) | 106,700 | 16,700 | 1.3M | 133,300 | Same rate as GPT-5. |
| Gemini 3.1 Pro | Google | Proprietary | 0.42x (2.4x cheaper) | 66,700 | 13,900 | 833,300 | 83,300 | Google flagship Pro. |
| GPT-5.2 | OpenAI | Proprietary | 0.42x (2.4x cheaper) | 76,200 | 11,900 | 952,400 | 95,200 | Flagship step-up. |
| GPT-5.4 | OpenAI | Proprietary | 0.52x (1.9x cheaper) | 53,300 | 11,100 | 666,700 | 66,700 | Reasoning/coding/agentic gains. |
| Claude Sonnet 4.5 | Anthropic | Proprietary | 0.6x (1.7x cheaper) | 44,400 | 11,100 | 555,600 | 44,400 | Balanced tier. (Sonnet 4.6 is blocklisted in PromptQL.) |
| Claude Opus 4.6 | Anthropic | Proprietary | 1.0x (baseline) | 26,700 | 6,700 | 333,300 | 26,700 | Anchor (1.0x). |
| Claude Opus 4.8 | Anthropic | Proprietary | 1.0x (baseline) | 26,700 | 6,700 | 333,300 | 26,700 | Current GA Opus; PromptQL Powerful tier. Same rate as 4.6/4.7. |
| GPT-5.5 | OpenAI | Proprietary | 1.0x (baseline) | 26,700 | 5,600 | 333,300 | 33,300 | Latest flagship. >272K-context billed 2x in / 1.5x out. |
| Claude Fable 5 | Anthropic | Proprietary | 2x (2x cost) | 13,300 | 3,300 | 166,700 | 13,300 | New frontier flagship (Jun 2026); most capable, highest cost. |
| GPT-5 Pro | OpenAI | Proprietary | 7.22x (7.2x cost) | 8,900 | 1,400 | 11,100 | 11,100 | Pro reasoning. No prompt-cache discount. |
| GPT-5.2 Pro | OpenAI | Proprietary | 10.11x (10.1x cost) | 6,300 | 992 | 7,900 | 7,900 | Pro reasoning. No prompt-cache discount. |
| GPT-5.4 Pro | OpenAI | Proprietary | 13.53x (13.5x cost) | 4,400 | 926 | 5,600 | 5,600 | Pro reasoning. No prompt-cache discount. |
| GPT-5.5 Pro | OpenAI | Proprietary | 13.53x (13.5x cost) | 4,400 | 926 | 5,600 | 5,600 | Top GPT pro tier. No prompt-cache discount. |

Anchor: Claude Opus 4.6 = 1.0×. Lower multiplier = cheaper, i.e. more tokens of work per OLU (per $0.20). PromptQL serves open-weight models via Fireworks (DeepSeek is shown for both its first-party Official API and the Fireworks-hosted rate PromptQL bills on). PromptQL gives you access to every model available, and the lineup expands over time — pick your model per thread. The GPT *-pro tiers run higher mainly because they have no prompt-cache discount. Token figures are representative; your exact mileage depends on the task.
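
To make the tokens-per-OLU columns concrete, here is a minimal sketch of how a thread's token counts might translate into OLUs. It assumes each token type contributes its token count divided by that type's tokens-per-OLU figure, and that the contributions simply add up. That additive rule, and the names `TOKENS_PER_OLU` and `estimate_olus`, are illustrative assumptions rather than PromptQL's documented billing formula. The rates are the representative Claude Opus 4.6 and GLM-5.2 figures from the table.

```python
# Representative tokens-per-OLU figures copied from the table on this page.
TOKENS_PER_OLU = {
    "Claude Opus 4.6": {
        "input": 26_700, "output": 6_700,
        "cached": 333_300, "cache_creation": 26_700,
    },
    "GLM-5.2": {
        "input": 95_200, "output": 37_900,
        "cached": 1_200_000, "cache_creation": 119_000,
    },
}


def estimate_olus(model, tokens):
    """Estimate OLUs for a token mix.

    Assumption (not confirmed by the page): each token type costs
    tokens / tokens-per-OLU, and the per-type costs are summed.
    """
    rates = TOKENS_PER_OLU[model]
    return sum(count / rates[kind] for kind, count in tokens.items())


# Hypothetical thread: 100k input tokens and 10k output tokens.
usage = {"input": 100_000, "output": 10_000}
for model in TOKENS_PER_OLU:
    print(f"{model}: {estimate_olus(model, usage):.2f} OLUs")
```

Under these assumptions the same token mix costs roughly a quarter as many OLUs on GLM-5.2 as on Claude Opus 4.6, in line with the 0.23x multiplier in the table.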

Frequently asked questions

An OLU (Operational Language Unit) is just a normalized unit of tokens — it rolls up the different token types (e.g. input, output) and different models (e.g. Opus, GPT, GLM) into one consistent unit, so your bill doesn’t change shape every time you switch models. More complex tasks use more OLUs; simpler ones use fewer. You can see exact OLU consumption on every thread (with a per-step breakdown) and in your usage dashboard.
An OLU costs $0.14 at the introductory rate, which is our at-cost price (1× the underlying token cost, no markup) for a limited time. The standard rate is $0.20 per OLU (~1.4× token cost). Either way it’s all-in: the price covers the model tokens plus all the infrastructure, sandbox hosting, and orchestration that turn a raw model into a working agent. No separate platform fee, no per-seat license, no hidden infra bill.
The per-OLU price is the same for every model, but the model you pick changes how many OLUs a task consumes. Open-weight models like DeepSeek, Kimi K2, and GLM do the same work for a fraction of the OLUs: in the table above they range from about 4× cheaper (GLM-5.2) to more than 40× cheaper (DeepSeek V4 Flash) than Claude Opus 4.6.
See the model → OLU table above for a representative comparison. You can switch models per thread, and the list keeps growing.
How many OLUs a task uses depends on the complexity of your query and your business domain. At the $0.14 intro rate, typical usage is:
  • < 2 OLUs for simple data tasks (<$0.28)
  • ~10 OLUs for a complex report (~$1.40)
  • ~40 OLUs for a deep investigation (~$5.60)
You can see OLU consumption in real time as a thread runs, including a per-step breakdown, plus a full summary in your usage dashboard.
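
The dollar figures in the list above are simply OLUs multiplied by the per-OLU price. A minimal sketch of that arithmetic, using the two rates quoted on this page (the function name `task_cost` is hypothetical):

```python
INTRO_RATE = 0.14     # USD per OLU, introductory rate quoted on this page
STANDARD_RATE = 0.20  # USD per OLU, standard rate quoted on this page


def task_cost(olus, rate=INTRO_RATE):
    """Dollar cost of a task: OLUs consumed times the per-OLU price."""
    return round(olus * rate, 2)


# Typical OLU counts from the list above.
examples = [("simple data task", 2), ("complex report", 10), ("deep investigation", 40)]
for label, olus in examples:
    intro = task_cost(olus)
    standard = task_cost(olus, STANDARD_RATE)
    print(f"{label}: ${intro:.2f} intro, ${standard:.2f} standard")
```

At the standard rate the same three tasks would come to about $0.40, $2.00, and $8.00.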
Every new account starts with free credits, and no credit card is required to get started. Credits are managed at the project level:
  • $50 when you create your project.
  • $20 each time a collaborator joins.
Credits apply to your first created project and each collaborator’s first joined project. PromptQL is built for teams — its value compounds with collaboration, so explore and evaluate together from day one.
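
As a quick illustration of how the credits add up for a team, the sketch below totals the project and collaborator credits and converts them to OLUs at the introductory rate. The function names are hypothetical, and the code assumes every collaborator is joining their first project, since that is when the $20 credit applies.

```python
PROJECT_CREDIT = 50        # USD, granted when you create your first project
COLLABORATOR_CREDIT = 20   # USD, granted per collaborator joining their first project
INTRO_RATE = 0.14          # USD per OLU, introductory rate


def free_credits(collaborators):
    """Total free credits in USD for a project with this many collaborators."""
    return PROJECT_CREDIT + COLLABORATOR_CREDIT * collaborators


def credits_in_olus(collaborators, rate=INTRO_RATE):
    """How many OLUs the free credits buy at the given per-OLU rate."""
    return free_credits(collaborators) / rate


for team_size in (0, 3):
    dollars = free_credits(team_size)
    olus = credits_in_olus(team_size)
    print(f"{team_size} collaborators: ${dollars} in credits, about {olus:.0f} OLUs")
```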
Add a payment method and switch to prepaid billing with optional auto top-up. Work auto-pauses if your balance hits zero until you top up again — no surprise bills. You control your spending limits and top-up thresholds.
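
The prepaid behavior described above can be modeled in a few lines. This is a toy sketch, not PromptQL's billing implementation: the `PrepaidAccount` class, its field names, and the choice to clamp the balance at zero are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PrepaidAccount:
    """Toy model of a prepaid balance with auto-pause and optional auto top-up."""
    balance: float                             # USD remaining
    auto_top_up_below: Optional[float] = None  # top-up threshold in USD, if enabled
    top_up_amount: float = 0.0                 # USD added by each auto top-up
    paused: bool = False

    def top_up(self, amount):
        """Add funds; work resumes once the balance is positive again."""
        self.balance += amount
        if self.balance > 0:
            self.paused = False

    def charge(self, olus, rate=0.14):
        """Bill a task. Returns False if the account is paused and nothing ran."""
        if self.paused:
            return False
        # Assumption: the balance never goes negative (no surprise bills).
        self.balance = max(0.0, self.balance - olus * rate)
        if self.auto_top_up_below is not None and self.balance < self.auto_top_up_below:
            self.top_up(self.top_up_amount)
        if self.balance <= 0:
            self.paused = True
        return True


account = PrepaidAccount(balance=1.00)
account.charge(5)         # $0.70 billed, about $0.30 left
account.charge(5)         # balance hits zero, account pauses
print(account.paused)     # True
account.top_up(10)        # manual top-up resumes work
print(account.paused)     # False
```

With `auto_top_up_below` set, the account refills itself before reaching zero, so work is never paused as long as the top-up amount is positive.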
You can set per-user quotas and spending alerts in your billing dashboard. When you reach your limit, queries pause until you raise the cap. Enterprise plans add per-team and per-project budget controls.