Overview

Cost estimates in Freeplay give you a clear view of how you are spending your API tokens. Freeplay breaks down costs by project, prompt template, and evaluation so you can trace your spend and identify where costs are highest. Freeplay also provides rough estimates of evaluation costs before you run them, helping you make informed decisions about your testing and evaluation strategy.

Where to find cost estimates

You can view cost information in two places within Freeplay.

Account Settings → Usage

The Usage tab in Account Settings is the primary source of truth for costs and cost estimates. Navigate to Account Settings and select the Usage tab. This page provides two key metrics:
  • App spend - estimated spend based on the token counts your application logs to Freeplay. This reflects the cost of your application’s LLM calls, not money spent in or through Freeplay itself.
  • Spend via Freeplay - the total spend from activity within the Freeplay platform. This includes:
    • Online evaluations and auto-categories
    • Test runs
    • Prompt playground usage
    • AI features

Estimated evaluation costs

On the evaluations page, each evaluation displays an estimated cost. This estimate gives you a rough idea of how much it costs to run that evaluation over your completions. The estimate is calculated as follows:
  1. Takes the average input and output tokens for the specific prompt template or agent (including input variables) from the last week
  2. Calculates the average cost based on those token counts
This is a rough estimate and is subject to change based on volume, sampling rate, and other factors.
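The two steps above amount to averaging recent token counts and pricing them out. A minimal sketch, assuming hypothetical per-token rates and a simple list of logged completions (the function name and rates are illustrative, not Freeplay's actual implementation):

```python
from statistics import mean

def estimate_eval_cost(completions, input_rate, output_rate):
    """Estimate the average cost of evaluating one completion.

    completions: (input_tokens, output_tokens) pairs logged over the
    last week for a given prompt template or agent.
    input_rate / output_rate: hypothetical provider cost per token (USD).
    """
    avg_in = mean(c[0] for c in completions)   # step 1: average input tokens
    avg_out = mean(c[1] for c in completions)  # step 1: average output tokens
    # Step 2: price the averages; multiply by expected volume to project spend.
    return avg_in * input_rate + avg_out * output_rate

# Example with made-up rates of $2.50 / $10.00 per million tokens:
completions = [(1000, 200), (1400, 400)]
cost = estimate_eval_cost(completions, 2.5e-6, 10e-6)
```

Because the averages come from only the last week of traffic, the estimate shifts as your volume and prompt lengths change, which is why it should be read as a rough guide.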

How costs are calculated

Freeplay bases all cost estimates on token usage. Token costs are calculated using up-to-date pricing information from each provider. Cost estimates may vary if you:
  • Use different LLM providers
  • Have custom deployment configurations (e.g., Azure OpenAI, AWS Bedrock)
  • Have negotiated pricing agreements with providers

Custom per-token costs

For certain providers such as LiteLLM, you can provide custom input and output per-token cost information. This allows Freeplay to reflect your actual costs more accurately when you have special pricing or use self-hosted models. To configure custom token costs, see LiteLLM Proxy.

Frequently asked questions

How can I reduce costs?
There are several common ways to lower spend:
  • Model selection — switching to a smaller or less expensive model for tasks that do not require the most capable model
  • Prompt optimization — reducing token counts by writing more concise prompts
  • Sampling rate — adjusting the sampling rate for online evaluations to run them on a subset of completions rather than all of them
  • Evaluation frequency — running test evaluations less frequently or on smaller datasets during development

How are per-token prices determined?
Freeplay uses publicly available pricing from each LLM provider to estimate the cost per input and output token. These rates are updated regularly to reflect the latest published pricing.

What if my actual costs differ from the published rates?
If you use self-hosted models, custom deployments, or have negotiated pricing with providers, the default per-token costs may not reflect your actual spend. For providers that support it (such as LiteLLM), you can configure custom per-token costs in Freeplay to get more accurate estimates. For other providers, treat the estimates as a relative benchmark for comparing costs across prompts and models.