Filters and automations in Freeplay work together to help you monitor, curate, and take action on your production data automatically. Filters let you identify specific completions based on any criteria, while automations perform actions on those filtered results without manual intervention.
Filtering Your Data
Filters in Freeplay allow you to search and narrow down your production data based on virtually any field recorded to Freeplay. Filtering is essential for identifying specific completions or patterns in your system, whether you’re investigating edge cases, building datasets, or analyzing system performance. You can create filters directly in the Observability dashboard by selecting criteria such as:- Input and output values
- Evaluation results and scores
- Custom metadata (user IDs, session types, versions)
- Prompt templates/versions
- Metrics (cost, latency, token usage)
Automations
Automations take action on filtered data. Once configured, automations run in the background — continuously monitoring your production traffic and executing actions when completions match your filter criteria. This allows you to build powerful workflows that ensure the right data gets reviewed, tested, and acted upon without manual effort. Freeplay supports four types of automations:Add to Review Queue
Automatically route filtered completions to a review queue for evaluation.
Add to Review Queue
Automatically route filtered completions to a review queue for evaluation.
Rather than manually searching for problematic completions, automations ensure they’re surfaced to your team automatically. Configure which review queue to use, assign team members as reviewers (completions are automatically distributed), and set your sampling frequency, limits, and strategy (random or most recent). This will start adding completions to the review queue in the background for you!Example use case: Automatically add guardrailed responses to a review queue so your team can evaluate why guardrails were triggered and identify patterns in edge cases.

Add to Dataset
Automatically grow your test datasets from production data.
Add to Dataset
Automatically grow your test datasets from production data.
This is particularly useful for building datasets from completions that have been reviewed, validated, or scored highly in production. Select which prompt template and dataset to target, and completions will be added to this dataset. This ensures your test coverage grows organically as your system encounters new scenarios in production.Example use case: Automatically add reviewed completions with high eval scores to a golden dataset for regression testing and prompt optimization.

Run Evaluations
Automatically execute evaluations on filtered completions.
Run Evaluations
Automatically execute evaluations on filtered completions.
This allows you to run specific evals only on relevant subsets of your data, saving costs and focusing evaluation effort on what matters most.Choose which evaluation criteria to run and set your sampling frequency and limits. This is especially useful for running detailed or expensive evals only on completions that meet certain conditions.Example use case: Run detailed evals only on completions that already passed basic checks, or run specialized evals on specific use cases to understand nuanced quality metrics.

Get Notified
Receive alerts when filtered conditions are met.
Get Notified
Receive alerts when filtered conditions are met.
Stay informed about important patterns or issues in your production data without constantly monitoring the dashboard.Configure your notification channel (Slack, email, or other integrations), sampling frequency, and threshold conditions for when notifications should trigger.Example use case: Get notified when a certain number of completions fail a critical evaluation within a time period, or when guardrails are triggered more frequently than expected.

Creating an Automation
To create an automation, start by defining or selecting a filter in the Observability dashboard. The filter determines which completions your automation will act on. Once you’ve applied your filter:- Click the “Add automation” button in the filter interface
- Give your automation a descriptive name that clearly indicates its purpose (e.g., “Add to Guardrail Review” or “Add to Router Golden Dataset”)
- Select the automation type (Review Queue, Dataset, Run Evals, or Notify)
- Configure the specific options for your automation type:
- For review queues: select the queue and assignees
- For datasets: choose the target prompt and dataset
- For evals: select the target prompt and which evaluations to run
- For notifications: configure the channel and thresholds
- Set your sampling frequency (hourly, daily, or weekly)
- Set the limit for maximum completions per sampling period
- Choose your sampling strategy (random or most recent)
- Click “Save”
Managing Automations


