Agentic Testing for AI Agents

Run in Claude Code

$ /plugin install fluxloop@fluxloop

agent.ts

1 import { Agent } from '@fluxloop/sdk';

3 @fluxloop.agent({

4 name: 'booking-assistant',

5 model: 'gpt-4-turbo'

6 })

Story-Driven Casting

Don't just pick random personas. FluxLoop generates test stories (edge cases, failure modes) and casts the perfect personas to act them out.

No more random testing. Strategic casting for every scenario.

Book a flight to Tokyo

Verbose I would strictly like to book a flight...

Edge Flight to NRT, pay with crypto?

Persona As a senior citizen, I need...

Automated Simulation

Run hundreds of simulations in parallel.

Watch how your agent handles frustrated users, technical errors, and complex requests. Simulate realistic user behaviors at scale.

No CSV files No manual setup

Performance Regression

Duration: 5.2s (Threshold: 2s)

Recommended Action

Add index to `flights` table

Quantitative Evaluation

Get alignment scores, success rates, and actionable insights.

✗ Response time exceeded 5s

→ Optimize database query in booking flow

Move beyond "vibes" to data-driven agent quality.

Seamless Integration

Works with Claude Code, CLI, and Web.

Test locally or in the cloud without changing your stack.

agent.ts

1 import { Agent } from '@fluxloop/sdk' ;

3 @fluxloop.agent ({

4 name: 'booking-assistant' ,

5 model: 'gpt-4-turbo'

6 })

7 export class BookingAgent {

Book a flight to Tokyo

Verbose I would strictly like to book a flight immediately...

Edge Flight to NRT, pay with crypto?

Persona As a senior citizen, I need assistance...

Performance Regression

Duration: 5.2s (Threshold: 2s)

Recommended Action

Add index to `flights` table on `destination_id`

Get Started with FluxLoop.

From installation to first insights in under 5 minutes.
Simple integration. Instant simulation. Everything in your workflow.

FLUXLOOP PLUGIN

AVAILABLE

Install Plugin

Run /plugin install in Claude Code.
One-click setup. Ready in seconds.

Project Folder /src/agents

API Key

Python Env Scanning...

Fluxloop CLI Checking...

Fluxloop MCP Checking...

Project Setup

Auto-detects project folder, API key, and CLI.
The Plugin handles the configuration for you.

Playground

Generate test inputs from one sentence.
Run experiments via Plugin or CLI.

Generate New Inputs...

INPUT

verbose

error_prone

typo

GENERATED INPUTS

3/3

"flight time?"

"check flight status"

"when does flight UA123 land"

report.html

Success

Marginal

Failed

Review

AssertionError: PII Leak Detected

Agent output included plain-text credit card pattern in response body.

Critical Auto-blocked

Insights

Get comprehensive evaluation reports.
See what broke, why it failed, and how to fix it.

Beyond Scores with FluxLoop

Sample Report Desktop only

Sample Report

Join the waitlist!

Test AI agents in your browser. No setup, no code.
Connect Git and start evaluating in minutes.

fluxloop.web

Coming Soon

FluxLoop in Your Browser

Searching your agent...

agent.py

tool.py

memory.json

Wanna start with this?

Persona Developer

Input "Book a flight..."

Evaluation Results PASS

98%

Success

Fail

Book flight to Tokyo

Cancel reservation

Name

Position

How did you hear about us?

What AI agent would you like to test?