New FluxLoop Web Console Available Now

Agentic Testing for AI Agents

Run in Claude Code
$ /plugin install fluxloop@fluxloop
agent.ts
1 import { Agent } from '@fluxloop/sdk';
2
3 @fluxloop.agent({
4 name: 'booking-assistant',
5 model: 'gpt-4-turbo'
6 })
01

Story-Driven Casting

Don't just pick random personas. FluxLoop generates test stories (edge cases, failure modes) and casts the perfect personas to act them out.

No more random testing. Strategic casting for every scenario.

Book a flight to Tokyo
Verbose I would strictly like to book a flight...
Edge Flight to NRT, pay with crypto?
Persona As a senior citizen, I need...
02

Automated Simulation

Run hundreds of simulations in parallel.

Watch how your agent handles frustrated users, technical errors, and complex requests. Simulate realistic user behaviors at scale.

No CSV files No manual setup
Performance Regression
Duration: 5.2s (Threshold: 2s)
Recommended Action
Add index to `flights` table
03

Quantitative Evaluation

Get alignment scores, success rates, and actionable insights.

✗ Response time exceeded 5s
→ Optimize database query in booking flow

Move beyond "vibes" to data-driven agent quality.

04

Seamless Integration

Works with Claude Code, CLI, and Web.

Test locally or in the cloud without changing your stack.

Get Started with FluxLoop.

From installation to first insights in under 5 minutes.
Simple integration. Instant simulation. Everything in your workflow.

Install Plugin

Run /plugin install in Claude Code.
One-click setup. Ready in seconds.

Project Folder /src/agents
API Key
Python Env Scanning...
Fluxloop CLI Checking...
Fluxloop MCP Checking...

Project Setup

Auto-detects project folder, API key, and CLI.
The Plugin handles the configuration for you.

Playground

Generate test inputs from one sentence.
Run experiments via Plugin or CLI.

Generate New Inputs...
INPUT
verbose
error_prone
typo
GENERATED INPUTS
3/3
"flight time?"
"check flight status"
"when does flight UA123 land"
report.html
0
Success
0
Marginal
0
Failed
0
Review
AssertionError: PII Leak Detected
Agent output included plain-text credit card pattern in response body.
Critical Auto-blocked

Insights

Get comprehensive evaluation reports.
See what broke, why it failed, and how to fix it.

Beyond Scores with FluxLoop

Sample Report Desktop only

Join the waitlist!

Test AI agents in your browser. No setup, no code.
Connect Git and start evaluating in minutes.

fluxloop.web
Coming Soon
FluxLoop in Your Browser
Searching your agent...
agent.py
tool.py
memory.json
Wanna start with this?
Persona Developer
Input "Book a flight..."
Evaluation Results PASS
98%
Success
2%
Fail
Book flight to Tokyo
Cancel reservation