StoriesChat Debug

Story

Models

Player model

Grader model

Player and grader should be different models to avoid self-bias.

Settings

Turns

Use LLM player

Auto-evaluate

Player persona

Grader persona

Actions

Stats

Turns

Avg ms

Tokens

Export

Select a story and click Run to begin.

Evaluation

Run a test with "Auto-evaluate" enabled
to see grader results here.

Scores Leaderboard

Loading scores...

Test Cases

Select or create a test case to edit it.

Advanced Options

Agent Behavior

Stop on game end

When enabled, the agent stops as soon as the story reaches an end condition and shows a victory screen. Disable to continue playing after game-over signals.

🧪 STORIESCHAT DEBUG