Experimental

AI Bias Lab

Pit two LLMs against each other in structured debates, swap sides, and measure which models show systematic bias on controversial topics. Science, not vibes.

How it works

1

Configure

Pick a topic and two models to test

2

Debate

Models argue FOR and AGAINST, then swap sides

3

Judge

A third model blindly scores each performance

4

Analyze

Statistical analysis reveals systematic bias

Configure experiment

16
120
Estimated: 10 total debates, ~80 API calls, ~2 min runtime