Experimental
AI Bias Lab
Pit two LLMs against each other in structured debates, swap sides, and measure which models show systematic bias on controversial topics. Science, not vibes.
How it works
1
Configure
Pick a topic and two models to test
2
Debate
Models argue FOR and AGAINST, then swap sides
3
Judge
A third model blindly scores each performance
4
Analyze
Statistical analysis reveals systematic bias
Configure experiment
16
120
Estimated: 10 total debates, ~80 API calls, ~2 min runtime