Create a battle, hide the model names, collect votes, and let Elo ratings update from real pairwise preferences.