Benchmark suites instead of leaderboards for evaluating AI fairness
{{output}}
Benchmarks and leaderboards are commonly used to track the fairness impacts of artificial intelligence (AI) models. Many critics argue against this practice, since it incentivizes optimizing for metrics in an attempt to build the "most fair" AI model. However,... ...