Benchmark suites instead of leaderboards for evaluating AI fairness

Benchmarks and leaderboards are commonly used to track the fairness impacts of artificial intelligence (AI) models. Many critics argue against this practice, since it incentivizes optimizing for metrics in an attempt to build the "most fair" AI model. However, ...
