Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Mar 1, 2025·
Roham Koohestani
Roham Koohestani
,
Philippe De Bekker
,
Maliheh Izadi
· 1 min read
PDF
Type
Publication
arXiv preprint arXiv:2503.05860

Review and tooling for elevating benchmark quality in AI4SE; introduces BenchScout and an enhancement protocol.