Can We Trust Ai Benchmarks? An Interdisciplinary Review of Current Issues in Ai Evaluation

AuthID
P-018-HD8
7
Author(s)
Eriksson, M
·
Purificato, E
·
Noroozian, A
·
Chaslot, G
·
Gómez, E
·
Llorca, DF
Tipo de Documento
Article in Press
Year published
2025
Publicado
in CoRR
Volume: abs/2502.06559
Indexing
Publication Identifiers
DBLP: journals/corr/abs-2502-06559
Export Publication Metadata
Info
At this moment we don't have any links to full text documens.