go to ORKG: http://orkg.org/orkg/predicate/P154044

evaluation benchmark

Benchmark dataset used to evaluate a system. It can be an external or self-constructed one.