Veček Niki (University of Maribor, Slovenia), Mernik Marjan (University of Maribor, Slovenia), Črepinšek Matej (University of Maribor, Slovenia), Hrnčič Dejan (University of Maribor, Slovenia)
A Comparison between Different Chess Rating Systems for Ranking Evolutionary Algorithms
Annals of Computer Science and Information Systems, 2014, vol. 2, pp. 511-518, figures, tables, 31 references
Algorithms, Ranking, Scientific experiment
Chess Rating System for Evolutionary Algorithms (CRS4EAs) is a novel method for comparing evolutionary algorithms; it evaluates and ranks algorithms using the formulae of the Glicko-2 chess rating system. It has been shown empirically that CRS4EAs is comparable to the standard method for comparing algorithms, null hypothesis significance testing. This paper examines the application of chess rating systems other than Glicko-2. The results of 15 evolutionary algorithms on 20 minimisation problems obtained using the Glicko-2 system were compared empirically with those obtained using the Elo rating system, the Chessmetrics rating system, and the German Evaluation Number (DWZ). The experiment showed that Glicko-2 is the most appropriate choice for evaluating and ranking evolutionary algorithms. Whilst the main benefit of the other three systems was their simple formulae, the ratings in Glicko-2 proved more reliable, the detected significant differences are supported by confidence intervals, inflation or deflation of ratings is easily detected, and the weight of individual results is set dynamically. (original abstract)
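
To illustrate the general idea of rating algorithms like chess players, the following minimal sketch applies the textbook Elo update to pairwise comparisons of algorithms on minimisation problems. It is not the procedure used in the paper: the K-factor of 32, the starting rating of 1500, the draw threshold `eps`, the game-by-game update order, and all function names are assumptions made for illustration only.

```python
from itertools import combinations


def expected_score(r_a, r_b):
    """Standard Elo expected score of player A against player B."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))


def elo_update(r_a, r_b, score_a, k=32.0):
    """Return updated ratings after one game; score_a is 1, 0.5, or 0.
    The K-factor k=32 is an assumed value, not taken from the paper."""
    e_a = expected_score(r_a, r_b)
    delta = k * (score_a - e_a)
    return r_a + delta, r_b - delta


def play(fit_a, fit_b, eps=1e-9):
    """One 'game' on a minimisation problem: the lower fitness wins.
    Fitness values closer than eps (an assumed threshold) are a draw."""
    if abs(fit_a - fit_b) < eps:
        return 0.5
    return 1.0 if fit_a < fit_b else 0.0


def rate(results, start=1500.0):
    """results maps algorithm name -> list of best fitness per problem.
    Every pair of algorithms plays one game per problem, and ratings
    are updated after each game (a simplification)."""
    ratings = {name: start for name in results}
    n_problems = len(next(iter(results.values())))
    for p in range(n_problems):
        for a, b in combinations(results, 2):
            score_a = play(results[a][p], results[b][p])
            ratings[a], ratings[b] = elo_update(ratings[a], ratings[b], score_a)
    return ratings


if __name__ == "__main__":
    # Hypothetical fitness values for three algorithms on two problems.
    demo = {"A": [0.1, 0.2], "B": [0.5, 0.6], "C": [0.5, 0.9]}
    for name, rating in sorted(rate(demo).items(), key=lambda x: -x[1]):
        print(name, round(rating, 1))
```

Unlike Glicko-2, this plain Elo sketch keeps no rating deviation or volatility, so it provides no confidence intervals around the ratings.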
