Thursday, September 27, 2007

Computers choose: who was the strongest player?

Hereby I announce a new series of related posts on ratings, and computor or heuristical evaluation of GM or club performance as the case may be. I will be reproducing entire articles, with appropriate attributions in all cases, with the simple goal of sharing a series of articles from different sources.

My goal is simply to share things some of you may not have seen and hopefully, and in placing each in close proximity, perhaps shed new light on previous discussions--either as fresh, established, or older. I sincerely hope that you enjoy them, and I can tell you, that text like below does take real effort and time to copy faithfully. These publications will be rapid. So if you can come by as much as three or more times per week cannot be a total loss. :) Their sequence will be cumulative. Without further ado, quoted from

[Appologies for imperfection. Many of the graphs reproduce well, but not all of them. Please see original link immediately below for detailed view.]

Computers choose: who was the strongest player?
30.10.2006 – Who is the best chess player of all time? Entire books have been devoted to the subject, but all have one major flaw: they are mainly subjective. Necessarily so, since there is no direct way of comparing Morphy to Fischer, Lasker to Kasparov. Or is there? Two scientists from Slovenia try it with computers and statistics. The results might surprise you.

Who was The Strongest Chess Player of All Time?
Computer Analysis of World Chess Champions
By Matej Guid and Ivan Bratko
University of Ljubljana, Faculty of Computer and Information Science, Artificial Intelligence Laboratory, Ljubljana, Slovenia. This article is based on a paper by the same authors published in the ICGA Journal; full reference is given below.

Who is the best chess player of all time? Chess players are often interested in this question to which there is no well founded, objective answer, because it requires a comparison between chess players of different eras who never met across the board. With the emergence of high-quality chess programs a possibility of such an objective comparison arises. However, so far computers were mostly used as a tool for statistical analysis of the players' results.

Our approach was different: we were interested in the chess players' quality of play regardless of the game score, which we evaluated with the help of computer analyses of individual moves made by each player. A method to assess the difficulty of positions was designed, in order to take into account the differences in players' styles and to compensate for the fact that calm positional players in their typical games have less chance to commit gross tactical errors than aggressive tactical players. We also give a carefully chosen methodology for using computer chess programs for evaluating the true strength of chess players.

The fourteen classic-version World Champions, from the first World Chess Championship in 1886 to the present, were evaluated. Matches for the title of »World Chess Champion«, in which players contended for or were defending the title, were selected for analysis. Several different criteria were designed. The basis for evaluation was the difference between the position values resulting from the moves played by the human and the moves chosen as best by the chess program. We also calculated the average number of blunders and observed how would the players perform providing they would all deal with equally complex positions. Our analyses, among other things, also clearly show that the percentage of best moves played depends on analysed position itself and that is in very high correlation with the difference of best two moves evaluations (according to the computer): the bigger the difference between best two moves evaluations – the easier it is to find the best move. By observing the average material quantity during the games, we tried to determine players inclination to simplify positions.

Generally, our computer analysis seems to have produced sensible results that can be nicely interpreted by a chess expert. Anyway, many will find some of the results quite surprising. The winner according to the main criterion, where we measured average deviations between evaluations of played moves and best evaluated moves according to the computer, is Jose Raul Capablanca, the 3rd World Champion.

As we did in the study, this result should be interpreted in the light of the comparatively low complexity of positions in Capablanca's games. Anyway, he was also on top according to other criteria where we measured quality of play and was only beaten according to one criterion (albeit a very important one), namely the quality of play provided that all players dealt with equally complex positions, by the present World Champion, Vladimir Kramnik. Both Capablanca and Kramnik distinctly deviated from the rest of the players.

Separatelly, we conducted analysis of the latest World Championship match between Kramnik and Topalov. The results (mean loss per move: Kramnik 0.1220, Topalov 0.1328; only the games with classical time control were taken into account) tell that Kramnik's play was somewhat better, while the overall quality of play in the match was on quite decent level (comparing to other World Championship matches), although the record (Kramnik's 0.0903 in the London 2000 match against Kasparov) was really not in danger. Note that the lower this measure the better the performance.

The chess program Crafty was used to perform the analyses. We needed an open source program in order to slightly modify it, as is described in the article. One may argue that Crafty is weaker than at least some of the fourteen World Champions who were taken into consideration. However, altogether more than 37,000 positions were evaluated and even if evaluations are not always perfect, for our analysis they just need to be sufficiently accurate on average since small occasional errors cancel out through statistical averaging. Anyway, we would like to encourage other researchers that might have access to source code of the strongest commercial chess programs, to modify and run them in the way we proposed, in order to obtain a comparison between various different engines as well.

The basic criterion for evaluating World Champions was the average difference between moves played and best evaluated moves by computer analysis. According to this analysis, the winner was the third World Champion, Jose Raul Capablanca. This result should be interpreted in the light of the comparatively low complexity of positions in Capablanca's games which is quite in line with the known assessments in the chess literature of his style. For example, Garry Kasparov in his set of books My Great Predecessors when commenting Capablanca's games speculates that Capablanca occasionally did not even bother to calculate deep tactical variations. The Cuban simply preferred to play moves that were clear and positionally so strongly justified that calculation of variations was simply not necessary. He also describes Capablanca with the following words: “He contrived to win the most important tournaments and matches, going undefeated for years (of all the champions he lost the fewest games).” and “his style, one of the purest, most crystal-clear in the entire history of chess, astonishes one with his logic.”

The results of blunder-rate measurement are similar. We expected positional players to perform better by this criterion than tactical players, since in quiet positions there were less opportunities to blunder. Note the excelent result of Tigran Petrosian who is widely renowned as a pure positional player. In compliance with this observation, Steinitz, who lived in an era of tactical romantic chess, took clearly last place.

Capablanca is renowned for playing a 'simple' chess and avoiding complications, while it is common that Steinitz and Tal faced many 'wild' positions in their games. The results of complexity measurement clearly coincide with this common opinion.

The method for assesing the complexity of positions is described in detail in the original article (see also the reference below). Graph of errors made by players at different levels of complexity clearly indicates the validity of the chosen measure of complexity of positions; the players made little errors in simple positions, and the error rate increased with increasing complexity.

We used the above mentioned metric of position complexity to determine the distribution of moves played across different intervals of complexity, based on positions that players had faced themselves. This, in turn, largely defines their style of play. For example, Capablanca had much less dealing with complex situations compared to Tal, who is to be regarded as a tactical player.

The main deficiency of the two criteria, as detailed in the previous subsections, is in the observation that there are several types of players with specific properties, to whom the criteria do not directly apply. It is reasonable to expect that positional players in average commit fewer errors due to the somewhat less complex positions in which they find themselves as a result of their style of play, than tactical players. The latter, on average, deal with more complex positions, but are also better at handling them and use this advantage to achieve excellent results in competition. We wanted to determine how players would perform when facing equally complex positions. The winner was the fourteenth World Champion Vladimir Kramnik. Kramnik also had the best performance of all the matches; his average error in his match against Kasparov (London, 2000) was only 0.0903. We also tried to determine how well the players would play, should they all play in the style of Capablanca, Tal, etc. It is interesting to notice that Kasparov would outperform Karpov, providing they both played in Tal's style.

The percentage of best moves played alone does not actually describe the quality of a player as much as one might expect. In certain types of position it is much easier to find a good move than in others. Experiments showed that the percentage of best moves played is highly correlated to the difference in evaluations of the best and second-best move in a given position. The greater the difference, the better was the percentage of player's success in making the best move.

Based on that observation, another criterion was the expected number of best moves played providing that all players dealt with positions with equal difference between the best two moves, as was described in the previous section. It represents another attempt to bring the champions to a common denominator. See the results right below.

Kramnik, Fischer, and Alekhine had the highest percentage of best moves played, but also the above-mentioned difference was high. In contrast, Capablanca, who was right next regarding the percentage of the best move played, on average dealt with the smallest difference between the best two moves. The winner by this criterion was once again Capablanca. He and Kramnik again clearly outperformed the others.

The purpose of calculating the average material quantity, that is the sum of the numerically expressed values of all pieces on board, was not to determine the quality of play, but to collect additional information on a player's style of play. We mainly tried to observe a player's inclination to simplify positions.

Among the players who stand out from the others, Kramnik obviously dealt with less material on board (remember his early queen exchanges in his Berlin Wall games against Kasparov?). The opposite could be said for Steinitz, Spassky, and Petrosian.

The authors

Matej Guid has received B.Sc. degree in computer science at the University of Ljubljana. He works on his Ph.D. thesis at Artificial intelligence Laboratory, Faculty of Computer and Information Science of Ljubljana University. His present research includes machine learning, computer chess, heuristic programming and robot learning by experimentation. At the present time he works on programming an automated Chess Tutor that will be able to intelligently comment chess games in a language comprehensible to humans. Chess has been one of Matej's favourite hobbies since he was a little kid, he was also a junior champion of Slovenia a couple of times.

Ivan Bratko is professor of computer science at University of Ljubljana, Slovenia. He is head of Artificial intelligence Laboratory, Faculty of Computer and Information Science of Ljubljana University, and has conducted research in machine learning, knowledge-based systems, qualitative modelling, intelligent robotics, heuristic programming and computer chess (do you know the famous Bratko-Kopec test?). Professor Bratko has published over 200 scientific papers and a number of books, including the best-selling Prolog Programming for Artificial Intelligence. Chess is one of his favourite hobbies.

The original paper was published in the ICGA Journal, Vol 29, No. 2, June 2006, pages 65-73. It was also presented at the 5th International Conference on Computers and Games, May 29-31 2006, Turin, Italy, and was published in the proceedings book.


Blogger transformation said...

a GM wrote me in late October the following:

"Agreed, the article was of interest but I saw many problems with it.

"In the first place it isn’t just a question of a relatively small sampling of the games of Capa or of Bobby. There was the time controls. Which have a huge impact on the quality of play. The truly “classical” time control was forty moves in two and a half hours, followed by ADJOURNMENT. The players could study the position and would then play sixteen moves in one hour. In case of several sessions they would ADJOURN again. Hence, when you think of Bobby, for example, he could play (approximately) twenty moves of opening theory – in the cases of Ruy Lopez or Sicilian defenses even more – adjourn, study the position and play incredibly accurately. Witness for example his adjourned ending in his match with Taimanov. Outstanding play!

"The next problem I see is with Kramnik. For example, look at his match with Leko. [remark deleted by me]! So many short draws, virtually the “whole” game was opening theory with very little in the way of “original” moves. Kramnik did much better in his match with Topalov as he was forced to play all the games. Not his choice at all."

Thu Sep 27, 02:34:00 AM PDT  
Blogger transformation said...

a second comment, writen to me after above:

"Much greater or larger theory means to suggest that the openings have been “better mapped” out then previous generations. Without question. However, many of the older GM’s remained faithful to a remarkably small number of openings and defenses. For example, Smyslov played the Slav Defense. Full stop. So his own opening theory was remarkably well developed within a very narrow range of openings and defenses. If you look at the Capa v Alekhine match, you’ll see endless games of QGD positions. So while opening theory has a much broader reach today, the old guys repeated the same lines time after time.

Thu Sep 27, 02:36:00 AM PDT  
Anonymous Sciurus said...

Interesting post! Of course, everybody interprets these things in his/her own context. In my case, I am currently thinking what kind of master games I should study to improve my games. And my personal conclusion from this study was that I should look into Capablanca's games. Not only because he made few mistakes but also because of the low complexity. There might be a chance that I understand the games while I believe that the games played by today's GMs are simply to complex to grasp for me.

The second thing that came up in my mind was, "interesting, so Capablanca played like Crafty (or other way around)". I think one should be careful with these things. Who knows, may be the maker of Crafty likes Capablanca's games and sub-consciously programmed it to play in this style ;-)

Thu Sep 27, 06:36:00 AM PDT  
Blogger likesforests said...

I'm glad Capablanca scored so highly! Although, I suspect Crafty couldn't fully appreciate his skill in the endgame. In any event, this means I probably made a wise choice buying "Capablanca's Best Chess Endings".

Thu Sep 27, 11:36:00 AM PDT  
Blogger Blue Devil Knight said...

Fantastic. Great article, and great comments from the anonymous GM. Thanks for providing this.

Thu Sep 27, 04:49:00 PM PDT  
Blogger Wahrheit said...

Wait a sec--I enjoyed this article too (saw it awhile back and I think i even linked it) but "Smyslov played the Slav Defense. Full stop." (?!?!)

Smyslov had a wide variety of defenses to 1 d4, which I've seen in his own books--he played the King's Indian, a large number of Nimzo-Indians and had a variation of the Gruenfeld named after him, and played the Slav and occasionally other varaitions of the QGD. In fact he had a wider variety of defenses than almost any other top GM of all time.

So I'm not sure where that remark was coming from

Fri Sep 28, 10:15:00 AM PDT  

Post a Comment

<< Home