Fixing Scrabble

Joshua Lewis has run the numbers on letters, and he’s discovered that Scrabble’s point system is statistically suspect. Indeed, he suggests, it may be downright suboptimal. Lewis, a scientist and entrepreneur, developed a software program, called Valett, for “determining the appropriate letter valuations in word games.” The program’s algorithm “analyzes the corpus of a game’s legal plays and provides point values for the letters in the game based on a desired weighting of their frequency, frequency by length and the entropy of their transition probabilities.” He ran Scrabble’s tile values through the algorithm and discovered all sorts of anomalies, which he attributes to historical changes in the set of words that can be played legally in the game.
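For the curious, here is a rough sketch in Python of the kind of inputs that description names: letter frequency across a word list and the entropy of each letter's transitions to the letter that follows it. To be clear, this is not Valett's code or its formula. The function names, the log-frequency scaling in `toy_values`, and the ten-point cap are all assumptions made up for illustration, and the sketch ignores the frequency-by-length input entirely.

```python
import math
from collections import Counter, defaultdict


def letter_frequencies(words):
    """Share of all letters in the word list accounted for by each letter."""
    counts = Counter(ch for word in words for ch in word)
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}


def transition_entropy(words):
    """Entropy, in bits, of the next-letter distribution for each letter.

    A low value means the letter is picky about what follows it,
    which makes it harder to place on a board.
    """
    following = defaultdict(Counter)
    for word in words:
        for current, nxt in zip(word, word[1:]):
            following[current][nxt] += 1

    entropy = {}
    for current, nxt_counts in following.items():
        total = sum(nxt_counts.values())
        entropy[current] = -sum(
            (n / total) * math.log2(n / total) for n in nxt_counts.values()
        )
    return entropy


def toy_values(words, max_points=10):
    """Hypothetical scoring: rarer letters score higher, capped at max_points.

    Assumption: points scale with -log(frequency), so the rarest letter in
    the list gets max_points and no letter scores below 1. The word list
    must contain at least two distinct letters.
    """
    freq = letter_frequencies(words)
    top = -math.log(min(freq.values()))
    return {
        ch: max(1, round(max_points * -math.log(p) / top))
        for ch, p in freq.items()
    }


# A tiny stand-in for a real dictionary of legal plays.
words = ["QUIZ", "QUIT", "QUA", "ZOO"]
print(toy_values(words))
print(transition_entropy(words))
```

In this four-word list Q is always followed by U, so its transition entropy comes out to zero: it can go in exactly one kind of place. A real analysis would run over the full dictionary of legal plays and would weight the inputs against one another, which is where the judgment calls come in.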

The big problem seems to be that Q, which Lewis terms an “outlier” (I can attest, anecdotally, to the truth of that description), is substantially undervalued. To bring the game up to statistical snuff, the values of many other letters would, as a result, need to be “compressed.” As the BBC explains:

According to Lewis’s system, X (worth eight points in the current game) is worth only five points and Z (worth 10 points now) is worth six points. Other letter values change too, but less radically. For example, U (one point currently) is worth two in the new version, G (two points) becomes three and M (three points) becomes two.

But Lewis’s plan is founded on a misunderstanding. By accounting for recent changes in word frequency and transition probability entropy, he seems to believe that he can return the Scrabble scoring system to a statistically pristine state. But that pristine state, that Eden where all the numbers line up, never existed. The points system was a kludge from the get-go, as the analytically minded have long known. The game’s inventor, Alfred Butts, “calculated a value for each tile by measuring how frequently each letter appeared on the front page of the New York Times.” Explains John Chew, of the North American Scrabble Players Association, “Butts had a selection bias in favour of printed newspaper English which many people have suggested ought to be rectified.” But changing the system at this point, Chew says, with considerable understatement, would inspire “catastrophic outrage.”

It would also make the game less fun, because it would make it more difficult for novices to occasionally beat veteran players. The scoring system’s lack of statistical rigor, it turns out, has the unintended but entirely welcome effect of adding a little extra dash of luck to the game, as Lewis himself points out. The apparent weakness is a hidden strength.

Let the statistically impure thoughts of Alfred Butts serve as a lesson to us all about the dangers of our current fixation on the analysis of large data sets. Armed with a fast computer, a wonky algorithm, and a whole lot of Big Data, a geek will begin to see problems everywhere in our messy human world. And by correcting every statistical anomaly or inefficiency, he’ll not only clean up the messiness, he’ll remove the fun. To a statistician, a blank tile has no value. The rest of us know better.

Photo by openfly.

7 thoughts on “Fixing Scrabble”

1. Daddy B.

As a dedicated online Scrabble player, I say at least 12 points for Q is in order and V’s should be banished.

2. Kevin

Interesting post. It reminds me of George Van Dyne’s systems ecology study, where he tried to find a stable underlying pattern of nature by virtually replicating a grassland in Colorado. It would be the “real” Eden, where the wild would be tamed into a serene stability of data sets. When the computer came back with nothing, his solution was to add more data. He kept on adding and adding to no avail, until he died in 1981 and the project was shut down. He failed to comprehend his gross simplification of the real dynamic and chaotic forces of nature. The poor guy was picking food out of animals’ mouths just so he could create his computer model. Some things just cannot be left to pure logic.

Anyway, I’d love to hear your opinion of the cybernetic influence on ecology, Nick. Thanks!

3. Anton Sirius

“It would also make the game less fun, because it would make it more difficult for novices to occasionally beat veteran players.”

Absolute nonsense. Reducing the point value of overvalued ‘rare’ letters, which aren’t actually that rare because of obscure Scrabble-legal words that novices won’t even know, would make it easier for veterans to get toppled, not tougher.

Maybe next time you feel the desperate need to heap scorn on statistics, you should actually try to construct a coherent argument against them.

4. Nick (post author)

Sorry to upset you, Anton. I was drawing on the analysis of Mr. Lewis himself:

Even Joshua Lewis, inventor of the new system, believes the traditional valuations can make the game more exciting. “You’re really lucky if you pick an X because it’s over-valued and unlucky if you pick a V. So if they were to re-do the values of the tiles that would reduce the level of luck. That might be desirable in tournaments but it might not be as good in casual play where you want the less skilled players to have a shot periodically at beating the more highly skilled players.”

As I wrote, it’s about increasing the relative role of luck in the play of the game and its outcome.

5. dsn

Chew’s and O’Laughlin’s approach to reconsidering the face values of the tiles involves adjusting their equity values. Equity value in Scrabble is similar to advanced baseball stats that compare players to an “average” replacement. Scrabble theorists have been calculating this stat—let’s call it VORT, or Value Over Replacement Tile—since the 1980s. “The Barry Bonds of the Scrabble set,” O’Laughlin said, is the blank, with a VORT of about 25 points.

