Wordle, the daily word guessing game, is taking Twitter, the world, and my relationship by storm.

If you haven’t heard of it—how?—the rules are simple. Every day there’s a new five-letter word (a Wordle.) You get six attempts to guess it, and after each one the color of the tiles change to tell you whether a letter was in the word and in the right place (green), in the word but not in the right place (yellow), or not in the word (gray.)

These limitations are what make Wordle so fun. Everyone in the world (and, in particular, in my house) is attempting to guess the same word in the fewest number of guesses. What’s clever is you can share your progress after you succeed—but all the letters are disguised as colored blocks. So you can gloat without giving anything away.

But never let it be said there’s a game that can’t be beaten (or, according to my girlfriend, ruined) with a bit of research, analysis, and time. So, if you’ve ever wondered what the best strategy is for winning at Wordle, let’s break it down.

Letter Distribution in the English Language

Letter frequency analysis is the study of how often and where letters occur in words. It’s pretty foundational to cryptography, because if you have to decode a secret message like we’re kind of doing with Wordle, it’s useful to know that you are more likely to see an E than a Q.

While exact letter frequency distribution changes based on the source text, the most common letters don’t really change.

Peter Norvig, director of research at Google, used the data from Google Books to come up with this list of the top 12 most common letters in the English language:

  1. E (in 12.49 percent of words)
  2. T (9.28 percent )
  3. A (8.04 percent)
  4. O (7.64 percent)
  5. I (7.57 percent)
  6. N (7.23 percent)
  7. S (6.51 percent)
  8. R (6.28 percent)
  9. H (5.05 percent)
  10. L (4.07 percent)
  11. D (3.82 percent)
  12. C (3.34 percent)

There’s one issue with this list for us Wordlers, though. It’s based on a natural-language source text, which means the word the kind of messes things up for us. The is by far the most common word in the English language, representing 7.14 percent of all words in the Google Books source text, followed by of (4.16 percent), and (3.04 percent), and to (2.6 percent). This means the position of T and H in the list are higher than they should be.

Another option is to just look at the distribution of letters in dictionary words. An analysis of the Concise Oxford Dictionary (9th Edition, 1995) found the 12 most common letters were:

Leave a Reply

Your email address will not be published. Required fields are marked *

You May Also Like

Chromecasts now support multi-room music playback

Chromecasts can now be used to play music in sync across multiple…

People are just realising a little-known Wi-Fi hack makes router upgrades less of a nightmare – it’ll save loads of time

THERE’S one issue people complain about time and again when upgrading their…

SEC Probes Trades by Elon Musk and His Brother

WASHINGTON—The Securities and Exchange Commission is investigating whether recent stock sales by…

Blue Origin’s next space tourism flight is scheduled for October 12 

Blue Origin announced on Monday that its next human spaceflight will happen…