Word Counter ยท 7 min read
25 Fun Facts About Words, Letters, and Language
Language is full of structural quirks, mathematical patterns, and historical accidents that most people never notice even though they encounter them every day.
Letters and Their Frequencies
1. E is the most common letter in English โ by a large margin
The letter E accounts for approximately 11โ13% of all letters in typical English text. The full frequency ranking (from most to least common) is: E, T, A, O, I, N, S, H, R, D, L, C, U, M, W, F, G, Y, P, B, V, K, J, X, Q, Z. This is why Wheel of Fortune contestants almost always guess R, S, T, L, N, E first.
2. The least common letters are Q and Z
Q and Z each appear in less than 0.1% of English text. Q is almost always followed by U in English (queue, queen, quiet), with rare exceptions in borrowed words like qoph, qigong, and qanat.
3. The most common vowel is E; the least common is U
Among the five standard vowels (A, E, I, O, U), E is far ahead in frequency, followed by A, O, I, and then U trailing significantly. Y functions as a vowel in about half its appearances (gym, system, style) but is typically counted as a consonant.
4. "Rhythm" is the longest common word without a standard vowel
"Rhythm" (6 letters) and "syzygy" (6 letters) are among the longest words using only Y as their vowel-sound letter. "Rhythms" (7 letters) extends this further. Words like "crwth" and "glyph" are also vowel-free except for Y.
Sentences and Grammar
5. The shortest grammatically complete sentence in English is two words
"Go!" or "I am." are both complete sentences. "Go!" contains an implied subject (you) and a verb. "I am." contains an explicit subject and a linking verb. Whether one-word commands like "Stop!" count as complete sentences is a matter of grammatical interpretation โ they have an implied subject but no explicit predicate.
6. "Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo" is a valid sentence
This eight-word sentence is grammatically correct. It uses three meanings of "buffalo": the city Buffalo (NY), the animal buffalo, and the verb "to buffalo" (to bully or intimidate). Parsed: "Buffalo [bison] from Buffalo [city] that [other] Buffalo [bison] from Buffalo [city] bully [also] bully Buffalo [bison] from Buffalo [city]."
7. A pangram uses every letter of the alphabet at least once
The most famous English pangram is "The quick brown fox jumps over the lazy dog" (35 letters, 9 unique letters repeated). It has been used to test typewriters and fonts since at least the late 19th century. The shortest known meaningful pangram in English is "Sphinx of black quartz, judge my vow" (29 letters).
8. "Stewardesses" is typed entirely with the left hand on a QWERTY keyboard
At 12 letters, "stewardesses" is one of the longest common English words typed entirely with the left hand. Conversely, "lollipop" and "monopoly" are typed entirely with the right hand. "Typewriter" can be typed entirely on the top row of a QWERTY keyboard โ a coincidence that may have influenced the original keyboard arrangement.
Numbers in Language
9. One to nine hundred ninety-nine contains no letter A
The numbers one through nine hundred ninety-nine โ when spelled out โ contain no letter A. The first number to contain an A when spelled out in English is one thousand. (British English: "one thousand" also counts. The letter appears in "thousand.")
10. "Four" is the only number whose letters match its value
The word "four" contains exactly four letters. No other number in English has this property. Zero has four letters; one has three; two has three; three has five; five has four (another one!). Wait โ actually five also has four letters. "Four" is the most famous example because four = 4 letters precisely.
Words and Their Origins
11. About 60% of English words have Latin or French roots
Despite English being a Germanic language at its grammatical core, the majority of English vocabulary comes from Latin (often via French, after the Norman Conquest of 1066). Germanic words dominate everyday speech โ house, eat, walk, strong โ while Latin-derived words dominate formal and academic registers โ residence, consume, ambulatory, robust.
12. English borrows from over 350 languages
The Oxford English Dictionary has traced English words to origins in over 350 languages. Algebra comes from Arabic. Bungalow from Hindi. Chocolate from Nahuatl (via Aztec). Kayak from Inuktitut. Sauna from Finnish. Robot from Czech. Ketchup possibly from Hokkien Chinese (kรช-tsiap).
13. The word "set" has the most definitions in the OED
The verb "set" holds the record in the Oxford English Dictionary with 430+ definitions, making it the most complex word in English. "Run" and "go" follow closely. These words are so common and ancient that they have accumulated hundreds of subtly different uses across contexts.
14. "Cleave" means both to split apart and to stick together
English has a category called contranyms (or auto-antonyms) โ words that are their own opposite. "Cleave" can mean to split (as in a meat cleaver) or to adhere tightly (as in "cleaving to one's beliefs"). "Sanction" means both to approve and to penalise. "Oversight" means both supervision and failure to notice something.
Writing and Text
15. The average English word is 5.1 characters long
Analysis of large text corpora consistently finds the average word length in English is between 4.5 and 5.5 characters. This is why five characters is used as the standard "word" unit in typing speed tests โ it approximates the statistical average.
16. The word "and" appears approximately once every 20 words in English text
In typical English prose, "and" accounts for roughly 2.5โ3.5% of all words โ one of the highest rates for any content-bearing word. It is the most frequently used conjunction and the fifth most common word overall.
17. Every English syllable must contain exactly one vowel sound
A syllable is defined by its vowel nucleus. English words can have complex consonant clusters โ "strengths" has one syllable and 9 letters โ but each syllable has exactly one vowel sound at its core. "Strengths" vowel: the E. "Squirrel" has two syllables because it has two vowel sounds (the U and the second syllable schwa).
18. The dot over the letters i and j has a name: a tittle
The small stroke or dot placed above a lowercase i or j is called a tittle. The phrase "jot or tittle" (as used in the Bible, Matthew 5:18) refers to the smallest possible units of written language. A jot is the smallest letter in the Hebrew alphabet; a tittle is the smallest diacritical mark.
Unusual Words
19. "Defenestration" means the act of throwing someone out of a window
This very specific word (from Latin fenestra, window) entered common usage after the Defenestrations of Prague in 1419 and 1618, when Bohemian protesters threw Catholic officials from windows. The First Defenestration (1419) helped trigger the Hussite Wars; the Second (1618) helped trigger the Thirty Years' War.
20. "Petrichor" is the smell of rain on dry earth
This word was coined in 1964 by Australian scientists Isabel Joy Bear and R. G. Thomas in an article in Nature. It combines Greek petra (stone) and ichor (the fluid that flows in the veins of the gods). The smell is caused by geosmin, a compound released by soil-dwelling bacteria, along with plant oils absorbed by dry earth.
21. "Sonder" describes the realisation that every stranger has a life as vivid as your own
Technically a neologism from the Dictionary of Obscure Sorrows (John Koenig, 2012โ2021) rather than a traditional dictionary entry, "sonder" has entered informal usage as a precise word for a feeling English had no previous term for. Language regularly acquires words this way โ through felt need and gradual adoption.
22. The ampersand (&) was once the 27th letter of the alphabet
The symbol & (ampersand) was considered a letter in the English alphabet through the 19th century. Children reciting the alphabet would say "...X, Y, Z, and per se and" โ meaning "and, by itself, and" โ to distinguish the symbol from the word. The phrase "and per se and" contracted over time into "ampersand."
23. "Laughter" and "slaughter" rhyme with "after" and "daughter" despite appearing identical
English spelling and pronunciation are famously misaligned. The word "ghoti" โ proposed satirically as an alternative spelling of "fish" โ illustrates this: GH as in enough (F sound), O as in women (I sound), TI as in nation (SH sound). English has about 1,100 different ways to spell its 44 phonemes.
24. A word spoken once per minute would be said 525,600 times in a year
The average person speaks around 16,000 words per day, hearing and producing far more. Over a lifetime of 70 active speaking years, that is approximately 400 million words spoken โ a staggering figure that nonetheless represents only a fraction of the stored vocabulary a literate adult carries.
25. The longest word you can type on a single row of a QWERTY keyboard is "typewriter"
At 10 letters, "typewriter" can be typed entirely using the top row of a standard QWERTY keyboard (Q-W-E-R-T-Y-U-I-O-P). This is widely cited as having been deliberately designed by early typewriter manufacturers to make it easy for salespeople to demonstrate the machine, though the historical evidence for this is debated.
References
- Crystal, D. (2003). The Cambridge Encyclopedia of the English Language (2nd ed.). Cambridge University Press.
- Pinker, S. (1994). The Language Instinct. William Morrow.
- Norvig, P. (2013). English letter frequency counts. norvig.com.
- McWhorter, J. (2014). The Language Hoax. Oxford University Press.