Naughty Letter Frequencies in English
Here’s a community-maintained "List of Dirty, Naughty, Obscene, and Otherwise Bad Words" across various languages on GitHub. I was curious about a naïve frequency distribution of consonants in the English-language list (NSFW, obviously) and wrote a small script to count them. Here are the results:
Letter | Count |
---|---|
t | 211 |
s | 208 |
n | 193 |
r | 186 |
l | 167 |
g | 147 |
c | 124 |
b | 121 |
p | 116 |
h | 97 |
d | 91 |
m | 91 |
k | 72 |
y | 70 |
f | 48 |
w | 41 |
v | 29 |
j | 21 |
x | 19 |
z | 7 |
q | 5 |
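
The script itself isn't included here, so what follows is only a minimal sketch of how a count like this could be produced, not the original code. It assumes the word list is a plain text file with one entry per line and that `y` counts as a consonant (as it does in the table above); the names `consonant_counts`, `to_markdown`, and `load_words` are made up for illustration.

```python
from collections import Counter

# Assumption: every letter except a, e, i, o, u counts as a consonant,
# which matches the 21 rows in the table (y included).
CONSONANTS = set("bcdfghjklmnpqrstvwxyz")


def consonant_counts(words):
    """Tally each consonant across all entries, ignoring case."""
    counts = Counter()
    for word in words:
        counts.update(ch for ch in word.lower() if ch in CONSONANTS)
    return counts


def to_markdown(counts):
    """Render the tally as a table, most frequent letter first."""
    lines = ["Letter | Count |", "---|---|"]
    for letter, n in counts.most_common():
        lines.append(f"{letter} | {n} |")
    return "\n".join(lines)


def load_words(path):
    """Read one entry per line, skipping blank lines (hypothetical file layout)."""
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


# Usage, with a placeholder path:
#   print(to_markdown(consonant_counts(load_words("path/to/wordlist.txt"))))
```

This is "naïve" in the same sense as above: it counts raw letter occurrences, so multi-word entries and repeated letters within a word all contribute to the totals.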
Not sure what I’m going to do with this information but here it is. 🤬