With the last post, we looked at words of three letters in the Google corpora for six languages, to understand which language has the most three-letter words. But our original question was which one has the most ratio of words among trigrams that “look like” real words, and today we are going to answer that question.
Being born as a native English speaker is a great luck these days. For the simplicity of its grammar and a predominance in the cultural world from the twentieth century onwards, English, or more precisely one of its dialect is the de facto international language, and being able to speak it natively is a huge advantage. If you are like me, instead, no matter how expert you may become, your level of comprehension of the subtleties will always be lower than that of any native speaker, you will always retain at least a bit of your original accent; you’ll never be able to pronounce all the words in the correct way, and there always will be some word you won’t know.
Among these, the biggest issue with English is its inconsistent pronunciation. Since even native speakers in the same country cannot reach an agreement on how to pronounce a word , you may get away with mistakes by claiming that your weird pronounce is actually used by some native English dialect .
Something you cannot get away with, though, is the lexicon. English has a huge lexicon. Similar to ancient Romans, English conquered and assimilated, not only people and land but words . I jokingly stated that in English almost any sequence of three letters that look like a word is actually a word. Is this just a joke or is there some truth behind?
- I am not talking about accents here, these are different dialects. [↩]
- Admittedly, you will sound weird anyway, as you will invariably make mistakes in different directions and your sentence will be a mix of different dialects. [↩]
- This particular made easier for English to become the de facto international language, as it shares a big part of its vocabulary with most Western languages. Incidentally, it is also one of the reason behind the different ways of pronouncing words. [↩]
In the first post of the series I gave some high level suggestions on writing a paper with LaTeX. Instead, from now on I will focus on specific typographical areas. The first I want to explore is spacing, which often goes unnoticed (because, well, you cannot see spaces) but is nonetheless one of the most important aspects in ensuring that a paper flows correctly.