Huliganov’s first ever rant!

Production date:	30 September 2006
Playout date:	30 September 2006
Camera:	Logitech Webcam
Post Production:	Windows Movie Maker – slight use
Location:	Home
Other people featured:	None
Genre:	Hulirant
Music used:	Gremin’s Aria, Eugene Onegin, Tchaikovsky
Languages used:	English Russian
Animals featured:	None

This piece is the first ever Huliganov rant, and actually I’m a but disappointed that a lot of people who watch and say they enjoy Huli‘s lessons didn’t also look up the rants by the same persona. This remains at under a thousand views, and not much discussion or rating.

Hulliganov offers here his disappreciation of noisy neighbours and his appreciation of the Chinese people for not making themselves unnecessarily tall.

Eugene Onegin in English (ask.metafilter.com)
Huliganov’s Speaking Mandarin Video (huliganov.tv)
Music Review: Need a Gala? Tchaikovsky Is a Go-To Guy (nytimes.com)
Music Review: Rich Talent and Promising Voices Well Rewarded (nytimes.com)

The Calamity Club: A Novel

(47533557)

Now retrieving the price.

(as of 29/07/2026 01:01 GMT +02:00 - )

This Inevitable Ruin: Dungeon Crawler Carl, Book 7

(48542757)

Now retrieving the price.

(as of 29/07/2026 01:01 GMT +02:00 - )

Word Frequency Issues for Language Learners

Peter and Jane book 1a Play with us — One well-known and practical application of word frequency analysis, but is the word list still current?

I wanted to address some issues with the whole question of using word frequency analysis when learning languages. It is obviously a good idea to use frequency studies if they are available.

They can function both as checklists to ensure that our courses (whether we are users or compilers of courses) contain a complete coverage of the most frequent words. I do think that a course which claims to give 2,000 words lets its users down if a noticeable percentage of say the top 1000 words are still missing, whereas students are expected to learn words for wainscotting, walruses and woodpeckers.

One big problem with frequency dictionary analysis and word count – especially when comparing between languages or methods – is what does it mean? If we are talking about uninflected languages then the number of individual words is shorter. The original poster referred to Italian, and here it is a problem, because every single verb has umpteen forms, so is that one word or is that umpteen words?

If you use a machine to collate the frequency, then “has”, “have” and “had” will all show up as different words. Should it be three or one? In those cases where a noun has 12 separate declined forms in Czech or Polish, is that 12 words or one word? Or is it something in between, with the forms guessable out of usual paradigms being counted as the same, but the irregular parts being considered separate?

And are words like “jack” or “rose” which have so many individual meanings counted as one word or as ten or so words? If a machine does it the count is objective, but again not true to the substance of the matter. If a human intervenes, then subjectivity enters the frame and the way one person collates it may have very different outputs to the way another person would do it.

That’s why I take statements like “80% of the words used are in 10,000 words” with a pinch of salt. Assuming the same rules for collation of word variation under headwords apply through the list, it is probably proper to parrot Pareto and say that 20% of the vocabulary gives 80% of the effect, but that is comparing relative numbers to relative numbers, which is probably wiser, and more likely to hold good when comparing different languages together, than comparing absolute numbers to relative numbers

And then another problem is the way word frequency changes over time. Even more subtle markers of how languages change over time than the incursion of new words (which obviously don’t show up at all in the old frequency lists) is the change not even so much in meaning but in fashionability of words which are there in the language the whole time. Some of these are glaringly obvious – nobody uses “comrade” in Russia as much as it was used 30 years ago. But in fact it applies to a greater or lesser degree to every word we use.

Frequency studies, then, ironically, need to be carried out far more frequently! I answered a question the other day on frequency lists for what used to be Serbo-Croat. Quite a lot of change has entered that language, especially as emerging ex-Yugoslav states have sought to distinguish their language from other similarly speaking states, including new letters in their languages and emphasising either regional words or regional pronunciations. There is of course a degree of artificiality about all that, but it cannot fail to influence the language really spoken by people.

However, from my research the last available frequency studies for Serbo-Croat were from the 1960s. They are not available online for free you have to pay to get them, which I did not do. I can only therefore imagine how erroneous they must be by now. The half life of usefulness of such a study is probably ten years, and these were done 50 years ago, in a different country constellation, a different political regime, with a different world going on around it with different things to do than now and different ways to live. High time for new frequency studies to be made in that language, but also in many other languages. I just wonder how old by now the frequency studies are which Ladybird books admirably used to create the Key Words Reading Scheme so known and loved by children in the English speaking world? Can anybody tell me that?

New research demonstrates language learners’ creativity (eurekalert.org)
Top 30 Languages to learn for 2050 (huliganov.tv)
Question on lexical sufficiency (huliganov.tv)
We Are the Words (technologyreview.in)

Red Rising

(465105978)

Now retrieving the price.

(as of 29/07/2026 01:01 GMT +02:00 - )

Harry Potter and the Goblet of Fire (Full-Cast Edition)

(485105369)

Now retrieving the price.

(as of 29/07/2026 01:01 GMT +02:00 - )

Война в Украине: ВСУ атаковали НПЗ в Рязани и Перми, Wildberries сообщили об эвакуации сотрудников в Рязанской области
Последние новости, комментарии и видео о войне России против Украины.
Война в Украине: хроника событий с 29 июня по 28 июля 2026 года
Последние новости, комментарии и видео о войне России против Украины.
Четыре женщины обвинили Джареда Лето в преступных действиях сексуального характера 29/07/2026
В общей сложности 10 женщин, с которыми побеседовала Би-би-си, утверждают, что актер и музыкант сексуально домогался их, когда они были подростками. С некоторыми Лето, как утверждается, вступал в половую связь.
ФСБ заочно обвинила Павла Дурова в содействии терроризму. Что известно 29/07/2026
Создателю Telegram Павлу Дурову предъявлено обвинение в содействии террористической деятельности, сообщила ФСБ. Поводом для преследования стал чат-бот для знакомств «Дайвинчик» и другие «многочисленные каналы, чаты и боты», которые отказалась удалять администрация Telegram: как считает ведомство, украинские спецслужбы через них вербуют россиян для диверсий.

	David J. James on Takes Two and Three of Trailer…
	Anonymous on Takes Two and Three of Trailer…
	David J. James on Takes Two and Three of Trailer…
	Anonymous on Takes Two and Three of Trailer…
	David J. James on Friday AI Day #3: Employing a…
	Anonymous on Friday AI Day #3: Employing a…
	Nemo on Deal with it!

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

Day: March 16, 2011

Huliganov’s first ever rant!

The Calamity Club: A Novel

This Inevitable Ruin: Dungeon Crawler Carl, Book 7

Like this:

Word Frequency Issues for Language Learners

Red Rising

Harry Potter and the Goblet of Fire (Full-Cast Edition)

Like this:

Related Articles

Tell all your friends and share the love!

Like this:

Related Articles

Tell all your friends and share the love!

Like this: