Language Thursdays: Parsing Chinese 1.0

Friday, May 7th, 2010

I was flying back from Chongqing recently when I was reminded of the very frustrating problem of reading Chinese. There was a movie on the cabin TV and it had a particularity: it carried subtitles in Chinese and English in parallel, in two lines of comparable font at the bottom of the screen.

As I watched I kept forcing my eyes to stick to the Chinese subtitles in order to exercise my reading (the sound was off) but it was pointless. Every single time, before I had finished reading the Chinese I already knew the meaning of the line anyway. The words in English just seemed to transmit their meaning even if I was not looking at them.

Reading Chinese

We already spoke last year about the problem of reading Chinese functionally. It is very important for advanced students of Chinese, because progress beyond a certain level depends largely on this ability. Many foreigners are able to read slowly and even do good translations of Chinese texts with the help of a cursor dictionary. But to read functionally, in my definition, is a completely different thing. It means being able to read all sorts of general texts as quickly and reliably as an average native.

卖抠 and the Tianya BBS Experience

Monday, May 3rd, 2010

Here is the post I promised analyzing the fate of our friend Michael on the Tianya BBS. Michael (卖抠) is the main character in the little Chinese story I wrote last week. I didn’t write the story particularly for this purpose, but once it was there I thought it would be a good idea to repost it to the Chinese BBS forums. Most of my readers here are not Chinese speakers, and I was curious to see how it fared among natives.

Many of us visit forums like Tianya or Mop to read the hottest topics and get a feel for what is trending on the Chinese internet. However, we don’t usually take an active role; at least, I had never tried posting before. This experience with Michael has taught me a few things about how these forums work, and in particular one of the biggest, Tianya. I know many people out there are interested in this, so here are the points I noted for your info:

Language Thursdays: Shanghainese Writing

Friday, April 30th, 2010

This week I have little time to do the Language post, partly because I have been busy writing a short story, partly because I have already discussed a good deal about language in other blogs. I take advantage of this to do the post with my final views on Shanghainese after the long discussion we had on the Wu blog.

The discussion started with an unrelated comment on a language learning site. But what really got me heated up is the painful realization that many Shanghainese speakers – or more precisely Wu speakers – not only don’t protect their beautiful language, but they are in fact actively destroying their rich cultural heritage out of pure ignorance.

If you have been reading me for a while, you probably know that I feel very strongly about languages, and particularly about disappearing ones. Perhaps part of the reason is that I come from a culture where we spend a significant amount of our resources to promote a minority language, so small and useless that it has about 1% of the speakers of Shanghainese. Stupid perhaps, but it is our language.

译不达意: Language Drama in 2 Acts

Thursday, April 29th, 2010

Here is my first short story in Chinese. The title is “Lost in Translation”, and it illustrates the potential consequences of bad Mandarin pronunciation. If you don’t read Chinese, I left a little summary in the comments, or else use G Translator to get the enhanced experience [1].

UPDATE: I have reposted this on Tianya to give it some air time among Chinese readers. By now the post has stabilized at around 3000 reads and 50 comments, and I don’t think it will go much further. It was a nice experiment in Chinese BBS propagation, and I will analyze the results soon.

译不达意

帮助迷失于中文中的老外找回爱之路

卖抠是我的好朋友。我们在美国老家是小学同班。虽然好几年没见面,但是我们的关系还是很密切。所以上个星期收到他的求救信让我很惊讶。他居然在中国! 还说他一个人无友可靠!

我马上回邮请他来上海我家住几天,看看能不能帮助他。

他写的让我太诧异了。更奇怪的是,居然我发现他会中文。我迫不及待的要他说这是怎么一回事。他说一年前,在我们美国的老家,因为那个金融风暴他的公司倒闭了。他失业了不知道该怎么办,有一天在路上看到了一个广告说“学会中文掌握未来!”就决定了报名上中文课。谁想到卖抠爱上了他的老师曹晓琳,一个来自江西的留学生。不到三个月他们就谈了恋爱。


NOTES:
  1. that is, you will be lost in the translation of Lost in Translation

Language Thursdays: Punctuation Hell

Friday, April 23rd, 2010

Today I just wanted to comment on the mysterious world of Chinese punctuation. It is a fascinating field in these times when everyone accuses the Chinese of discriminating against our foreign symbols. In fact, there is one kind of foreign symbol that is used in practically every sentence of modern Chinese: the periods, the commas, and all the rest of the punctuation marks.

As is natural in any language, when the Chinese decided to adopt these signs to clarify their script, they set their own rules for using them. There are many examples of punctuation marks that are apparently identical in Chinese and in Western languages but in fact have different meanings and uses. This is not the main point of the post, but I will pause briefly on one example that I think is fun.

The sighing mark

For some reason the (!) that is known in the West as the exclamation mark got translated into Chinese as 叹号, that is, the 叹 mark. This 叹 character is most commonly used today in expressions like 叹气, and its meaning is closer to sigh or acclaim than to exclaim. My theory is that this is the reason behind that quirk of Chinese netizens who write “!” marks in every second sentence.

Photo of the Weekend: The Stars Exams

Tuesday, April 20th, 2010

Saturday there was some alarming movement down the road. Starting early in the morning, masses of unidentified individuals concentrated near the intersection, partially blocking the traffic. They were visibly nervous, but their expression was firm: clearly they intended to hold the position. They had been there for almost 2 hours when I arrived with the camera.

When I approached, I saw they were parents waiting for their children. It was the “Shanghai City Common English for Children Stars Level Exam”, an official certification of English level for children in 小学 (primary school, 6-11 years old). From what I understood, it is organized by some bureau of the City of Shanghai or by one of many private companies. The levels are given in stars: 1 star, 2 stars, 3 stars, and the exam emphasizes oral communication.

Language Thursdays: Language Protectionism

Thursday, April 8th, 2010

In this week’s language post I want to speak about language protectionism. I am not sure this is the word I am looking for, but if you have been following the blogs for the last couple of weeks you probably know what I mean. It all started with this proposal last month to ban English words from the media in order to preserve the “purity of Chinese language”. Now it looks like the authorities have taken it seriously, and yesterday the TV channels were officially notified of the new language policies.

I am of the opinion that the blogosphere, including some respected linguist sites, has made a lot of noise for no reason. Or rather, for two reasons: one is the old problem of the Chinese messing up their PR (the word “purity” is a particularly bad choice in the context of culture). The other is that the China blogging scene is overwhelmingly American, and it is difficult for Americans to understand the problem of language colonization.

I am a big admirer of the openness and flexibility of the English language. Reading blogs like the Language Log I learnt to appreciate the descriptivist approach to linguistics (to study how a language is, instead of vainly dictating how it should be), and I believe this laissez-faire attitude has helped to make English the richest language in the World.

Language Thursdays: The Holy Fractions

Thursday, March 25th, 2010

This is a new feature in my blog. It is a follow-up to the initial Language and Culture posts last year, and I commit from now on to continue the series every Thursday that I feel like it. The idea is to post about those language curiosities that I encounter in my study of Mandarin and jot down directly at my study desk.

Professionals like those you find on the Language Log like to mark the difference between a linguist and a polyglot, and I completely agree with them. While I am a fan of linguistics, particularly of the descriptive kind, I have never studied the discipline seriously and I couldn’t tell a preposition from a palmiped. I am just a curious language learner, and I’ll stick to what I know.

This thought has discouraged me for a while from writing about language, considering the rich selection of linguist blogs already available. But then I thought, there is a certain level right between anthropology and linguistics, a space wide open to the speculation of non-specialists, where living in language immersion is as important as formal training.

I am referring to the observation of how Language and Culture interact with each other, and how a certain character and a view of the World get imprinted into the language, from the fossils of the remote past to the process ongoing even today. This is the point of the Language and Culture Series, which consists more of questions than of answers. Here is an example:

Did China wreck the Copenhagen deal?

Tuesday, December 29th, 2009

The Copenhagen summit has inspired some hot debate in the media, for the most part more related to international politics than to climate change. Some spectacular pieces like Mark Lynas’ in the Guardian have been followed by more moderate opinions, like those that appeared on Danwei and Inside Out, trying to understand the roles of China and the US in this affair.

But of all I have read on the subject, the best information around is still to be found on the Shanghai Scrap blog. He reminds us that climate change is not and cannot be the first priority for the government and the people of China today. It is an "uptown" concern, completely foreign to those who are still worrying whether their drinking water contains lead, or whether they will need a gas mask to breathe the Beijing air tomorrow.

But back to the question: Did China really wreck the Copenhagen deal? In other words, was there really a deal ready to be signed that China unexpectedly rejected, ruining the heroic efforts of the Western World led by President Obama?

What the hell happened in Copenhagen

Have you noticed that, when there is something really important at stake, governments organize summits as small as possible to get a meaningful deal, and only reluctantly accept new participants in the G groups? The climate summits are just the opposite: everyone is invited, carbon footprint and all, the more the merrier. The World has become so multilateral today… especially when multilateralism is in our own interest.

Climate change is always a great subject for politicking, because the success in the negotiations or the problems arising from the failures will not be felt during the political life of the protagonists. It is one of those subjects where the only real measure of success is the perception of the home public immediately after the meeting. And clever politicians don’t let the opportunity pass to fabricate a good story.

For the Obama administration the objective of the negotiations can be summarized as: ensuring that a deal is signed that is sufficiently meaningless to cause few problems with the industrial lobbies, and sufficiently powerful that the climate change enthusiasts are satisfied. Since this is obviously impossible, there is a plan B: ensure that whatever happens, it is somebody else’s fault. This is where China makes an excellent partner.

Partly for the reasons given in the Shanghai Scrap posts, and partly because it is not a democracy and it can control the information circulating internally, China is much less worried about the Copenhagen game than Obama. Free of internal pressure and faced with very mild international pressure, Chinese leaders will logically reject any deal that involves a sacrifice for their country. They will also neglect to give a coherent explanation in the language of the international media, offering a great target for post-Copenhagen accusations.

Now, I know Obama’s ambassador is not an expert in China, but I can’t believe he was so incompetent as to ignore the facts above. Obama himself was in China a month ago, and it is impossible that he didn’t know the obvious: that China was never going to sign an agreement forcing her to accept international inspectors with access to virtually every strategic industry, and with the power to expose to the World and to the Chinese public all the weaknesses of the Chinese system.

I wouldn’t go so far as to say that the Chinese and US positions were pre-arranged between Obama and Hu, but I think the American delegation, during and after the visit to China, has been more intelligent than most observers imagine.

Ever since the times of Kyoto, the US had been at the head of the evil carbon emitters. Obama had to set himself apart from his predecessor, and for the moment he has already worked a miracle: without making any major concession, the US has now become the World Champion of climate change policies.

In the meantime, the EU countries, the only ones that take this climate change thing seriously, are again pushed into the background because of their lack of credible leadership… and Obama, the clever American, has made the most of it for himself and for his country.

And in the meantime, the climate is changing…

One of the funniest accusations I have read in the media after the Copenhagen summit is that China has prevented the developed countries from signing a deal to limit their own emissions. This is so stupid that it could make it into a China Daily headline. How can China prevent the US/EU/Japan from signing a deal among themselves to reduce their own emissions?

No, seriously, if we are going to act against climate change, I would propose: what about reaching an agreement among the developed countries first, like we did for so many things before, and put it into practice even without China?

Yes, I know, to make a carbon reduction effective, all countries should participate. But the same could be said of the GATT/WTO and many other deals at the time, and this didn’t stop us from signing them and pushing China into them much later. Once the developed World is united, it is always much easier to lobby together for the respect of some standards, or to impose sanctions on non-compliant countries.

But why do all that, when it is easy to content the public with less?

China and the World Map of the Internet

Friday, December 4th, 2009

I was tinkering with some statistics last night, considering that strange idea of the Insularity of the Chinese Internet that we’ve been discussing lately. The expression itself is odd, because “internet” and “insularity” form an oxymoron, but you hardly notice these things when you live here. It’s normal routine in the land of socialist market economy.

Whatever we make of the phrase, the fact is that it comes up every time: whether we are speaking of language, media or politics, everything seems to point in that direction. The pictures below are my attempt to draw a World Map of the Internet to illustrate this insularity, using the data from the site Internet World Stats.

Here is the first idea I had: I got the statistics of all countries with more than 10 Million internet users, that makes 32 in total, from China to Morocco. Then I did an Excel chart where each bubble has an area proportional to the internet users of the country, and crucially, I filled the bubbles with code from the Matrix. Result: the World Map of the Matrix:

The World Map of the Internet Matrix
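
For anyone who wants to redo the bubbles outside Excel, the one detail that is easy to get wrong is that the area, not the radius, should be proportional to the number of users. Here is a minimal sketch in Python of that scaling. The function name `bubble_radii`, the `max_radius` parameter and the sample figures are my own placeholders for illustration; they are not the Internet World Stats data used in the chart.

```python
import math

def bubble_radii(users_by_country, max_radius=1.0):
    """Return a radius per country so that bubble AREA is proportional to users.

    Area grows with the square of the radius, so the radius must scale with
    the square root of the user count. The largest country gets max_radius.
    """
    largest = max(users_by_country.values())
    return {
        name: max_radius * math.sqrt(count / largest)
        for name, count in users_by_country.items()
    }

# Placeholder figures (millions of users), NOT real statistics.
sample = {"Country A": 400, "Country B": 100, "Country C": 25}
radii = bubble_radii(sample)

for name, radius in radii.items():
    print(f"{name}: radius {radius:.2f}")
```

With these placeholder numbers, a country with a quarter of the users gets a bubble with half the radius, which is what keeps the areas honest. Scaling the radius directly would make the big countries look far bigger than they are.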

One interesting thing in the map above is that Asia is already the largest internet area in the World. Amazing—but not really, after all, it has by far the largest population. And this is nothing compared to what is coming: with the growth of India and China the internet is going to be an Asian joint in the next few years. No hit will be really global on the net without them. Up to now, most people on the net were from developed countries, from now on the majority will be from developing ones. The close contact between our societies will have important consequences online and off. That is, supposing we really manage to connect.

But when we speak of the internet, it doesn’t make much sense to look at political boundaries. There is no such thing as border controls online; what really unites or divides peoples is culture. And in particular, the most important parameter is language: regardless of your national origin, what defines you as a user is the language you surf in. That is the reason why my browsing habits look more like this blogger’s than like anyone’s in my country: ESWN and I have completely different backgrounds, but we have in common our surfing languages.

So I looked up the statistics of the 10 most used languages on the internet, from English to Korean. This time I coloured the bubbles with flags, and I placed them roughly on the center of gravity of their community of speakers. The result is the map of Surfing Languages:

The World Map of the Surfing Languages

Still, the map is not great. Many of the speakers in the massive English bubble are actually Indians, Spanish should be both in America and in Europe, and Australia is completely out of the picture. Physical distance has no meaning on the net, even less than political boundaries. It becomes clear that geography is of little use for my purpose, so we might as well dump Gmaps and stick to the bubbles.

My new diagram looks like this, where all the major internet communities are represented together in a Cloud. We are all interconnected, and the only solid differentiator is language. Two people might share a hobby, like soccer, but they don’t go to the same websites if they surf in different languages. Most of the media and resources on the internet are not translated into other languages, but rather re-written and re-interpreted by native bloggers/journalists, who function as border control among the communities.

Improved World Map of the Internet: the Cloud

One of the things we see on the Cloud is that all the communities are touching each other. But I’m afraid this is not a very precise picture. Normally Russians don’t translate Japanese content, nor do the Portuguese translate Arabic content. The English language has a crucial role on the internet today, because in most cases it is through English that the rest of the languages communicate: most content is translated first into English and from there into the other languages. The English bubble, including users from all over the World, is the Center of the Internet.

Another problem with the Cloud is that it shows all the communities equally interconnected, which is not very realistic. Users who speak European languages are much more likely to read English. The Spanish community, for example, includes many Americans who surf English sites as much as their own language. Actually, most of the language bubbles share a significant part of their pixels with the English bubble, so we can represent the Map as a sort of Venn diagram:

Second Iteration: the Venn Diagram Map

We see the new Map is very different from the previous one. Now there is a cluster of Western languages that share a lot of content with English, two more languages that share a bit, Russian and Arabic, and then the three languages that form the core of the Asian internet today: Chinese, Korean and Japanese. And you may have noticed that I have drawn Chinese at a distance from the rest.

For various reasons that we will see, Chinese don’t use Facebook, or Twitter, or YouTube, or MySpace, or eBay. They don’t read Boing Boing or the Huffington Post, and they chat in their own QQ chatrooms. They rarely receive the viral emails that we receive, and instead they get others like this one. They have all the things that we have and some more, but they built them in parallel in their separate parcel of the internet.

Whereas the sizes of the bubbles above are based on quantitative data collected by a respected source, the positions are only decided by semi-informed feeling. Any reader could argue that China should not be so far right. There is Hong Kong, there are Chinese-Americans, and even mainland Chinese who do surf in English. And I will be forced to admit that the Venn Map is flawed, because it fails to show this.

But in a World as fast-changing as the Internet, position really means nothing. What holds today may be different tomorrow. What is really significant is the dynamics: which direction is China going, and how will the internet look in 10 years? Everybody agrees that China’s internet community is growing very fast, and that is natural. The worrying part is that it might also be moving away from the rest.

Third iteration: The Dynamic Map

Because in Western countries internet penetration is already very high and India is still lagging behind, in the next 10 years the Chinese internet will become almost as big as all the rest together. If it continues to diverge, it may grow into a parallel network, like a dark side of the moon, a vast, self-sufficient island that the government can cut off at any moment without most people inside it even noticing the difference. This defeats the whole idea of the www.

Whatever the real magnitude of the problem, it is clear to most observers that there is a disconnect between China and the rest of the Internet, and there are powerful forces pulling them further apart. Fortunately, there are also forces working to balance this, and the results in the coming years will very much depend on how those factors play against each other. Here is how my new map looks now:

The Forces of the Internet

As we saw before in this blog, some of the main factors that keep China separate from the World are the following, shown in red in the chart:

  • Linguistic, as we saw in this post, where we proved that the Chinese language is beautiful and unique in many ways, but it makes it very difficult for Chinese and foreigners to connect.
  • Cultural, in the broad sense of the word, meaning that the communities have such different views and values that they cannot understand each other. This includes the problems with the Media.
  • Political, the deliberate actions of the CCP in multiple forms, including Nannies, the Great Firewall of China (GFW) and directly arresting people, as we saw here.

And in green the main factors that go in the opposite direction. Here they are in detail, for the optimists to rejoice:

  • The growing number of bridge bloggers and other internet users that work to connect the two communities. These include not only the English language Chinablogs, but mainly Chinese people who translate foreign media and other content on the Chinese internet. From this humble blog I also did my bit against the GFW.
  • The post 90s and 80s generations that already dominate the Chinese internet. Their personal tastes in arts, music or cinema will probably be more international, and push them to connect with the World. This point is a matter of debate though, and some Westerners are very skeptical of the post 80s.
  • Business is one of the most important factors that link China to the World. Since the construction of the EU, it is no secret that commerce can achieve the most ambitious goals in World Peace, so whatever your take is on those business minded Chinese, they are probably the main force that is still keeping the Chinese Island connected and holding the World Wide Web together.

What do you think? 你有什么想法?

Do you think I am exaggerating? Or is the problem even worse than this? Any factor I missed in the Internet Maps? Internet friends: you are the pixels inside the coloured bubbles, you know all about this World because it is your home: comment and help me improve my Map!

你觉得这很夸张吗?还是认为问题写得还不够严重?你知道我在互联网地图里忽略了哪些元素吗?网友们:你们是小圈里面的像素,那里就是你们家,帮助我改进我的地图!

Chinese the most Difficult… (and 3)

Tuesday, November 24th, 2009

In the first two posts of this series, we saw that Chinese is the last language in the World to maintain a complete set of independent vocabulary roots and a non-phonetic script to represent them, what we might call a separate Word System. For this reason I argued that Chinese may be the most difficult language in which to obtain full fluency, regardless of the linguistic background of the student.

But there are more interesting implications than the mere difficulty of the language, in particular cultural and political ones, because the refusal to use loans and a phonetic script is the result of conscious decisions. There is nothing in the language itself that forbids the import of foreign words or the use of an alphabet; indeed, there are already some exceptions, direct loans in current use that are written in Latin letters, such as DVD or KTV.

Chinese has a parallel Word System diverging from the rest of the World, and the government has an active role in the maintenance of this system. However, this policy is not unilaterally imposed from above. It is certainly encouraged by the education system, but Chinese speakers seem to follow it naturally and often prefer Chinese roots even when not supervised. This is in contrast with the situation in many countries where the system tries to protect local terms, only to find that people still prefer “email” to “courrier électronique”.

Anyone living in China long enough realizes how aware Chinese are of their long history and their status as a different civilization. This discourse is irritating for Westerners, because it is too reminiscent of ultra-nationalistic creeds back home. But it has one essential difference from those creeds: in the case of China, it is true. As we said before, China is justified in seeing itself as a cradle of civilization, and it is the only such culture that has survived practically independent from the World mainstream till modern times. This cultural awareness is the main reason for the preservation of the language as we know it, surviving different regimes and even periods of chaos.

When we study Chinese we are not merely learning another language, we are learning the words of a parallel World, the last independent system of vocabulary and writing that humanity still has. It is the most similar experience available on Earth to learning the language of another planet. If Chinese is really so hard to learn, this should provide enough motivation for anyone to try it.

Political considerations

Mandarin is not in itself a very difficult language; what makes it hard is its complex Word System, which is for the most part not essential (that is, the language could still exist with loans and an alphabet). This System makes it hard for foreigners and Chinese to communicate, and it is a serious obstacle in the education of the Chinese. In the last century, development has been the main priority of China in order to recover her past glory, and inefficient relics have been torn down without blinking, just like the Walls of Beijing. Chinese words and characters are the last of those obstructive monuments to remain, and by far the oldest of all. It is a miracle that they have survived till today.

The invention of convenient methods to input characters on a keyboard has made the future of the characters seem more secure, but their permanence is by no means ensured. Many famous linguists have argued for the use of pinyin as the main written language and the elimination of the characters from daily life, not least of them Lu Xun, or the late John DeFrancis. Much as I admire these men and their work, I am completely opposed to their position as a matter of principle. I don’t suppose anyone will believe me in this age of economists, even less in the China of the new philosophies, but I have this to say: Efficiency is not a supreme value. In fact, it is not even a value in itself, but just a means. And a sad means it would be to recover the greatness of China, if there were nothing left to recover.

I think it is clear to most Chinese today that their Word System is too precious to abandon for the sake of efficiency. However, some reasonable concessions can be made which might ensure the very survival of the System in the long term. In particular, the acceptance of foreign loans for new technical words might facilitate the access of Chinese to foreign research and the incorporation of foreign talents when the real Chinese brain-drain starts in earnest. The complete acceptance of Latin script to represent foreign Proper Nouns phonetically (which is already done informally) would also be a step towards efficiency without sacrificing the heart of the system, and would be of great help for all the Chinese trying to learn English.

Apart from the practical issues considered, no less important is the mentality underlying the Chinese Word System. The growing common vocabulary in all the languages in the World represents the recognition by most cultures that there is a large part of common human culture, and that, since this part is only going to become larger with the progress of technology, the sensible solution is to adopt a common language to communicate it. By deciding to stay apart from this system, the linguistic choice of China represents a stance opposed to the rest of the World, and in a certain way it perpetuates the traditional isolation of the Middle Kingdom even in the age of Global interconnection. The insularity of the Chinese internet community and the misunderstandings between cultures that have arisen from it are, to some extent, a consequence of this choice.

The part played by the language in China’s relations with the World is probably not of the first importance. But even today this part is not negligible, and with the advances in communications, nobody knows how vital it will become in the future. Ultimately, it is only up to the Chinese to decide what language they want for themselves. We can only wait and see, and hope that they find a way to stay connected with us, while preserving their unique heritage of Words.

Chinese most Difficult Language in the World (2)

Monday, November 23rd, 2009

Last Friday I wrote a very long post where I ended up including too many ideas. The main point got a bit obscured as a result, but it was simply this: that vocabulary plays an essential role in learning a language, and that because of this Chinese is not only extremely difficult at an advanced level, but also growing more difficult with time.

I don’t suppose this is groundbreaking research, but it is interesting because most people are not aware of it, and also for its implications on the border between language and politics, two fields we like to cultivate in this blog. Here is the argument in full with conclusions; for examples and details see the previous post and its comments:

  • To learn a new language the main knowledge required is in three areas: grammar, phonetics and vocabulary. Grammar and phonetics differ essentially from vocabulary in that the first two are rules applicable to infinite cases, whereas the latter is raw data. We can call them the Code and the Data elements of the language. The Code elements are finite and not growing. The Data element is practically infinite and growing, to the point that it is not completely mastered even by native speakers.
  • When studying a language, the Code elements play an essential role at the basic and intermediate levels, but at the advanced level the real obstacle to communication, and therefore to progress, is Data. For example, advanced students of German may sometimes use the wrong declension, and in Spanish they may fail to differentiate the “rr/r” sounds. These things tend not to hamper communication because human languages are highly redundant. From context, I would never confuse “pero” (but) with “perro” (dog), even if a speaker pronounces them alike. Ultimately, imperfections in the Code elements amount to the same as having an accent: most of the time they are only relevant as metadata.
  • But while Code above a certain level is highly redundant, Data remains essential at every level. Borrowing from this great article: the phrase “Jacuzzi is found effective in treating Phlebitis” is meaningless when either or both of the nouns are unknown. A single missing word can often obscure the meaning of a whole paragraph or article.
  • The number of words used passively in real life far exceeds the typical standard lists of language levels. This is because semi-specialized words—such as ionic, jacuzzi or matrix—are not included in vocabulary lists as they are considered too rare. Certainly each of these words is rarely used, but there are so many of them that as a whole they are actually very often used. This Data element is so large that it cannot be memorized in a classroom, and the only way to acquire it is through many years of immersion.
  • The reason most language learners never notice this problem is that they are “cheating”. In most languages in the World, this high level vocabulary is practically identical and doesn’t need to be learned. For each language there is a certain limit level above which most modern words are international and the Data is no longer specific to the language.
  • This limit level of vocabulary convergence is different for every language, but it depends less on the language family or geographical origin than on the size and development of the community of speakers. That is the reason why even non-Indo-European languages like Basque are extremely easy above the intermediate level: the community is not big enough to support complex terms, and all higher Data is adopted from International words. Most people tend to misunderstand the concept of language families and attach too much importance to it, and they come up with absurd lists like this one.
  • The internationalization of vocabulary is growing with the advances in telecoms and globalization, especially since English has become the only language of scientific research. There is little point in inventing new Swedish terms in science, for example, when all the scientific community are reading/writing their papers in English. Often, in spite of political efforts to promote a local vocabulary, the economics of language revert the higher Data back to Internationalese.
  • There is only one language in the World that for historical, political and demographic reasons has remained an exception to this trend: that language is Chinese (Mandarin, Cantonese or others, the difference is irrelevant here). It constitutes a parallel system of high level Data that has very few words in common with the rest of the World. Japanese and Korean are partial exceptions in that they draw from both the Chinese and the International System, but modern words are increasingly International and these languages are converging with the rest.
  • In addition to this, Chinese has a ridiculously difficult writing system, unique for its lack of a functional phonetic script. This compounds the vocabulary problem: not only are there more words to learn than in any other language, but each word contains much more information, as it needs to be associated with its corresponding characters.
  • Moreover, since there is no standardized way to transcribe foreign Proper Nouns, even names of places and persons tend to be “translated” into Chinese, sometimes departing completely from the original phonetics and becoming Chinese Names in their own right. This adds to the already massive Data element in the Chinese language.
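
The point in the list above about semi-specialized words (each one rare, but together very frequent) can be illustrated with a little arithmetic. The sketch below is only a toy model: it assumes that word frequencies follow an idealized Zipf law (frequency proportional to 1/rank), and the vocabulary size and list size are hypothetical round numbers, not measurements from any real corpus.

```python
# Illustrative only: assumes an idealized Zipf law (frequency ~ 1 / rank).
# Both sizes below are hypothetical round numbers, not corpus measurements.
VOCABULARY_SIZE = 100_000  # assumed number of distinct words in real-life use
CORE_LIST_SIZE = 5_000     # assumed size of a "standard" vocabulary list


def zipf_weights(n):
    """Return unnormalized Zipf weights 1/rank for ranks 1..n."""
    return [1.0 / rank for rank in range(1, n + 1)]


weights = zipf_weights(VOCABULARY_SIZE)
total = sum(weights)

# Share of running text covered by the words on the standard list.
core_share = sum(weights[:CORE_LIST_SIZE]) / total

# Share of running text made up of everything outside the list.
tail_share = 1.0 - core_share

# Share of the single most frequent word that is NOT on the list.
most_common_tail_word_share = weights[CORE_LIST_SIZE] / total

print(f"Covered by the standard list: {core_share:.1%}")
print(f"Made up of 'rare' words:      {tail_share:.1%}")
print(f"Most common 'rare' word:      {most_common_tail_word_share:.4%}")
```

Under these assumptions, the words outside the list make up roughly a quarter of all running text, even though no single one of them appears as often as once in ten thousand words. Real languages will not match these numbers, but the shape of the argument is the same: the tail is made of rare words, yet the tail as a whole is not rare.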

All this takes us to the conclusion: Chinese is the most difficult language to learn at a high level, regardless of the origin of the student.

This is particularly interesting because up to now the only right answer to this question was: “it depends on your own mother tongue”. With the possible exception of Japanese/Korean students, this post argues that Chinese is actually the hardest for everyone else. Conversely, it is also very difficult for Chinese speakers to learn other languages, although this is mitigated by the fact that other languages do have functional phonetic scripts.

Another interesting conclusion: Chinese is not only difficult, it is actually growing in difficulty.

As the World grows more interconnected and technology occupies a more important part of our lives, new semi-specialized vocabulary takes up an increasing part of everyday language. Expressions that refer to international concepts such as “spam” or “plasma TV” increasingly take the place of expressions referring to local cultural heritage. In this sense, we can say that all languages in the World are converging, while Chinese is an island diverging from all the rest.

Then there are the political conclusions that we can draw from this, but I am committed to writing shorter posts, so we will leave that for the next day. Comments and corrections to my arguments above are welcome.