GFW

...now browsing by tag

 
 

What is going on with Google in China?

Wednesday, January 13th, 2010

images First of all, read this article posted on the Google official blog. It is all you need to read for the moment because there is no more first hand info out there yet.

It was published some 5 hours ago. What it says in a rather muddled way is essentially:

  1. That Google has detected attacks resulting in the theft of intellectual property, in particular on Gmail accounts in China, not through Google servers but just hacking users computers.
  2. That Google has evidence that similar attacks happened also to other major Western companies in various industries.
  3. That the information targeted was related to advocates of human rights in China.
  4. That because of all this, Google is not willing to continue censoring results on Google.cn and that “we will be discussing with the Chinese government the basis on which we could operate an unfiltered search engine within the law, if at all. We recognize that this may well mean having to shut down Google.cn, and potentially our offices in China.”

This is very surprising news and it is quickly making the rounds of the World Media. Here are some preliminary midday break thoughts. Excuse the Bullet points but I am too excited to do real prose:

Regarding the message and intentions

  • The message sounds inconsistent, because it is complaining against 2 completely different problems. 1- The email hacks affects many companies and it is not necessarily done by the Chinese authorities, neither it is directly related to Google. 2- The Google.cn Search Engine manipulation or SEM that we already saw here.
  • By involving other Western companies Google is apparently sending a signal to them that either they support Google in its plight or else they will be mentioned by name and bear with the PR consequences of that (G is dreaming if it thinks they will follow, as if Chemical companies have much left to loose in this department already)
  • Nowhere in the message it says there is evidence that Chinese authorities are responsible for the email hacks. While this might seem obvious, in Western culture there is a presumption of innocence to apply. The normal sequence is first to seek justice, and only when the authorities refuse justice then complain.
  • You may believe or not in the “non evilness” of Google, but for a company that is handling so much of our personal information, this is not completely disinterested. Non-evilness is Gold for the G, and the minute the World stops trusting Google, the whole expansion plan of of Google apps+phones goes down the drain.
  • It is not impossible then that a calculation is involved: by standing up to China, Google can gain more credit points Worldwide than what it loses leaving China, where its operations are probably not very profitable today. With the new Google phone, the battle to rule the Tech World is at its peak, and goodwill is going to be an important weapon against Apple and Microsoft.
  • Is Google essentially Non-evil, or is it Non-evil just because it suits its business? Is a lion evil because it eats a gazelle, is an oil company evil because it gives you products you want to buy? Such philosophical questions people will be asking today, but I think there is no point in going down that way. Google is a corporation, not a charity, and we should judge its actions first from that perspective.
  • For a company to try to “change the World” on its own is completely out of scope, it is pointless, it leads to its ruin, and it amounts to pursuing political objectives for which it has no legitimacy. If Google doesn’t want to have Google.cn censored, then they are right to force this, but coming up now with a sort of “retaliation” to the Chinese government for hacking activist emails is a different thing altogether.
  • In conclusion, the message sounds inconsistent and improvised, it is difficult to believe that it comes from a careful calculation.  I wonder who really writes that blog, but if this really comes from Google executives it is scary, especially from the shareholders POV. Regardless of the real intentions of Google, my first assessment is that the post is a BAD decision.

Some more thoughts on the consequences coming in my next post.

China and the World Map of the Internet

Friday, December 4th, 2009

I was tinkering with some statistics last night, considering that strange idea of the Insularity of the Chinese Internet that we’ve been discussing lately. The expression itself is odd, because “internet” and “insularity” form an oxymoron, but you hardly notice these things when you live here. It’s normal routine in the land of socialist market economy.

Whatever we make of the phrase, the fact is that it comes up every time, whether we are speaking of language, media or politics,  all seems to point in that direction.  The pictures below are my attempt to draw a World Map of the Internet to illustrate this insularity, using the data from the site Internet World Stats.

Here is the first idea I had: I got the statistics of all countries with more than 10 Million internet users, that makes 32 in total, from China to Morocco. Then I did an Excel chart where each bubble has an area proportional to the internet users of the country, and crucially, I filled the bubbles with code from the Matrix. Result: the World Map of the Matrix:

SP32-20091204-143947

The World Map of the Internet Matrix

One interesting thing in the map above is that Asia is already the largest internet area in the World. Amazing—but not really, after all, it has by far the largest population. And this is nothing compared to what is coming: with the growth of India and China the internet is going to be an Asian joint in the next few years. No hit will be really global on the net without them. Up to now, most people on the net were from developed countries, from now on the majority will be from developing ones. The close contact between our societies will have important consequences online and off. That is, supposing we really manage to connect.

But when we speak of the internet, it doesn’t make much sense to look at political boundaries. There is no such a thing as border controls online, what really unites or divides the peoples is culture. An in particular, the most important parameter is language: regardless of your national origin, what defines you as an user is the language you surf in. That is the reason why my browsing habits look more like this blogger’s than like anyone in my country: ESWN and I have completely different backgrounds, but we have in common our surfing languages.

So I looked up the statistics of the 10 most used languages on the internet, from English to Korean. This time I coloured the bubbles with flags, and I placed them roughly on the center of gravity of their community of speakers. The result is the map of Surfing Languages:

SP32-20091204-151433

The World Map of the Surfing Languages

Still, the map is not great. Many of the speakers in the massive English bubble are actually Indians, Spanish should be both in America and in Europe, and Australia is completely out of the picture. Physical distance has no meaning on the net, even less than political boundaries. It becomes clear that geography is of little use for my purpose, so we might as well dump  Gmaps and stick to the bubbles.

My new diagram looks like this, where all the major internet communities are represented together in a Cloud. We are all interconnected, and the only solid differentiator is language. Two people might share a hobby, like soccer , but they don’t go to the same websites if they surf in different languages. Most of the media and resources on the internet are not translated into other languages, but rather re-written and re-interpreted by native bloggers/journalists, who function as border control among the communities.

image022

Improved World Map of the Internet: the Cloud

One of the things we see on the Cloud is that all the communities are touching each other. But I’m afraid this is not a very precise picture. Normally Russians don’t translate Japanese content, neither do Portuguese translate Arabic. The English language has a crucial role on the internet today, because in most cases it is through English that the rest of the languages communicate: Most content is translated first to English and from there to the other communities. The English bubble, including users from all over the World, is the Center of the Internet.

Another problem with the Cloud is that it shows all the communities equally interconnected, which is not very realistic. Users who speak European languages are much more likely to read English. The Spanish community, for example, includes many Americans who surf English sites as much as their own language. Actually, most of the language bubbles share a significant part of their pixels with the English bubble, so we can represent the Map as a sort of Venn diagram:

SP32-20091204-184148

Second Iteration: the Venn Diagram Map

We see the new Map is very different from the previous one. Now there is a cluster of Western languages that share a lot of content with English, two more languages that share a bit, Russian and Arabic, and then the three languages that form the core of the Asian internet today: Chinese, Korean and Japanese. And you may have noticed that I have drawn Chinese at a distance from the rest.

For various reasons that we will see, Chinese don’t use Facebook, or Twitter, or Youtube, or MySpace, or eBay. They don’t read Boing Boing or the Huffington post, and they chat in their own QQ chatrooms. They rarely receive the viral emails that we receive, and instead they get others like this one. They have all the things that we have and some more, but they built them in parallel in their separate parcel of the internet.

Whereas the sizes of the bubbles above are based on quantitative data collected by a respected source, the positions are only decided by semi-informed feeling. Any reader could argue that China should not be so far right. There is Hong Kong,  Chinese-Americans, even mainland Chinese who do surf in English. And I will be forced to admit that the Venn Map is flawed, because it fails to show this.

But in such a fast changing World like the Internet, position really means nothing. What holds today may be different tomorrow. What is really significant is the dynamics: which direction is China going, and how will the internet look in 10 years? Everybody agrees that China’s internet community is growing very fast, and that is natural. The worrying part is that it might also be moving away from the rest.

image3

Third iteration: The Dynamic Map

Because in Western countries internet penetration is already very high and India is still lagging behind, in the next 10 years the Chinese internet will become almost as big as all the rest together. If it continues to diverge, it may grow into a parallel network, like a dark side of the moon, a vast, self-sufficient island that the government can cut out at any moment and most people inside it don’t even notice the difference. This defeats the whole idea of the www.

Whatever the real magnitude of the problem, it is clear to most observers that there is a disconnect between China and the rest of the Internet, and there are powerful forces pulling them further apart. Fortunately, there are also forces working to balance this, and the results in the coming years will very much depend on how those factors play against each other. Here is how my new map looks now:image4

The Forces of the Internet

As we saw before in this blog,  some of the main factors that keep China separate from the World are the following, shown in red in the chart:

  • Linguistic, as we saw in this post, where we proved that Chinese language is beautiful and unique in many ways, but it makes it very difficult for Chinese and foreigners to connect.
  • Cultural, in the broad sense of the word, meaning that the communities have so different views and values that they cannot understand each other. This includes the problems with the Media.
  • Political, the deliberate actions of the CCP in  multiple forms, including Nannies, the Great Firewall of China (GFW) and directly arresting people, as we saw here.

And in green the main factors that go in the opposite direction. Here they are in detail, for the optimists to rejoice:

  • The growing number of bridge bloggers and other internet uses that work to connect the two communities. These include not only the English language Chinablogs, but mainly Chinese people who translate foreign media and other content on the Chinese internet. From this humble blog I also did my bit against the GFW.
  • The post 90s and 80s generations that already dominate the Chinese internet. Their personal tastes in arts, music or cinema will probably be more international, and push them to connect with the World. This point is object of debate though, and some Westerners are very skeptical of the post 80s.
  • Business is one of most important factors that link China to the World. Since the construction of the EU, it is no secret that commerce can achieve the most ambitious goals in World Peace, so whatever your take is on those business minded Chinese, they are probably the main force that is still keeping the Chinese Island connected and holding the World Wide Web together.

What do you think? 你有什么想法?

Do you think I am exaggerating? Or is the problem even worse than this? Any factor I missed in the Internet Maps? Internet friends: you are the pixels inside the coloured bubbles, you know all about this World because it is your home: comment and help me improve my Map!

你觉得这很夸张吗?还是认为问题写得还不够严重?你知道我在互联网地图里忽略了哪些元素吗?网友们:你们是小圈里面的像素,那里就是你们家,帮助我改进我的地图!  U5KMU63NGPP2

America against the GFW

Saturday, August 15th, 2009

I just learn from Reuters that U.S. is testing system to break foreign Web censorship. This is the first news I have that the US government is trying to outsmart the GFW. Fantastic, after the anonymous hackers now it is the most powerful state in the World that will confront the dreaded wall. The war of the internets is here again.

I am not usually supportive of external efforts to force political change in China. Democracy is like love, I say, it has to come from the inside if it is going to be true. But when it comes to breaking the GFW any help is welcome. GFW censorship is a shameful activity and it amounts to lying to the people, China deserves better than that.

Now, the only problem I see here is that the program is not looking great. I will not criticize the technical part of it, I am sure Mr. Berman has hired the best brains in Silicon Valley to ensure the solution is sound. But like in all internet applications, it is the final user’s point of view that has the last word, and from this perspective I have strong objections. Here is why I think it is dumb:

The real challenge of the GFW is not for final users to be able to access information on the web, this is already done in many easy ways, not to mention that RSS feeds are not censored and any blocked website can be read simply by opening its feed on Google Reader.

No, the real challenge is for content providers, including dissident bloggers, Chinese NGOs, discussion forums, etc. to be able to serve their content in a way that is immediately accessible to all. Because the objective of those sites is NOT to be read by their fans, but rather to spread the word into the general population. And the general population has been proven once and again too lazy to use the GFW bypasses linked above, unless it has a definite purpose to use them (usually porn).

In a nutshell: “Voice of America” is offering a service for fans of VOA to subscribe and access content that they can already access anyway. What those guys need is not to access content, but to SERVE it.

So it looks already like GFW 1 – USA 0.  I wish I could say good try, but really they are not even trying. What is this, a VOA publicity stunt? Whatever, all agencies have a budget to spend, I guess.

Instructions to deal with the GFW

Wednesday, July 8th, 2009

I have written a lot recently about the Great Firewall of China (GFW). I had my site blocked for two weeks and this inspired some frustrated posts until eventually I worked my way through the Wall. The good news is I learnt a lot in the process, and now I can write some tips to help others with the same problem. Anyone who has a website hosted outside China can use these instructions to try to keep it accessible here. Here is the index, follow the links for details.

Prevention – Try to stay out of trouble

From the beginning when you set up your website, there are a series of measures you can take to reduce the probabilities of getting blocked and/or making your life easier if this happens. If you follow these points hopefully you will never need get to the next Section.

  • Be careful with what you publish. >>>
  • Try to avoid writing GFW keywords. >>>
  • Choose where you want to be hosted. >>>
  • Choose a good, flexible hosting service. >>>
  • Host your blog/site on a subdomain. >>>

Action – When trouble is at your door

Then one day you realize that your Chinese readership has fallen to zero, and you wonder why you can’t open your website from China. If this happens to you, these are the simple steps to follow:

  • Make sure it is really the GFW. >>>
  • Check if there is an IP block. >>>
  • Find out if the target is really you. >>>
  • Check if there is an URL block. >>>
  • Move to a new IP address. >>>
  • Change your URL and Redirect. >>>
  • Check that you don’t have links. >>>
  • Try to eliminate the keywords. >>>
  • Take it easy, and send feedback. >>>

Notes and Disclaimers

  • Don’t forget to read the party of the first part >>>

Click to continue »

Normal Service Resumed

Sunday, July 5th, 2009

After a terrible weekend in front of the computer I have managed to re-open my site on a new URL. I am fed up of the internet right now and I am going out to enjoy the Shanghai Sun for a few hours.

I will try not to write more about this for a while, one never knows who is watching, and I don’t want to sound like I am rubbing it in. For those of you whose sites have the same kind of problems send me private message, I might have some useful tips.

Actually, the general idea is very simple. There is not such a thing as internet censorship, it is just a well-known bug on the internet that apparently trips on some keywords and then makes some URLs and some IP addresses inaccesible…

All very annoying, these internet bugs! ehzrg7f6xk

UPDATE: This afternoon/evening I still had an intermittent block, but I have finally figured it out. Some of the elements in my blog were still pointing to the old /eng/ directory, and for some reason my Regex search was unable to find them. Thanks to my friend and IT genius Giom I have found a firefox add-on to see the traffic from my site, and corrected all the references. Blog should be completely “debugged” now.

Crossing the GFW and one interesting Idea

Friday, July 3rd, 2009

This week I had some interesting conversations on other blogs, mostly regarding my state of internet blockdom and the possible actions that a webmaster can take to solve this problem. I will share here some conclusions that might be of interest.

Just to make sure we don’t forget anything, I will go first over the most obvious points:

1- If you are any kind of commercial undertaking, or if you depend on your site for a living, please pay attention to what you publish. Sites in English have quite some leeway to publish political content, but the bigger you get the tighter the line will be, and any kind of political activism can get you down.

2- The worst position is when you are big enough to attract the censors attention, but small enough to be insignificant in the general scheme of the internet. Say the BBC gets blocked: this makes a lot of noise, and eventually the Chinese government feels the pressure to reopen it. Inversely, if you stay small enough, you will never be blocked regardless of what you write. When you are in the middle, like these sites, the risk is biggest.

3- Finally, if you are already blocked, you can try your luck at 9 Dongdajie, Qianmen, Beijing, as a commentator suggested (this is the address of the Beijing Public Security Bureau) or any official body of your choice. I have no experience with this, and I am very skeptical about the results, but it is not impossible that the legal system works once in a while. We have seen stranger things in China.

Getting through the block

Once you have gone through the points above and decided that none applies to you, here are the typical solutions for users to get through the Wall. There are many of them, so I will just list the most well known, such as: lists of free web proxies, ad-supported or fee-based VPNs, networks like Tor or activist software like Freegаte*.

I will not go over each of these because you can find lots of information on the internet already, but I have tried a few of them and they all more or less do the trick: you can open in China sites that have been blocked by the GFW. These solutions are well known to the Chinese netizens users, as you can see in this Chinese blog which has even more options, such as giving a SSH number and code to your users.

So, you might think, what’s the big deal with the Great FWall? It is full of wholes big enough for a whole horde of Mongols, like it’s always been.

You are right, and yet, the GFW is a powerful system. For anyone who had a website blocked, it is very easy to see the impact on the stats of incoming hits from China. Depending on your size and content, it can be down to a 25%, and if you remain blocked for some time, chances are most readers will not find their way back to you. My guess: a combination of laziness, hi-tech aversion, and the excess of info flowing on the net means that a missing site is quickly forgotten, and few go through the trouble of opening a proxy for you. Click to continue »

The War of the Internets

Thursday, July 2nd, 2009

So there you are. July 1st passed without any major incident and the famous Anonymous Netizens didn’t show up. I am as blocked as ever and the Nutty Nannies of China are still running loose on the web, unimpressed by the headless suit .

I cannot say it is a surprise, frankly the chances of anything significant happening were one in a wan*. As I said in a previous post, these anonymous Netizens are not Chinese, but Western, from the mostly American chan boards, in particular chan888 (no link here, I have enough trouble as it is with the GFW to get me the hackers as well). These guys surely had some Chinese to advise them, but the initialive looks entirely Western, and the style was very similar to their -quite succesful- attacks on Scientology.

There are at least 2 reasons why their attack on the Chinese censors was destined to be a failure: In the first place, China is not a website that you can hack, it is country, and pretty massive at that. You could manage to confuse the GFW for a while with some coordinated attacks, but that would not change the – mostly offline – internal censorship of Chinese websites, which is what really matters here.

Secondly, the kind of attacks that the Anonymous do are not applicable in China, because they are based on giving negative publicity to the victim. But this country is already such an accomplished expert in creating PR trouble for itself, and in the most prominent media in the World, that one occasional attack by hackers, no matter how succesful, would hardly make any difference.

The China Internet Isle

But there is one fundamental reason why these Western initiated internet attacks have no hope of succeding here. The internet is a very powerful tool of social mobilisation, but only through the voluntary participation of the netizens in one community. The power lies not on the web itself, nor on its pirates, but on the millions of users that get connected for a common cause.

Let me remind you here of that misunderstanding that got my blog blocked in the first place: A famous New York newspaper took me for a Chinese hero fighting for Liberty, and then the censors of China agreed with it. Following that glorious moment of Chinayouren, I got some fellow fighters offering all sorts of contributions to the cause, such as banners to hang on websites. You can see some in the comments here .

It became clear to me then the little awareness in the West of the significance of the Chinese internet. The Chinese internet is not only the single largest national community of netizens, it is also a largely isolated island, with very few connections with the outside World compared to its size.

Partly for language reasons, partly because of the GFW, but I guess mostly because of cultural differences, the Chinese live on a parallel dimension of the web. They don’t use the facebooks, or Youtubes, or Yahoo news, or IRC chats. They have their own means to communicate on the internet, and this largely excludes interaction with people outside China.

And that is where the problem comes. It is the same situation for a company seeking to advertise itself on the Chinese internet as for a social movement who tries to push its way here: you need to be inside the island to have any impact. You need to understand the Chinese and they need to be part of your idea, and only when the wans of Chinese feel that this movement belongs to them, only then the internet can become the most terrible of weapons.

So yes, I do think the internet has still its last word to say in China. But I am pretty sure that when this happens, it will be a Chinese initiative.

*I coined this the other day. Wan is 10,000 in Chinese. And yes, I find it hilarious.

Firefox 3.5 Finally

Thursday, July 2nd, 2009

imagesIt was about time Mozilla issued their new revision. Ever since Firefox emerged as the big challenger of Explorer many of us switched to this swift browser with the unlimited add-ons.

As time passed, we grew so used to all the fox capabilities that it became normal for an internet browser to perform the most various functions: Firefox was my Chinese dictionary, my anti-GFW proxy, my image editor, my wikipedia link, my financial consultant, my bookmark and my fluffy nail brush. It was Jesus, and it was perfect.

And then, suddenly,the fox got old. The add-ons started to weight on its worn out bones, and one day we found ourselves waiting 10 seconds for the browser to open. Never again. But what could we do, we were addicted to the add-ons, and way too proud to fall back on Explorer.

For months (years) on end there was no solution forthcoming, and no amount of reinstalling helped to solve the issues. Then the shrewd guys at Google jumped at the chance. They launched the new Chrome, a browser that actually does nothing at all except – guess what – browse at the speed of light. The first day I downloaded it “just to give it a try”, and after that test flight I never opened the Fox again. Instead, I got myself a dictionary.

So now the Fox is back. Its 3.5 iteration feels certainly much faster than the previous, and it supports, if not all, at least the most important of my add-ons. The influence of Chrome is very obvious, with new functions like the “private” surfing, one-click bookmarking and that little button on the right hand side of the tabs that comes so handy to open a new one.

As for the speed, I seriously doubt it can beat Chrome (look at the announcement, Mozilla compares it with Explorer, but avoids mentioning Google). But as long as it is reasonably fast and it continues to brush my nails, I think it deserves another chance. Let’s see how it goes.

I am back with the fox. I hope it rocks.

GFW 1st July: Waiting for my Anonymous saviours

Monday, June 29th, 2009

So OK, I am censored, but why NOW?

I mean, I haven’t been writing anything for ages, is the Propaganda Department punishing me for being lazy? Has some big Chinese BBS  linked to me recently, is Uln hot now? As I was looking around for an answer, I found out that the Peking Duck blog was blocked more or less at the same time as mine, and it was asking the same kind of questions.

That is when I got this idea of the LIST, which I wrote on their comments. Everyone knows that GFW is unpredictable, it starts and stops and nobody ever knows why, if you don’t believe me look at this funny chronology. But this random behaviour usually affects only some websites, and never touches others. So necessarily, the guys at the GFW Control Deck are working with a number of websites that have been shortlisted beforehand.

I am quite sure of the existence of this LIST, because I noticed very precisely the moment my blog was shortlisted. It happened earlier this year with that political post that was picked by the New York Times. Since then I had strange things happening, with miniblocks now and then, a perceived slower speed loading in China, and, of course that particular Chаrter 08 post has been blocked ever since (even as the rest of the blog remained open). Also, look at that weird comment in Chinese in that post, where the guy says I am interfering in China’s internal affairs… could be a troll. Or could be not.

Anyway, my guess is that this blog and the PKD’s block have probably nothing to do with our recent activity, but rather with the tense atmosphere in the censors office these last weeks, after the Green Dam fiasco and the Google affair. At some point someone must have said: “hey, let’s block some more sites”, and we were unfortunately the next names on the LIST. And, unlike Google, I am afraid sites speaking specifically of politics are blocked permanently, such as this one, or this one. I hardly imagine the censors taking the trouble to monitor our blogs every day to see if we are behaving better. So my guess is, both for me and for PKD, that the block is here to stay and there is no solution.

… or perhaps there is?

july1anonymous.jpg

Click to continue »

CHINAYOUREN Blocked

Monday, June 29th, 2009

censored

So guess what now:  I am blocked.

I am banned, prohibited, harmonized, river-crabbed. Censored, in short, by the Great FireWall of China. If you are reading my blog now and have not noticed anything strange, it is because either:

1- You are reading the blog from outside China and therefore you are not going through the GFW (Chinayouren is hosted outside China, you are on the same side of the Wall). Or else,  2- You are reading this from mainland China and you are using some means to connect anonymously and pass through the wall. Most probably a web proxy or  VPN, which is what I am using now. Click to continue »

China’s Internet Censorship Explained

Thursday, January 22nd, 2009

Since I started posting about censorship I’ve noticed that the basics of the system are not clearly understood by many readers outside China. This post is to classify and explain the system in the most simple way possible. It is largely drawn from my own experience as a user in China and from the studies by Rebecca Mackinnon.

The internet censorship in China is a complex system in constant evolution, both technologically and in terms of the content censored. It is managed by the State Council Information Office – Internet Management Division. Until recently it was mostly referred to by foreigners as the Great FireWall of China (GFW), but today the name of Net Nanny is more in use, especially since studies like this one exposed the limitations of the GFW metaphor.

In fact,  both names can be used, as they refer to different mechanisms of the censorship system and they help visualize the basics for non China-dwellers. Man gave names to all the animals, and let’s give clear names to these ones too so that we can avoid further confusion. China’s Censorship system is composed of: the Net Nanny, the Great Firewall (GFW), and the Search Engines Manipulation (SEM). Note the important differences between the three, which can be summarized as follows:

  • the Nanny eliminates content, by forcing self-censorship.
  • The GFW blocks content from access in mainland China.
  • The SEM hides content, making sites unsearchable/invisible.

These three elements or any combination of them are currently used to censor content on the Chinese internet.

1- The Net Nanny

Like a nanny does with naughty kids, the government scolds rebellious citizens who publish content of “vulgar” or political nature. The Net Nanny is the mechanism that controls content by putting pressure on the publishers to self-censor. Of course, Net Nanny methods are only applied when publishers are in some way subject to the power of the Chinese government. Normally because either they are Chinese, have business in China, or have their websites hosted in China.

The Nanny’s power comes from its ability to close down a website, take away the business license or directly impose “stern punishment” on offenders. The Nanny monitors compliance using a large human workforce aided by sophisticated devices that sweep or sniff the data moving about the Chinese internet.  She regularly warns the publishers, either privately or in public inquisitorial lists that make the headlines in Western media.

Final users suffer the Nanny in one of the two following ways:

  • The site where they read/publish content is found non-compliant and closed down, like recently happened to bullog.
  • The site where they read/publish content is self-censoring, erasing individual user’s content or refusing to publish it.

In all cases, content censored or “harmonized” by the Nanny is not accessible from anywhere, regardless of the use of coded connections. This content is not blocked, but simply eliminated from the internet.

2- The Great Firewall of China (GFW)

The Great Firewall is a different creature altogether, although closely related. It is another tool that the Information Office uses to control access to content. As opposed to the Nanny, the GFW is not directly  based on human interaction, but rather on a series of technological devices that are able to detect the sensitive content entering the Chinese internet and block it, whether the original site is in China or not. Depending on the devices used, the GFW can come in different flavours, such as “Reset Connection” or “Time Out”, but the result is always the same: the page cannot load in mainland China.

The blocks applied by the Great Firewall of China are often very quick, automated, and without previous notice to the publishers. In fact, it can happen that the owners of the site go for a long time without noticing, especially if China is not an important part of their business.

Other characteristics of the GFW are:

  • It is only visible to users in mainland China.
  • It is erratic and unpredictable, block can last hours or years.
  • It is easy to bypass using coded connections, like VPN or web proxies.
  • It can affect a single post, a website or a whole host/subnet.
  • GFW often tries to disguise itself as technical problems of the Chinese network.

GFW is the most annoying part of the Chinese censorship. One might think it is worthless, since it can be bypassed by widely available free proxies. In fact it is extremely effective, due to a mixture of laziness and lack of information of the public. Using myself as an example, there are some excellent blogs I had not visited for months just to avoid the (minor) hassle of connecting through proxy. How many Chinese would go out of their way to access political documents like Chrter 08 that they’ve never heard of and they cannot locate in their Search Engines anyway? (see below SEM)

But the worst aspect of GFW is that it embodies the complete lack of respect of the censors for the individual rights of the users. Indeed, to avoid access to a few pages, the GFW regularly blocks whole subnets without previous notice, affecting thousands of users that had nothing to do with the non-compliance in the first place. There are many examples of this, one of them is the major blog hosting service “Blogger”, which has been blocked in China for years.

3- The Search Engine Manipulation (SEM)

This is the part of the censorship system specifically dedicated to Search Engines. Technically it is not a new mechanism, but a caffeinated Net Nanny applied to Search. The main difference lies in the essential role of the Search Engines in directing internet traffic, and the enormous potential for manipulaton that Search Result lists provide. Note that SEM refers only to the List of Search Results itself, and not to the possible blocks happening when clicking on one of the individual resuts, which would belong to point (2) above.

When an internet user looks for a term in a Search Engine, he is trusting this Engine to bring him the most relevant results for that Search. A List of Search Results that is manipulated to show only what the government wants to show is one of the most powerful tools of deception, and one that is less obvious to the final user than the plain blocking of websites. The websites that don’t appear on the list are not perceived as “censored”, they are simply nonexistent.

Like any other websites, the Search Engines can suffer the 2 kinds of censorship described above.

1- They “harmonize” their Result Lists, following the Nanny. This is properly SEM.
2- They get some Search Strings blocked by the GFW.  This is just a URL block of the GFW.

Note that, while (1) is a flagrant case of Search Engines actively collaborating with the system, in (2) it is the URL of the search that trips the GFW, and the Search Engine’s responsibility if any is ony passive (ie. they don’t fight against it) (*). As far as I have seen, all the search engines based in China, including Google, Yahoo and Baidu practice SEM,  the most form of censorship. I already did a little study of SEM recently where you can see some results.

(*)UPDATE: Following suggestions by international expert Nart Villeneuve: I have introduced a few changes of my own in my SEM post. It is very important to understand the role of Search Engines in GFW censorship: to get the details of this complex question you should read proper research papers like this one, or this one.

Also by same author a suggestion of what could be the 4th and newest animal in the Censor’s farm: application-specific censorship such as the censoring of IM’s by qq and Skype.

UPDATE2: Coming back to this post one year later I realize that I have learnt a lot since then, and I have corrected a few details. There are also some aspects that are missing, like details about how the GFW works, the IP, URL and keyword blocks, etc. which I learnt when this blog was GFWed in June 09. You can read all about that in the comprehensive instructions I did after I managed to unblock it.

.

NOTE: Comments and corrections welcome. Also please let me know if something is not clear enough so I can edit/clarify.

A little Study of the Internet Censorship in China

Wednesday, January 14th, 2009

Last Sunday I did a post on internet censorship in China where I mixed in various different ideas and I’m afraid the final result regarding Search Engine Censorship didn’t come out as clear as I would have liked. I think it is an important subject, so here are the complete results:

We will be looking at Google.cn, Google.com and Baidu.com, and we will try in each of them 3 different kind of search terms.

A- Chrter 08: In all its combinations, which are 08宪章 and 零八宪章
B- Political Terms: Tiananmen incidents (天安门六四事件), FLG.
C- Vulgar words: Sex. I will employ the “blog job” and the “chicken bar”.

It is understood that in all cases the search terms are in Simplified Chinese. The browser is Firefox 3.0.5. and the connection is a normal home DSL by China Telecom. The possible results are:

  • Free Search – Results look consistent and realistic, like the ones obtained in the West.
  • Reset Connection (RC) – This can only be seen in Mainland China. The result is an image like the one below and the search engine cannot open anymore for a while (I estimate 30 seconds). RC is not directly done by the Search Engine. Wikipedia internal search also gives RCs for B Terms.
  • Forbidden Message (FM)  – This is the forbidden Message that, with slight variations, is the same as shown below. It says something in the lines of: “Some results are not displayed according to the local laws, regulations and policies”.
  • Manipulated Results (MR)- This is the case where the results are obviously manipulated, for example in the search of 天安门六四事件 (Tiananmen incident) on Baidu, where all the results are official newspapers such as People’s Daily, etc. Sometimes it can also carry on top of the page a FM.

Google.com
A -Free Search.   (But click some individual results gives RC).
B- Reset Connection
C- Manipulated Results.

Google.cn
A- Forbidden Message and (sometimes *) Manipulated Results
B- Reset Connection.
C- Forbidden Message. When used “” gives Manipulated Results.

Baidu.com
A- Manipulated Results. When used “” gives Forbidden Message.
B- FM and Manipulated results.
C-FM and Manipulated Results.

Conclusions

1- The results are somewhat erratic and it is difficult to see a pattern: it all looks like a series of patches on top of each other rather than a systematic implementation. Also, things change in time, as in *, where the Manipulated Result I saw Sunday cannot be seen anymore.

2- Baidu has a different system from Google: it has no Reset Connections. This is very advantageous for Baidu and I understand it is unfair competition, as a RC is one of the worst experiences while surfing.

3- This might be due to Google’s own preference server location: the involvement of the Search Engines in the RC is unclear no direct involvement (even Wikipedia has RCs!) whereas Manipulated Results obviously requires their action, and can more easily attract attention from Advocacy Groups. Of course, in the case of sexual terms (C), this is not a problem as the Manipulated Results can just be called “Safe Search”.

4- The Chrter 08 has different treatment than other political terms, but it might just be because it was banned urgently and suddenly, so it is only a quick fix added to existing structure. It does not provoke RC in any case. It looks like they have decided to leave it alone on Google.com to avoid attention from Western advocacy groups, but in exchange Google has had to give up Google.cn and apply the infamous “porn block” to it which is active censorship by SE. Why the FM and not RC? Who knows, I am guessing perhaps RC is more complicated to implement.

5- In any case, and however negative, I understand it is always better to show FM than Manipulated Results, because the former is openly admitting censorship, whereas the latter is a lie and a distortion of reality. Forbidden Message does increase transparency, yet does not justify involvement in political censorship. From this perspective, Google is closer to the truth than Baidu. Baidu seems indeed a more active participant in the government’s information control schemes, and Chinese users of Baidu are clearly the most exposed to Search Engine brainwash.

UPDATE: Following corrections by international expert Nart Villeneuve below: I have introduced a few changes of my own (in blue). In any case, this post is just a very basic review of the SE Censorship system from the perspective of a normal user. If you really want to understand how the GFW works, you should read proper research papers like this one, or this one.

.

IMAGES:

1- FORBIDDEN MESSAGE (FM)

2- RESET CONNECTION (RC)

NOTE: If someone is interested in this or has some more information to share please put it in comments. Unfortunately my time is very limited so I only ran 2 or 3 terms for each of the classes A, B and C above. There might be things I overlooked and I would be grateful if you can point them out.