
Using Wikipedia to change the language of the Web


Nothing exemplifies the power of wikis, the open and collaborative platforms for content creation, like the online encyclopaedia Wikipedia. The site, which anybody can freely and collaboratively edit, is credited not only with having created a massive repository of knowledge but also with democratising the presentation of content on the Web.

Wikipedia, which turns 10 this January, has over 35 lakh articles. In 2010, several Wikimedia Foundation bigwigs visited India and held public meet-ups with the Wiki community, and the Foundation appointed Bishakha Datta as the first Indian to sit on its board. Apart from the formal Indian chapter, which has been on the anvil for some time now, the Wikimedia Foundation has also chosen India as the site of its first offshore office.

Why India?

But why India? The large number of potential Net users here, and the ‘ground support’ that exists in the form of a passionate community of Wikipedians, drive these “offshore efforts”. However, the Foundation realises that the ‘Indian Internet’ is by no means a homogeneous entity. During recent visits to India, Wikipedia co-founder Jimmy Wales has repeatedly articulated the need to approach Wikipedia growth here from a strictly ‘localised’ perspective, by expanding the user base for the local language Wikipedias.

Yes, the Internet, with English as its predominant language, can barely make inroads into vast areas of the country. Indian language content on the Internet is scarce, and is restricted either to niche blogs or to news content. Internet firms are also interested in changing this, so that web advertisers can target larger local audiences.

So, how can wikis help drive this change? It seems natural that a massive task like this one, that of creating and expanding local language content, is best tackled ‘collaboratively’. And that is just what Wiki communities do best. As of today, there are Wikipedias in over 20 Indian languages. While there are 58,000 articles in the Hindi Wikipedia, Telugu and Marathi too have been growing steadily, clocking 47,000 and 32,000 articles respectively. The Tamil Wikipedia has around 26,000 articles, Bengali 22,000, Malayalam 16,000 and Kannada 9,900. Together, these Wikipedias are arguably the single largest source of Indic content online.

Early challenges

Enthusiastic Wikipedians (Wikipedia editors/contributors) will tell you that this growth has been anything but easy. Buggy fonts, a lack of platform-independent fonts and the absence of a common standard for keyboard layouts marred early efforts. Data input, though much improved today, is still a challenge for non-technical folks. Most operating systems, particularly proprietary ones, still do not support Kannada fonts ‘out of the box’, points out Hari Prasad Nadig, an active Kannada Wikipedia editor.

“Data input was a huge challenge when we started building the Kannada Wikipedia in 2004. Even the Nudi font for Kannada, declared a standard by the Karnataka Government, worked only on machines running Microsoft Windows. Even there, Windows XP, the most popular OS, still doesn't offer complete support for Unicode Kannada,” he explains. Most Indian languages face similar issues with the rendering of Indic fonts.
