The Tower of Babel : normalizing language representation

Posted on August 23rd, 2009 by metasj.
Categories: SJ, chain-gang, international, popular demand, wikipedia.

Part of a series on difficult topics from the Wikimedia community

There are some perennial projects that take more than a single barnraising to understand and plan for. One is the issue of supporting different languages equally — the world’s largest and smallest languages are both underrepresented among the projects.  While I would like to see Wikimedia become a model for the rest of the online world in this area, how a global community can provide support, bugfixes, and advice to different/new language groups is an issue for many multilingual projects.  So I offer these questions to all readers – feel free to answer them for the projects you are most familiar with.

  • What technical and other support do various language projects need to become awesome?
  • What variations are needed for projects whose main goal is language and cultural preservation?
  • What sharing of advice or practices would make starting new projects easier?
  • How can established projects help new projects with outreach, communication, and planning?

Let me offer one example of how this has been difficult to grasp within Wikimedia: discussions on the early international list were generally in English.  This led to a certain founder effect among participants, and in how the projects are today framed to the world, from elaborations of the vision to interface design.  And this has forked discussions of what language projects need – those in the language of the project, which can happen easily and fluidly among its participants and contributors, and those meta-discussions in one or two shared languages with the potential of setting Wikimedia-wide policy or affecting all projects.

As another example: non-Latin character sets, and cultural differences about editing and participation across different parts of the world, have always been part of discussions about how Wikipedia and its sister projects should advance.  Nevertheless, the early language communities drawn to the project were largely European, and issues that only affect non-Latin readers can still take a while to fix (for instance, replacements for Roman-alphabet captchas, or fixes to javascript and css layouts in corner cases).

What are your examples? What am I leaving out?  How can the global community and the Foundation better support small and underrepresented languages?  Feel free to leave links to current or historical discussions about problems and opportunities.

1 comment.

ICT4Dev and three-legged stools

Posted on August 20th, 2009 by metasj.
Categories: Uncategorized, popular demand, wikipedia.

The ICT4Dev aggregators on technology and learning have been covering some excellent topics over the past few months, and doing a good job of bringing some new commenters into these discussion online.

Here is a series, part of the Educational Technology Debate, on ebooks and affordable access to [preexisting] content, featuring Dick Rowe (Olé!)and Angus Scrimgeour. People still avoid talking about building new materials from scratch – the sort of work that a skillful teacher engages in every week – which is when another leap forward will begin. But they are keen on finding ways to let interactivity and creativity improve and annotate books and class materials.

Do we need a three-legged stool? Will it balance?*  What else is missing?

* I can see a whole new series of YouTube videos based on this hook… including everything from architecture to ontologies.

1 comment.

My Wikimedia platform

Posted on July 30th, 2009 by metasj.
Categories: chain-gang, international, popular demand, wikipedia.

logometa

I’ve organized my thoughts about being a good Board member in my platform for the Wikimedia Board.

The most common questions I have heard since this year’s elections began are, what does the Foundation do? and what is the Board of Trustees for? I posted answers to these questions and a few more.

People also ask, how do I qualify to vote? To be eligible to vote,

  • You must have 600 edits as of June 1, and 50 within the past 6 months.
  • You may need to create a Unified Login to count edits on more than one project; or to vote from your main wiki.
  • If you are not eligible, you can still encourage fellow Wikimedians to vote, or leave suggestions for future elections

I am more intent on this year’s election than I have been in any year past – in part because the Board’s role has been shifting away from one that actively engages and challenges the community, something valuable Agnela and Anthere brought to the Board that I miss.  I am deeply concerned by the lack of community growth for the past two years, and the complete stagnation of new project development (despite the growth of new independent educational free knowledge projects that requested Wikimedia hosting). And I was just talking to my friend Bibhusan Bista, who said that there is definite interest in the Foundation in Nepal, and in contributing to Wikipedia’s spirit of openness; but of course few editors there feel they can engage in related discussions (and none, for instance, would be eligible to vote).

So I have two goals for my campaign beyond getting elected: to inspire people to vote and remind them why a good foundation matters, and to encourage them to raise community priorities and requests of the foundation, while attention is on governance over the next two weeks (and while you can get an immediate response from at least three future Board members, something often hard to come by).

My request to you, if you appreciate Wikipedia and want to see it thrive, whether or not you have the edit count needed to vote: please leave suggestions about how Wikimedia should grow, blog about the election and your reasons for caring about it, and help support the election in smaller languages and projects.

4 comments.

Offline reading and editing of the world’s knowledge

Posted on July 26th, 2009 by metasj.
Categories: international, popular demand, wikipedia.

Offline wikireaders have been around for over a decade in various forms, but still it seems few of them are really excellent.  (If you’re interested in such things, I have a mailing list for you…) At OLPC I’ve worked on various ways of sharing content with groups of students who are offline, and last year Chris Ball and Wade Brainerd built a WikiBrowse application, based on Patrick Collison’s iPhone Wikipedia app, that has been downloaded by 400,000 children and teachers in English and Spanish.  This was the first reader to store a compressed dump, expanding pages as they are read, and including a few images.  But it still doesn’t allow you to easily compile your own version of WikiBrowse based on your preferred title list, and it doesn’t support full-text search or offline editing.

Now Pascal Martin of Linterweb and Wikiwix fame has released a new product : Okawix, the engine behind a new DVD snapshot of wikipedia; it is now linked from download.wikimedia.org.  You can reads more about it on their Wikiwix blog.  This could be the foundation for a fully functional Wikipedia on a Stick project, with editing and commentary, as the WikiStick hackers from Taiwan envisioned a couple of years ago.  See for yourself!

1 comment.

on the future of Wikipedia

Posted on July 22nd, 2009 by metasj.
Categories: international, wikipedia.

A number of recent initiatives have been started to plan for the future of Wikimedia projects and of Wikipedia in particular. The Foundation has made a 12-month Strategic Planning initiative one of their top priorities for the coming year, and hired three staff and an outside consultancy for the purpose of organizing input from the communities.

On the English Wikipedia, the Arbitration Committee tried to organize a community think tank to provide research and advice on community development and long-range plans, something which is generally wanted and needed by the community, but which people didn’t like having associated with the AC. (personally I think the idea will work fine once people get rid of application processes and acceptance metrics, and simply encourage everyone to take part in a focused sort of brainstorming, in a well-ordered way.)

At the New York Wikiconference this coming weekend, a number of the talks are about planning for outreach and future chapter and project growth — something it would be good to see more of at local events and on-wiki.  And I am running for the Wikimedia Board in part to help vitalize and expand Wikipedia’s sister projects, which have never emerged from its shadow (while still promising the same sort of universal single-source for free knowledge that we would all love to see and use).

So… what would you like to see in Wikipedia’s future?  What have you been waiting to happen for years that hasn’t yet come to pass?  What would you like to see from Wikibooks, Wikiversity, Wiktionary, Wikinews, or Wikisource?  Are you still secretly hoping that Wikispecies will merge with the Encyclopedia of Life?  Do you want Wikiquote to be as popular as LyricWiki, only legal? Are you happier with Enciclopedia Libre and WikiZnanie?  Let me know.  The best ideas will be thrown up on the whiteboard at the wikiconference.

4 comments.

wikiboarding

Posted on July 20th, 2009 by metasj.
Categories: SJ, Uncategorized, wikipedia.

I am running for the Board again this year, with the hope of bringing a stronger community voice to the Board, and organizing good and frequent open discussions between the Board and community about priorities, core services, new initiatives, and the like.  Angela organized a few open meetings long ago when she first joined the Board which I really appreciated, and which encouraged some previously invisible community members to come forward with good ideas.

Meanwhile, my friend Kat Walsh has not yet stood for re-election to the Wikimedia Board of Trustees, though I hope she will!

Update: she did, and she was reelected for another term!  Congratulations :)

She is among the last of a certain breed of board members who have been strong advocates for community involvement in key decisions, and we could use more.  The current Wikimedia Foundation is strongly in support of openness even without nagging from the Board – for instance in framing the upcoming year-long strategic planning as a process to facilitate and crystalize plans from the many communities – but without active community trustees we might no longer be so lucky a few years from now.

My official statement, and throwback to an earlier era, after the jump.

(more…)

2 comments.

on disambiguation and The Atomization of Meaning

Posted on June 25th, 2009 by metasj.
Categories: Glory, glory, glory, Uncategorized, metrics, wikipedia.

Disambiguate has been a somewhat obscure term for ’specify’ for ages.  And the noun form, disambiguation, has been used even more sparingly.  At some point in the last century, perhaps in the 1950s, it became a popular term in computational linguistics.   And before that it was basically only used by one person, writing about logic and semantics in the early 19th century.  All of this sprang to my mind because of the tremendous popularity of the word in and through Wikipedia.  In the encyclopedia, it is the canonical way to describe the clarification of an ambiguous term, the indication of type used to specify the context of an article title.

A bit of background.  The word disambiguation was not popular before the 50s.  It is used in quotes in a 1954 federal court case, expressly referencing the earlier work of the one philosopher and author who consciously used it for a specific purpose: Jeremy Bentham.  But who introduced it into the jargon of linguistics?  And to the original point, who introduced it to Wikipedia?

bentham-ontology-exposition

The word’s recent history touches on Rush, Nirvana, Invictus, Larry, and Magnus… and started with a page on Naming conventions/Disambiguating.  Details after the jump.

(more…)

2 comments.

weddings and nostalgia

Posted on June 13th, 2009 by metasj.
Categories: Uncategorized, wikipedia.

I was at the wedding of my dear friend Erik Cohen-Levy two weeks ago, in Texas – quite a lovely and relaxing celebration.  And was bitten by something unpleasant which over time made me quite ill.  It took a while and some divergent opinions to get a blood test… I should know more Monday about what it is.  But it’s nice in a way to know I don’t have the flu.

So I’ve been exercising, hydrating like I had a concert every night, and feeling inexplicably nostalgic.  And as I’ve been too tired to move around much, I had time to get to a piece of sleuthing I’ve been meaning to do for a while : to track down who popularized the term ‘disambiguation‘, which Wikipedia has now made a household word !  The nostalgia wiki was helpful, and I’ve turned up some interesting leads, which I will share in another post.  But if you have information on the topic, please share.

1 comment.

Another reason not to copyright logos

Posted on May 28th, 2009 by metasj.
Categories: Uncategorized, wikipedia.

…and should leave their source files publicly available.  Because most logos need work, and you never know who’s going to improve on yours for their own reasons.  (Of course you should still trademark them.)

Bjorn S. reflects on designing the first Wikipedia logo (designed for Nupedia but never used on that project), and describes how it led to today’s silver ball… and how he didn’t know about any of this until a week ago.

0 comments.

Wikipedia now incompatible with third-party GFDL text

Posted on May 27th, 2009 by metasj.
Categories: Uncategorized, chain-gang, international, wikipedia.

The GFDL 1.3 allows collaborative sites to switch from the GFDL to CC-BY-SA 3.0 as their license, under limited circumstances.

Wikimedia has been advocating for this change for some time, and with much effort from the FSF and Creative Commons a solution was worked out last November: such a transition would be available only for massively collaborative projects, and only for a limited time.  If a project opted for this transition, it could not incorporate any new GFDL material after the release date of the new license (November 3, 2008); and it had to decide by August 1, 2009.

Given the first date, one would assume a site would want to move as quickly as possible to decide, to avoid a prolonged period when no outside material under most any free license could be incorporated.  Nevertheless, it took us over 6 months to decide to make the transition.  Now we are faced with two hurdles: ensuring that no GFDL material has been migrated into a Wikimedia project since November, and far more complex, communicating with the hundreds of smaller GFDL wikis who chose their license for compatibility with Wikipedia, to ensure they know about this change and what it means for them.  They only have until the first of August to figure it out.

So I’ve started compiling a list of GFDL wikis and other collaborative sites that have not yet indicated any awareness about the license switch or considered switching themselves.  This includes at least half of the 20 largest GFDL wikis other than Wikipedia, both major medical wikis (Medpedia and WikiDoc), PlanetMath, and the old Spanish Wikipedia fork.  Please help contact these sites and update their status on this project page: [[m:Licensing update/Outreach]]

3 comments.

Twitterpedia FTW

Posted on May 17th, 2009 by metasj.
Categories: Too weird for fiction, Uncategorized, indescribable, wikipedia.

It has been claimed that twitterpedia will one day replace Wikipedia, at the point where everyone needs no more than a tweet about any given topic.

FT2 astutely comments:

the predictable twitterpedia sequel follows:
- user#217869: pov warring!!
- @83476238 not so!
- @217869 is so!!
- @83476238 not so!
- @both: u blocked 24 hrs 3rr
- @admin plz no?
- @217869 o ok
- @admin kthxbai
- @83476238 u block I not u suxxor pov war!!!
- @admin u involved,, @arbcom plzdesysopkthx?
- @user no wai!!
- ……….

A tip of the hat to all involved for scrying the essential parts of our post-singularity knowledge landscape.

0 comments.

Wikipedia researchers wanted!

Posted on May 4th, 2009 by metasj.
Categories: chain-gang, metrics, null, wikipedia.

Do you know people who are currently doing statistical and social research about Wikipedia, or have good ideas about this they haven’t had time to work on?

I’m trying to build support for continual, detailed statistics generation from Wikipedia data, possibly at the Harvard-MIT Data Center.  There is still time to come up with good ideas for lightning talks and discussion groups at Wikimania 2009 this summer in Buenos Aires.  And there is a research-related Wikimedia job available starting this summer.

I am uncomfortable with many of the details of said job posting*, but as long as its up the best people should apply.

(more…)

1 comment.