Archive for December, 2004

Monday, December 13th, 2004

Maybe I’m cheesy but I really like the idea of a Voice Recording Picture Frame as a present for my mom.

Monday, December 13th, 2004

Lou has some questions about Enterprise Topic Maps.

Monday, December 13th, 2004

I met with Mica, Kenyatta, Josh and Jay today to discuss the future of me-tv. I will talk about that more later, meanwhile, here’s the video (Quicktime, about 4 Megs). I was psyched when I realized what a fantastic group of people were together in that room! The movie includes the first secret preview of the upcoming ANT tool which is like ipodder for video.

Saturday, December 11th, 2004

I am looking for a piece of PHP code that returns URL’s found in piece of text. I need it to recognize the urls in the following example (found in an RSS feed), which my current code can’t handle.


<a href="http://www.michaelverdi.com/video/watermark.mov"target="_blank"><img src="http://www.michaelverdi.com/images/watermark.jpg" border="0" /></a> <br />That's right, I spent all weekend at my computer for your viewing pleasure.

Saturday, December 11th, 2004

Two good friends got married today, to Willie and Melina, I wish them all the best. I made a movie too, the quality is really bad, I’ll figure out how to make that better and maybe post it again tomorrow. But here it is .

Friday, December 10th, 2004

me-tv is still in alpha, but already some cool stuff is becoming possible. Watching video online is just a drag, and me-tv makes it easier, and you know what, for watching old-school TV it sucks, but for watching all that other stuff, it rocks. Feedster has a videoenclosures feed, which I fed through me-tv. This is what it looks like through me-tv:

Friday, December 10th, 2004

Google Suggests (beta) is great fun, try it out.

Guide to Ease �

Friday, December 10th, 2004

As an experiment (this should work), I’m reblogging a link to a video interview I did with Jay. If I add in the link to the video in this post, me-tv should pick up this video as well, once I run the cron job.

Thursday, December 9th, 2004

What is a camel? A camel is a horse designed by committee.

Bienvenido | Colombia Guide

Thursday, December 9th, 2004

I have an HTML/CSS question about this page. You see where it says “Submitted by Peter”, and there’s a star next to the name. Why does it wrap to the next line? (Firefox and IE) I can’t figure it out, except that it might have something to do with the css (that I didn’t write myself). Anyone?

Thursday, December 9th, 2004

James
is doing some interesting innovation with taxonomies and folksonomies and what not. By the way, the aifia-members list has a great conversation going on about folksonomies - join aifia to get the goods!

Wednesday, December 8th, 2004

Challenges with automated downloads: Blogdigger Development Blog: “Even moderately to unpopular podcasts could incur large bandwidth costs (case in point: within a few hours of putting the Blogdigger podcast on the Blogdigger media feed, it was downloaded several hundred times; still I’ve yet to receive an email or comment telling me how hilarious it was, which obviously means no one listened to it ;).”

Wednesday, December 8th, 2004

Selfpromotion: Blogdigger Development Blog says (about http://me-tv.com): “It is simply brilliant.” Thanks.

Locals playing with digital camera | Poorbuthappy guide to India

Tuesday, December 7th, 2004

Tuesday, December 7th, 2004

I added signing up to the me-tv alpha today. A feature/bugfix a day keeps the doctor away. It takes me about 2 hours every day. Bootstrapping. I’m happy I released early, I have already over 50 people using it and they provide great feedback on what’s important to them. It’s a videoblog browser, by the way. In some important ways, it’s different from a text blog browser. Is it different enough, or will Bloglines eat my lunch? Keep tuned.

Monday, December 6th, 2004

Anyone know how to hack the Magpie RSS Parser to support enclosures? Enclosures use attributes instead of element content, and MagpieRSS doesn’t support that.

Monday, December 6th, 2004

The crazy thing about videoblogging, and what gives me hope, is that the videobloggers (I’m not a real videoblogger - they are) are WAY excited. I mean, you remember how you were excited about your first contact with the web? Your first website? Like that.

Mica told me how Charlene found out about this new medium. Mica showed her videoblog, and Charlene came running upstairs (if I understood this correctly) all excited, and then started calling her in the middle of the night with questions about how to do this. They’ve been posting a video almost every day. Imagine the drive behind that. People want this. They want to have a voice.

Anyway, I am hacking together what will hopefully not be known as a “vogbrowser“. Check it if you’re interested. It’s still buggy, but it’s getting better every day. Check out Mica, Jay and Charlene’s feeds, as good examples. I am working on recognizing more types of feeds… (My friend thinks it’s “better than tv“).

Monday, December 6th, 2004

I just made 3 short phonecalls to Belgium (from NYC), using SkypeOut. It was cheap: only 15 cents, and worked great.

Monday, December 6th, 2004

Yahoo! News - U.S. OKs Evidence Gained Through Torture: “Evidence gained by torture can be used by the U.S. military in deciding whether to imprison a foreigner indefinitely at Guantanamo Bay, Cuba, as an enemy combatant, the government says. Statements produced under torture have been inadmissible in U.S. courts for about 70 years. But the U.S. military panels reviewing the detention of 550 foreigners as enemy combatants at the U.S. naval base in Cuba are allowed to use such evidence.”

Monday, December 6th, 2004

Topic Exchange: Channel ‘multilingual_blogging’

Monday, December 6th, 2004

Joel on the impossibility of translation.

Sunday, December 5th, 2004

ongoing � When Secrets Make Sense: “The point about incompatible architecture is right, by the way; by analogy, if the OpenOffice guys could download all of the Microsoft Office source code tomorrow, it would probably slow them down more than help them.”

Open source is something strange. I was looking at Zend’s contest for PHP5 code: most of it is typical open source programmers: programmers doing programmer stuff: message boards, application environments, all that. One project stood out for me: PHPClick: ” phpClick is a web-based web application development tool targeted at nonprogrammers.” In other words: create forms and such without programming. Now that’s something I haven’t seen done well, yet. I tried it out and was expecting a typical programmers’ tool, but behold, the interface looks promisingly usable. Here’s a screenshot.

At the right you see componens (checkboxes, text, …). If you click on them, they’ll show up in the preview screen. It’s like Visual Basic!

The project still isn’t really functional - I hope they are working on the code as much as on the interface (an unusual comment for an Open Source project!) And let’s be superficial: they have a nice logo too:

Later: I was clicking around, and behold: “phpClick is developed as part of a collaborative Virginia Tech and PennState research effort.” So that’s why the app felt so un-typical and innovative. There is solid research behind it. :)

Saturday, December 4th, 2004

Can I just say, again, how much Wordpress rocks? I went to my Video entry category, and, on a whim, added /rss2 to the URL. And behold, a beautiful RSS2 feed shows up. Nice work.

Saturday, December 4th, 2004

I almost hesitate to report on this, but might as well get it out. Momentshowing: The Story of Bandwidth II: Duncan has already used of 1000 Gigs (!) of bandwidth this month on his Typepad videoblog. (I estimate that’s costing Six Degrees US$ 400.) Big ups to Typepad for taking this kind of abuse in stride. Videobloggers (many of whom are on Typepad) are holding their breath. Then again, with the recent funding of US$ 10,000,000 Six Apart got, and their plans for expansion, surely including media like video, they probably don’t mind spending some cash for that videoblogger goodwill :)

Momentshowing: The Story of bandwidth:

Friday, December 3rd, 2004

Momentshowing: The Story of bandwidth: “hahahha I literally used up 1GB last night. Typepad only gives me 3GB a month…but still doesnt worry about us going over. So far.”

Typepad has been letting people post as much video as they want, which is why a lot of videobloggers use it. But as videoblogging becomes more popular, that just can’t last.

NY Mosaico

Friday, December 3rd, 2004

NY Mosaico a good site on cosmopolatinos in nyc.

Friday, December 3rd, 2004

Toward a Literacy of Cooperation

Friday, December 3rd, 2004

More typical american words: condo and suv. Yesterday, a Belgian colleague told me he was working on a site and they had a “community” tab in French “communite”. In Flemish (=Dutch) though, “community” translates into “gemeenschap”, which has sexual connotations.

My Dutch readers, have you seen a good translation of “community”?

Friday, December 3rd, 2004

virtualteams.com: Networks and virtual teams: article on how far-flung teams can be super effective, if they do 3 things:

  1. They exploit diversity. The team can’t just be diverse; it has to make the most of it. Our teams credit their creative breakthroughs to challenging people from different disciplines, cultures, and the like to come up with something better together. They did.
  2. They use pretty simple technology to simulate reality. By today’s standards, what they use is not very complicated. More than 80% of the teams use teleconference calls and shared websites. More than half used IM even when their companies prohibited it. Only a third used video conferencing. Some banned email.
  3. They hold the team together. It takes a lot of communication. Some leaders spent as much as a third of their time just on the phone with team members.

I am starting to work on far-flung teams, and one observation so far: we’ve been wanting to do meetings over Skype, but because of technical problems, we went IM instead. IM meetings are extremely effective. Something about IM gives you a few seconds to think before you talk. It just works. Meetings last only 10 to 20 minutes, and we usually take 2 to 4 decisions in such a meeting (2 people). If you’ve worked in coorporate America, you’ll know that’s much better than the standard (meetings 1 hour, 1 decision per 3 meetings). Imagine the time we save! We haven’t gone back to Skype yet, and I don’t think we will.

Friday, December 3rd, 2004

Mapping business: Worthwhile: Pecking orders and network orders: “Take the org chart. Treat it not in the usual way as a network of people but as a network of positions. Look at how many people report in to each position. Diagram it. The picture that emerges looks little like a neat hierarchy in which the lower you go, the more leaves there are on the tree. Instead, the org chart now looks like a diagram of self-organizing networks: A handful of hugely linked nodes and a whole bunch of ones with just a few links. Further, most of the biggest clusters belong to mid-range positions. “It’s a diamond, not a pyramid” says Jeff.”

Friday, December 3rd, 2004

I’m in the middle of another comment spam attack (which means I delete a few hundred spams, a few minutes later I have 20 more comment spams), so I installed Kitten’s Spaminator, which should automatically delete spam. Let’s see how it works. I couldn’t find a Wordpress setting that lets you temporarily just stop commenting over all the site.

Friday, December 3rd, 2004

The magic that makes Google tick - ZDNet UK Insight: “Google’s vice-president of engineering was in London this week to talk to potential recruits about just what lies behind that search page.”

At the end of the article: “One big area of complaints for Google is connected to the growing prominence of commercial search results — in particular price comparison engines and e-commerce sites. Hölzle is quick to defend Google’s performance “on every metric”, but admits there is a problem with the Web getting, as he puts it, “more commercial”. Even three years ago, he said, the Web had much more of a grass roots feeling to it. “We have thought of having a button saying ‘give me less commercial results’,” but the company has shied away from implementing this yet.”

I’ve noticed Google getting really bad results for commercially contested searches. The ads are often more relevant. That’s bad, and they should fix it. They have a lot of goodwill, but it doesn’t last forever.

Thursday, December 2nd, 2004

I have a new What is category, which is, if I may say so myself, looking promising.

Thursday, December 2nd, 2004

Using search log analysis to divine emerging stories

Thursday, December 2nd, 2004

What is a Weighted List?

Thursday, December 2nd, 2004

“1005 comments deleted.” With 2 clicks. Fuck those comment spammers. Bring it on.

Translating categories, translating terms

Thursday, December 2nd, 2004

My series of posts on international information architecture:

  1. Translating taxonomies and categories
  2. Translating categories, translating terms (this post)
  3. Translating the Dewey Decimal Classification system
  4. Designing the relationship between content and locales
  5. Emergent i18n effects in folksonomies
  6. The Maori versus Dewey, and why limiting access can be culturally appropriate.

In the previous post, I talked about the various types of problems with translating categories:

  1. Culture-specificity: not all categories exist in all cultures. For example: “chowder” soup is a typical american category.
  2. Semantic overlap: a lot of categories don’t mean exactly the same in different languages. “Fille” (French) and “girl” (English) don’t mean 100% the same.
  3. Differences in granularity: Germans don’t have a word for skidding, but they do have two words, Rutschen and Schleudern, for skidding forwards and skidding sideways.

I’d like to explore those problems a bit further today, and talk about the seemingly different positions on this in the library science world.

First, a lot of work on translating terms has been done by the thesaurus people. The International Standard Guidelines for the establishment and development of multilingual thesauri - ISO 5964, 1985 (and paralleled in the British Standard BS 6723, 1985) talks about the problems I mentioned above. (You can buy the standard for a few 100$.)

In the UK, they are in the process of revising the two BS Standards on mono- and multi-lingual thesauri as one work under the title Structured vocabularies. They regard translating thesauri, in a way, as a special case of mapping thesauri. Here are some examples of translation problems they address:

English: clocks
French: horloge et pendule
En French, you need two words to include the different types of clocks that the English word “clocks” describes.

French hierarchy: Betail > gros betail > boeuf
English hierarchy: livestock > — > cattle
In other words, English doesn’t have a category for “gros betail” (which means something like large livestock).

Another example:
German hierarchy: Gastropode > Schnecke > Gehauseschnecke
English hierarchy: Gasropods > — > snails

I will talk more about their excellent work in future posts. But what matters for today’s post is that the library science folks are well aware of the problems with translating categories.

Meanwhile, I was emailing with David Weinberger and he mentioned how the people of the Dewey Decimal Classification System explained him that it is based on a “concept tree” and therefore can be easily translated. It seems like the Dewey people believe that, because of the fact that they created a “concept tree”, that translation is just an implementation detail. And indeed, Dewey has been translated into over 30 languages.

So my question is
: are these two separate movements in the library science world? Do they know of each others’ work?

My second question is: do Dewey translators have the kinds of translations problems spelled out in my last post? If not, is it because Dewey has been carefully constructed as a concept list? Or is it for some other reason? I am having a hard time finding stuff about this on the web, so pointers related to Dewey translation are welcome.

Next, let’s look a bit at how different types of categories may pose different translation challenges.

Top level categories. Somehow, I just don’t think “About Us”, “Products & Services” or “News” would be hard to translate into any popular language. What is different about these categories?

Basic level categories. “Cat” is a basic-level category, whereas “feline” isn’t. Are basic-level categories easier to translate? You’d think that they would.

Ad-hoc categories. An as-hoc category is something like: “Things you have eaten today”, or “Books other people have also bought when looking at this book”. It’s a category that is created at the moment of using it (something like that). I would expect these would be easy to translate.

Ambiguity: the strongest predictor of how easy a category is to translate, is, as far as I’ve figured it out, how easy it was to create in the first place. In other words, if you have to do user research and such to create a category, you’ll have to put in some work to translate it as well. And it seems that, the more ambigous the category is (the more discussion possible about what goes in it and what doesn’t), the more work it is to create it.

Thoughts?

Thursday, December 2nd, 2004

A good interview with Matt Mullenweg from < href="http://wordpress.org">Wordpress.

Thursday, December 2nd, 2004

MSN Spaces(Microsofts’ just launched blogging service) seems to have a signup bug: I sign up, and then I get the same “Get started” page again?

Wednesday, December 1st, 2004

If you don’t have wireless at home yet, you can now get a Netgear MR814 802.11b Wireless 4-Port Cable/DSL Router at Amazon for, after rebates, US$7.