Making smarter readers: Lessons in Trilliscience

Last week, I was chatting via IM with my brother about a party he hosted. Still in college, he’s 7 and a half years younger than me, so when he described the music playlist at his party as “crunk,” I had no idea what he was talking about. But in less than a second, I not only knew the meaning of “crunk” but was able to respond intelligently with: “As in Three 6 Mafia?” I had achieved instant context and instant application. How did I do it?

I was Trilliscient.

A year ago this April, Cerulean Studios, the makers of the popular chat client Trillian, released a new feature as part of version 3.0 of its software: integration with Wikipedia to provide instant lookups. Not to worry — this is not another article about Wikipedia. But what is remarkable about this enhancement is why Wikipedia was chosen for this feature and what the latter could mean for the future of content delivery.

Here’s how it works. Every word that users type in their chat session is scanned and compared against Wikipedia’s database of articles. If a word or phrase matches the title of an article on Wikipedia, that keyword or phrase is underlined in the chat window. When the user hovers over the link with the mouse, a small window (tooltip) displays the first few paragraphs of the entry. It’s a simple but potentially revolutionary innovation. Appropriately, an entry on Wikipedia notes that the effect of this innovation is called Trilliscience, a wordplay on “Trillian Omniscience.” Wikipedia’s definition of Trilliscience is “the instant application of previously unheld knowledge as the result of passive searching.”

So far, Trilliscience hasn’t revolutionized anything. It’s still basically confined to its original, solitary application in Trillian. But as media organizations continue to struggle with the opportunities provided by the Web, two closely-related industries are rapidly maturing: search engine optimization and search engine marketing. Understanding the techniques developed by these industries and the example provided by Trillian could help media organizations (independent and otherwise) both inform readers and more efficiently monetize content.

Third time’s the charm?

Cerulean Studios was not the first company to try to embed contextual references. Microsoft tried with its Smart Tags, successfully annoying the hell out of an entire generation of users. The feature — a rollover list of related links to be chosen by Microsoft and its partners — was originally intended for use with IE, but was withdrawn after criticism. Smart Tags are still available within Microsoft Office, but people who make use of them must be few. Smart Tags embeds all sorts of useless and intrusive actions such as “Send Mail” and “Schedule a Meeting” into the word processing interface. If I want to schedule a meeting, I’ll open my calendar. I’m in Word because I want to type.

Characteristically, Google was smarter (and scarier). In a new version of its Toolbar released early last year, a feature called AutoLink made its first appearance. AutoLink, while controversial due to its ability to modify and add links to a Web page, does embed links that are useful. The feature can create links to Google Maps for street addresses and ISBN lookups on Amazon for books. (It’s worth noting that Jon Udell was doing contextual ISBN lookups over three years ago.) Google’s links are less intrusive than Microsoft’s Smart Tags. Smart Tags display a block with a big triangle in the middle if you have the nerve to brush a word with your cursor. The links that AutoLink creates are just links.

But the integration that Cerulean Studios added to Trillian with Wikipedia is different from Smart Tags and AutoLink. And, from both a reader’s and a content provider’s perspective, the Trillian feature is more useful.

Here’s an example. Say I’m reading the New York Times online. I come across the name Mohamed ElBaradei and I don’t know who he is. Newspapers have an easy answer for this: they simply explain it using a handful of words. But what if, as an online reader, I am suddenly more interested in learning about Mohamed ElBaradei than I am in finishing the story? As the publisher, do you want me to type “Google.com” into my address bar and navigate away from your page in order to learn about Mohamed ElBaradei? No, you don’t. You want me to stay on your site and view more ads. So how do you meet the needs of the user (to learn more about Mohamed ElBaradei) and satisfy your own need to keep users tuned in? Make readers Trilliscient.

Making it work

Of course, implementing Trilliscience would require publishers to make site-by-site determinations as to the best interface and source of information in the rollover. For Trillian, Wikipedia served this purpose. Because it is not part of Cerulean Studios’ business model to provide content, the company outsourced it. That provided a benefit to Trillian users by allowing them to have smarter conversations, more efficiently.

Why Wikipedia? There are a number of reasons imaginable, some of them technical, and some of them legal. First, Wikipedia provides content (of questionable reliability) for free. Nobody is going to get sued for integrating Wikipedia content into their website or application. Second, Wikipedia and the wiki model of organization automatically optimizes URLs, an important component of Search Engine Optimization (SEO). This means that a page’s URL reflects the title of the page, making it easy for spiders to crawl a website and make intelligent guesses about content.

Trilliscience also adds a new twist to search. Remember the definition? Trilliscience is “the instant application of previously unheld knowledge as the result of passive searching.” Readers may not be actively searching for new content, but in those instances where their curiosity or lack of knowledge on a topic would normally switch them into active mode, websites can now provide instant gratification to that impulse. And Trilliscience lets publishers do that without losing the reader.

Websites that already publish massive amounts of content (like newspapers) can make great use of this technique. Even if they don’t have URLs optimized for crawling, they do have access to the interfaces that can provide related content. For example, to return to the New York Times, when I’m reading about Mohamed ElBaradei, what if I could hover over his name with my cursor and instantly get a short history, as well as links to other Times stories about him? I would most likely either finish reading the story first, having satisfied my initial curiosity, or click one of the links offered within the tooltip — then I would be more likely to stay on the website for the New York Times.

This also removes the burden to producers of manually adding links to an article. Some sites do a better job of providing links for referenced terms, but inevitably they either just redirect me to a search page (again, away from the article I was trying to read) or in the case of things like stock quotes, redirect me to an entirely new page, again potentially losing revenue.

It’s possible to go even further. In addition to a tooltip with a definition or introductory paragraph from a free or partnered content provider like Wikipedia or Britannica, you could provide links to other articles on your site. Or you could add search results from search engines on the same topic using keywords on which you have heavily bid, or your own text advertising. The level to which you make use of advertising in these lookups is up to you (and how much your users will tolerate), but it does present another opportunity to monetize content.

In all of this, the question of fair use and copyright violations must be addressed. The safest course for publishers is to only use their own content. A second tier would be to integrate freely available content, but again, reliability varies and it is necessary for publishers to assess the risk to their reputation against the potential for profit. Partner relationships with commercial content providers or resellers (like Amazon) may be another means for profitable affiliate payouts. The worst thing publishers could do, from a legal perspective, would be to arbitrarily include commercial content without permission. There are strict laws that govern the derivative use of works, and this is becoming stricter as courts struggle to keep up with new technology.

Keeping all of the above in mind, judicious use of the model provided by Trillian could help publishers increase profitability while enhancing the reader’s experience.