<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Online Journalism Review&#187; Google News</title>
	<atom:link href="http://www.ojr.org/tag/google-news/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ojr.org</link>
	<description>Focusing on the future of digital journalism</description>
	<lastBuildDate>Wed, 10 Apr 2013 03:17:23 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Copy-paste journalism wants to be free</title>
		<link>http://www.ojr.org/copy-paste-journalism-wants-to-be-free/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=copy-paste-journalism-wants-to-be-free</link>
		<comments>http://www.ojr.org/copy-paste-journalism-wants-to-be-free/#comments</comments>
		<pubDate>Sat, 02 Mar 2013 10:32:39 +0000</pubDate>
		<dc:creator>Pekka Pekkala</dc:creator>
				<category><![CDATA[Business Model]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Reporting]]></category>
		<category><![CDATA[branding]]></category>
		<category><![CDATA[copy-paste journalism]]></category>
		<category><![CDATA[Google News]]></category>
		<category><![CDATA[information wants to be free]]></category>
		<category><![CDATA[personality]]></category>
		<category><![CDATA[stewart brand]]></category>

		<guid isPermaLink="false">http://www.ojr.org/?p=2519</guid>
		<description><![CDATA[If information wants to be free, then stop making copies and find a way to add value to your news product.]]></description>
				<content:encoded><![CDATA[<div id="attachment_2520" class="wp-caption aligncenter" style="width: 450px"><a href="http://www.ojr.org/wp-content/uploads/2013/03/copy-paste-tube.jpg"><img class="size-full wp-image-2520" alt="Credit: avatar-1/Flickr" src="http://www.ojr.org/wp-content/uploads/2013/03/copy-paste-tube.jpg" width="440" height="288" /></a><p class="wp-caption-text">Credit: <a href="http://www.flickr.com/photos/avatar-1/">avatar-1</a>/<a href="http://creativecommons.org/licenses/by-sa/2.0/">Flickr</a></p></div>
<p>Google News is a depressing read for a journalist. It shows you how many news outlets depend on copy-and-paste reporting, regurgitating the same press releases and quotes in an infinite loop. Who needs all these clones of the same story, with the same basic facts and sources?<span id="more-2519"></span></p>
<p>This occurred to me a few weeks ago when I was sent to the<a href="http://cesweb.org/"> Consumer Electronics Show (CES)</a> to cover it for an<a href="http://mikropc.net/"> IT magazine</a> in Finland. The story assignment was the typical “go around, see what the trends are, find a couple of non-mainstream gadgets.”</p>
<p>Events like CES used to be fun for gadget-loving journalists. You walked around, talked to people and filed a story once a night or at the end of the show. But in 2013, everything is different.</p>
<p>It is almost impossible to break any news at the event, because there are tens or hundreds of journalists covering the same press events, tweeting or live-blogging them with video. Speed is everything. How could I write anything significant for a monthly IT magazine that comes out two weeks after the show?</p>
<p>For PR departments in technology companies, this is a dream come true. Your press releases are not buried somewhere in the “news” section of your company web site, which has probably three unique visitors a week. Instead, your products get instant publicity in<a href="http://gizmodo.com/"> Gizmodo</a>,<a href="http://www.engadget.com/"> Engadget</a>,<a href="http://www.theverge.com/"> The Verge</a> or<a href="http://www.cnet.com/"> CNET</a>. Tech enthusiasts share those stories in social media. Eventually they are translated and copied to smaller tech websites around the world.</p>
<p>During the CES, I followed the most hyped topics on news.google.com. It was somewhat heartbreaking to see how many almost identical copies all the journalists covering CES produced. A search for &#8220;LG OLED CES&#8221; produced 1,307 sources. &#8220;Self-driving car CES&#8221; &#8212; 1,247 sources. &#8220;Lego EV3 CES&#8221; &#8212; 234 sources. This is just the English-language media.</p>
<p>There is nothing inherently wrong with having 1,307 LG OLED stories to choose from. However, when they all look the same, we have a problem &#8212; hundreds of copies of the same press release, slightly tweaked. And the more you have copies, the less value a single copy has. In the old days, when all the publications had their own, small print market, readers did not realize they were reading copies. Neither did advertisers.</p>
<p>But the Internet made all this transparent, and this is the main reason why traditional publishers are losing audiences, especially paying ones. Readers will not pay for stories they have already read elsewhere. It does not matter if your brand is 100 years old or you used to be the IT or business publication for the decision makers.<a href="http://justallie.com/2013/01/the-problem-with-paywalls/"> A copy is a copy, even behind a paywall.</a></p>
<p>What is even worse, advertisers realize this as well. They are not willing to pay a premium for a product that is a duplicate, no matter if it is a digital or a print copy.</p>
<p>From a journalistic perspective, this is both good news and bad. The bad news is that fewer stories are needed overall as more and more people cut out the middleman and go straight to the source. This means fewer jobs in traditional media. So if you notice yourself writing the same stories as everyone else, or even worse, using copy-paste more than before, run. Your job will become extinct.</p>
<p>However, there is some good news, too. The abundance of copies forces journalists to find their own voice, niche and style. This is why opinion pieces and columns are doing pretty well on the “most-read” story lists. A personality, at least for now, cannot be broken down to zeroes and ones and copied to hundreds of other sites. It is no coincidence that in the<a href="http://www.theverge.com/2013/2/22/4013406/i-used-google-glass-its-the-future-with-monthly-updates"> exclusive story of Google Glass in The Verge</a>, there were more pictures of the editor-in-chief, Joshua Topolsky, than there were pictures of Google Glass.</p>
<p>The new idea of “more personal” journalism is a challenge, not just for newsrooms but for journalism schools, as well. When I was in journalism school at the end of last century, I learned that journalists create similar stories when they are based on pure facts. You put 10 journalists in a room, give them the same information, and get 10 identical stories.</p>
<p>Nevertheless, as we are moving from an industrial age to a digital one, this notion of a journalist as a kind of “fact mechanic” is slowly transforming. The Internet still needs a few good, solid news pieces about CES that are based on facts. But we don’t need the massive overflow of copies or near-duplicate stories. A computer already does that faster and better with<a href="http://www.guardian.co.uk/media/2012/may/13/robot-journalist-apocalypse-news-industry"> some of the business and sports news</a>.</p>
<p>With computer-generated journalism, the old quote “information wants to be free” is becoming a reality. And it is happening exactly the way <a href="http://en.wikipedia.org/wiki/Information_wants_to_be_free">Stewart Brand</a> predicted: “the cost of getting it (information) out is getting lower and lower all the time.”</p>
<p>Luckily for journalists, the free part is only half of the quote. It actually begins with “information wants to be expensive, because it’s so valuable.” As Brand points out, some of the things you read or see can literally change your life.</p>
<p>Finding life-changing stories every day might be an impossible task. So start from the other end of the quote, by dumping the low-cost stories. Stop making copies &#8211; unless they are produced by a computer.</p>
<p>Start to look around in your organization for things that cannot be copied to zeroes and ones. Humans with personal style are a good start: who is the Andrew Sullivan or Kara Swisher of your newsroom? Or think about adopting a voice or style that is distinctive just for your publication. If you are a local newspaper, be fiercely local. Passionate about food, a sports team or cars? Let it show.</p>
<p>If nobody in the newsroom is wasting time making copies, journalists have more time to dig deeper, make that extra phone call and find another source. That is when you start producing the expensive information. As Brand would say: information so valuable that it might change lives.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ojr.org/copy-paste-journalism-wants-to-be-free/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Does your site really need to be in Google News?</title>
		<link>http://www.ojr.org/p1791/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=p1791</link>
		<comments>http://www.ojr.org/p1791/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 07:23:11 +0000</pubDate>
		<dc:creator>Robert Niles</dc:creator>
				<category><![CDATA[Frontpage]]></category>
		<category><![CDATA[Google News]]></category>
		<category><![CDATA[promotion]]></category>

		<guid isPermaLink="false">http://www.ojr.org/?p=1791</guid>
		<description><![CDATA[With print newspaper circulations crashing faster than the reality-TV hopes of Balloon Boy&#8216;s family, you could forgive newsroom managers for chasing every available source of new readers. For many online publishers, affiliated with newspapers or not, the Holy Grail of traffic is inclusion in the Google News index. Get in Google News, and links to [...]]]></description>
				<content:encoded><![CDATA[<p>With print newspaper circulations crashing faster than the reality-TV hopes of <a href="http://en.wikipedia.org/wiki/Colorado_balloon_incident">Balloon Boy</a>&#8216;s family, you could forgive newsroom managers for chasing every available source of new readers. For many online publishers, affiliated with newspapers or not, the Holy Grail of traffic is inclusion in the <a href="http://news.google.com/">Google News</a> index.</p>
<p>Get in Google News, and links to your stories will be e-mailed to millions of Google&#8217;s news alert subscribers, whenever your stories hit the right keywords. Post a hot story quickly, and you could end up on Google News&#8217; highly clicked front page.</p>
<p>But is inclusion in that index or other search engines&#8217; news indices really worthwhile for the majority of online news publishers? I&#8217;m going to argue&#8230; no. (Well, at least it&#8217;s not worth making a fuss over.)</p>
<p>Why on Earth wouldn&#8217;t a news site want the higher public profile and increased traffic that inclusion in Google News could bring? Look, if your site&#8217;s goal is to appeal to a global audience, especially ones looking for news related to specific keywords and phrases, you need <a href="http://www.www.ojr.org/ojr/stories/080506niles-google-news/">to be in Google News and should do everything you can</a> to get included. If you are CNN, or the New York Times, you need to be in Google News and optimizing your pages to perform well within it.</p>
<p>But what if you aren&#8217;t looking to reach a global audience? What if your site&#8217;s focus is local, as are the readers your advertisers want to reach? What if you are trying to build an online community, cultivating ongoing relationships with a core of contributing readers?</p>
<p>&#8220;Drive-by&#8221; visitors from search engines inflate your site&#8217;s traffic stats, but they don&#8217;t help you reach <i>those</i> goals. Worse, traffic numbers plumped by infrequent visitors clicking news alerts create a distorted picture of your website&#8217;s health and viability.</p>
<p>Many newspaper executives might take some comfort in the large number of readers visiting their newsrooms&#8217; websites. But let&#8217;s look at how <i>engaged</i> those visitors are with these websites.</p>
<p>Or, more accurately, how they are not.</p>
<p>An <a href="http://www.editorandpublisher.com/eandp/news/article_display.jsp?vnu_content_id=1004029999">Editor &#038; Publisher report</a> on September 2009&#8242;s Nielsen Online report on the United States&#8217; top 30 online newspaper websites (by most most unique visitors) showed that the mean amount of time spent for that month on one of those websites was just nine minutes and 22 seconds.</p>
<p>That&#8217;s a tick under 19 seconds per day on average, if you considered each website visitor the equivalent of a daily subscriber. I doubt that even the speediest reader can get through many articles &#8211; much less any advertisements &#8211; in under 19 seconds.</p>
<p>So, clearly, online visitors are not as valuable to today&#8217;s news websites as daily subscribers to the local newspaper were a generation ago. Diminishing engagement with their audiences, whether reflected in lower print circulation numbers or by less time spent on the website, is what&#8217;s driving legacy news businesses&#8217; failure to hold on to their once-lucrative advertising market share. No one&#8217;s going to pay top dollar to reach an audience which isn&#8217;t there.</p>
<p>Start-up local news publishers must act smarter. Work to build your website by developing local community contacts, not fattening the visitor logs with out-of-market visitors driven in by search engines. Use social media to encourage current readers to invite new ones. Build content and report stories that local readers will want to recommend.</p>
<p>Looking over the metrics for the websites I manage, I see a clear pecking order in the amount of time spent on the site versus the way a visitor accessed the site. Here&#8217;s that list, from most time to least:</p>
<ol>
<li>People referred to the site via an e-mail forwarded by a friend or colleague</li>
<li>People searching for the site&#8217;s name in a search engine</li>
<li>People accessing the site via bookmark or direct-typed URL</li>
<li>People accessing the site via a link in its e-mail newsletter</li>
<li>People accessing the site via its Facebook page or Twitter feed</li>
<li>People accessing the site via a direct link from another, non-search website</li>
<li>People accessing the site via a link on another social bookmarking site (i.e. Digg or StumbleUpon)</li>
<li>People clicking from Google News</li>
<li>People searching for a term in a search engine</li>
</ol>
<p>For what it is worth, there&#8217;s a cliff-like drop-off in time spent between the social bookmark links and the Google News and search engine referrals. In my experience with my websites, people whose initial visit to the site is driven by a referral from a friend or colleague, or from searching for the site&#8217;s name in a search engine, spend far more time on the site and are far more likely to return than those referred by a search engine.</p>
<p>As an industry, we&#8217;ve got to develop a deeper reading relationship with our audience. From the data I&#8217;ve seen, the shortest route to that goal lies in building traffic through human connections, not search engines and their news pages.</p>
<p>Okay, so traffic from search engines isn&#8217;t helping build a loyal audience for community-focused publications. But it can&#8217;t hurt, right?</p>
<p>Maybe it can. Forgive me while I drift into speculation here, but I&#8217;ll do this as an appeal to readers who might be more connected with the &#8220;dark arts&#8221; of Internet marketing than I am. Of the sites I&#8217;ve run over the years, the ones included in the Google News index have encountered a far, far greater incident of spam attempts in comments and other UGC features than those not included in the index.</p>
<p>And that&#8217;s not explained simply by site popularity, either. My two biggest family-owned websites are not in the Google News index, but OJR is. And OJR elicits exponentially more comment spam submissions than the other two sites, despite the fact that those sites receive around <i>five to 10 times</i> the daily traffic of OJR. (It&#8217;s gotten so bad that we now hold all comments not from site authors for approval before posting on OJR.)</p>
<p>If you&#8217;re ready to dismiss that observation as a single data point (and you should be), allow to me suggest that others may be experiencing the same. Speaking with other Web publishers, I&#8217;ve heard those whose sites are in the Google News index report getting hit with platform-independent comment spam at a far higher rate than those whose sites are not. (This isn&#8217;t to say that sites not in Google News don&#8217;t suffer spam attacks. The highly popular sites not in the Google News index tend to be blogs and forums running off-the-shelf publishing software, which from time to time attract spam attacks targeted specifically at those publishing systems. But those attacks are aimed <i>at the publishing platform</i> more than at the individual websites.) These submissions are typically human-generated, and include link spam either in the comment itself, or on the reader&#8217;s site profile page.</p>
<p>Are spammers targeting sites in the Google News index? I haven&#8217;t spent enough time with the black hats of the &#8216;net to know, despite my suspicion. Consider this my appeal to those who have to provide an answer.</p>
<p>In the meantime, from a system administration stand-point, I want my website to be well-known to people in its target community&#8230; and completely off the radar of spammers and search engine black hats. To me, that means:
<ul>
<li>selecting a publishing system with an enthusiastic support community that&#8217;s aggressive about security,</li>
<li>making sure that my site&#8217;s home page uses <a href="http://www.www.ojr.org/ojr/people/robert/200905/1733/">sound search engine optimization techniques</a> to appear at the top of results pages for my site&#8217;s name and its community name,</li>
<li>and spending my energy to cultivate connections within my target community, offline and on, staying clear of link swaps, black hat SEO and becoming a spammer myself.</li>
</ul>
<p>Getting into Google News? (Or Yahoo! News or Bing&#8217;s news page?) Meh. Put that at the bottom of your priority list. As an online news publisher, you have better ways of building your readership community. Focus on those, instead.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ojr.org/p1791/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Rewriting history: Should editors delete or alter online content?</title>
		<link>http://www.ojr.org/070822Zwerling/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=070822Zwerling</link>
		<comments>http://www.ojr.org/070822Zwerling/#comments</comments>
		<pubDate>Wed, 22 Aug 2007 11:07:32 +0000</pubDate>
		<dc:creator>Elizabeth Zwerling</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Google News]]></category>
		<category><![CDATA[newspaper archives]]></category>
		<category><![CDATA[The New York Times on the Web]]></category>

		<guid isPermaLink="false">http://www.ojr.org/?p=1348</guid>
		<description><![CDATA[From college papers to The New York Times, sources clamor to "take it back," asking for old quotes and comments to be deleted from websites. Should they be?]]></description>
				<content:encoded><![CDATA[<p><i>Elizabeth Zwerling is an associate professor of journalism at the University of La Verne in Los Angeles County.</i></p>
<p>By the time I got the e-mail from the spokeswomen for a major credit card company asking me to delete her quotes from an article we&#8217;d run almost a year before, I was skeptical. She had already contacted the reporter with various versions of her concern: she&#8217;d been speaking off the record, the reporter must have confused her with another source, the quotes were wrong. A man &#8220;representing&#8221; her had called the managing editor urging him to omit the quotes from the archive. &#8220;I think he was a lawyer,&#8221; the managing editor told me at the time. (He wasn&#8217;t.)</p>
<p>I&#8217;m faculty adviser for the <a href=http://www.ulv.edu/ctimes/>Campus Times</a>, a 2,000-circulation weekly newspaper of the University of La Verne in Los Angeles County. My staff of undergraduates occasionally gets things wrong and corrects them. But this was a solid story by a conscientious reporter, puzzled by the content, urgency and timing of the source request.</p>
<p>Most likely the credit card spokeswoman – a woman a Google search revealed is widely quoted by Reuters and CNN, among others – had searched herself online and found our story about college students and credit card debt, in which she spoke openly, if off-message, about the age group&#8217;s unchecked spending habits.</p>
<p>Easy access to online news archives is one of the Web&#8217;s amazing benefits for journalists – or anyone wanting background on people or events. But the fact that last year&#8217;s or the last decade&#8217;s news stories are just a mouse-click away means that anything one says to a reporter – perhaps in a moment of vulnerability – can be entered into a very visible long-lasting record. The visibility of this record, its effects and what to do about those, if anything, is a contentious topic among editors and ethicists across the nation, as the sense – and the reality – of new media is that stories live long past their press dates.</p>
<p>The credit card spokeswoman scenario was fairly easy to resolve: The reporter had kept her notes, we reviewed them against the archived story and the now 2-year-old story remains unchanged in our archive. The spokeswoman&#8217;s discomfort with the story, particularly given her profession, I concluded, did not come close to a threshold for altering the permanent record.</p>
<p>A few months earlier a colleague shared a similar scenario, albeit with a more dramatic request. In late 2005 he was asked to alter the archive of a 1999 story about same-sex couples by one of the sources profiled in the La Verne Magazine. &#8220;She said she wasn&#8217;t gay anymore,&#8221; said George Keeler, journalism professor and magazine adviser. &#8220;It was a painful thing, but I wrote her back and said I wasn&#8217;t going to erase (her past),&#8221; The story, now eight years old, come up first when the source&#8217;s name is typed into Google and Yahoo!&#8217;s engines.</p>
<p>&#8220;It&#8217;s not like it used to be when clippings would just molder in the morgue of the newspaper office,&#8221; said Craig Whitney, standards editor for the New York Times, who said the Times frequently fields requests to alter archives.</p>
<p>&#8220;A source will call saying the paper reported an arrest, then didn&#8217;t report the dismissal of the case,&#8221; Whitney said. &#8220;We can&#8217;t go re-report the who (sometimes 20-year-old) story and we can&#8217;t just take their word for it: &#8216;The judge threw out the case.&#8217; &#8216;Where&#8217;s the judge?&#8217; &#8216;He&#8217;s dead.&#8217; &#8216;Where&#8217;s the record of the case?&#8217; &#8216;In some archive in Fort Dix.&#8217; We recognize it&#8217;s frustrating. We can&#8217;t do anything.<a name=start></a></p>
<p>&#8220;Sometimes it&#8217;s a case where somebody is embarrassed about a part of their past that they don&#8217;t deny, which wasn&#8217;t so prominent (before online archives and Google),&#8221; Whitney said.</p>
<p>The New York Times has received requests from divorced couples to remove archived stories about their marriages, said Leonard Apcar, former editor-in-chief of NYTimes.com.</p>
<p>&#8220;We&#8217;ve always had a sense that the archive is historical,&#8221; Whitney said. &#8220;What&#8217;s changed is now anybody can consult it from home. We haven&#8217;t figured out what to do, if anything. We&#8217;ve had some meetings and we&#8217;ll have some more to… figure out something to do that&#8217;s ethically responsible, that doesn&#8217;t compromise the integrity of the archives, but addresses the need for clarification, elaboration,&#8221; Whitney said adding that the Times has never deleted anything from its online archives. &#8220;I doubt if we ever would. The question is, is there something else we can do that falls short of rewriting history?&#8221;</p>
<p>The answer to that question seems to depend on the story, the publication and a variety of circumstances, which like the medium, are still evolving.</p>
<p>Editors at the Pasadena (Calif.) Weekly felt they found a fair solution when in 2006, they decided to remove the name of an ex-con from an archived story, six months after it came out in print.</p>
<p>Joe Piasecki, the paper&#8217;s deputy editor who also reported the story, had covered a protest at San Quentin Prison a week before the execution of Crips co-founder Stanley Tookie Williams, where he interviewed a man who said he&#8217;d been in prison with Williams. Piasecki researched the man&#8217;s background through the Oakland Tribune&#8217;s (offline) coverage of the man&#8217;s 1998 trial and found the man had been charged with raping and sodomizing his former girlfriend, and convicted of assault. Piasecki included that information in the story along with the man&#8217;s claim that he was innocent. &#8220;I&#8217;d called the Tribune library (to make sure) he was who he said he was,&#8221; Piasecki said.</p>
<p>The story ran Dec. 8, 2005, in the Weekly, its sister paper the Ventura County Reporter, and on the Reporter&#8217;s Web site. At the time the story went up, the Pasadena Weekly didn&#8217;t have a functioning Web archive, so the source&#8217;s call went to the Ventura, Calif., newsroom first. Then Piasecki and Pasadena Weekly Editor Kevin Uhrich were consulted.</p>
<p>&#8220;Our first reaction was &#8216;no don&#8217;t change it&#8217;,&#8221; Piasecki said. &#8220;I tend to say that unless (the reporter) screwed up, don&#8217;t change it. What&#8217;s true is true.&#8221;</p>
<p>Piasecki said his publication made an exception here because the man wasn&#8217;t familiar with the Internet, and because his quotes toward the end of a story about someone else, were not critical to its &#8220;material essence.&#8221; The man had served two years at San Quentin and remembered seeing Williams there; his quotes added color to the story, Piasecki said. The quotes are still in the Ventura newspaper&#8217;s online archive, only the man&#8217;s name was removed.</p>
<p>&#8220;The guy said every time he applied for a job they Googled his name and this was the only hit,&#8221; Piasecki said. &#8220;We took his name out so he could move on with his life.&#8221;</p>
<p>&#8220;I didn&#8217;t see any harm,&#8221; Uhrich said, adding this is the only time the Weekly has edited an archived story beyond correcting specific factual errors and taking offline a guest editorial he learned after publication was largely plagiarized. (The paper&#8217;s own Web site hosts archives dating back to January 2006.)</p>
<p>At the New York Times, even plagiarized stories remain as part of the permanent record. Those by ex-Times reporter Jayson Blair still appear intact in the Times archives with editor&#8217;s notes appended to the articles.</p>
<p>&#8220;The Jayson Blair stories are going to (stay) in the archives,&#8221; Whitney said. &#8220;We can&#8217;t pretend he was never here.&#8221;</p>
<p>Because Internet databases do not discriminate in what they pick up and store, however, a ProQuest search of a Jayson Blair story with plagiarized sections called up the story without the editor&#8217;s notes.</p>
<p>Despite the timeless nature of online postings, laws that protect news outlets have not changed. No matter how emphatic or justified a source&#8217;s complaint may be, any threat to take legal action against the reporter or news organization after the one-to-two-year statute of limitations for libel law is an idle threat, said Roger Myers, general counsel for the California First Amendment Coalition.</p>
<p>Ethically, however, dealing with source requests to alter online archives is increasingly complicated, and as with just about every aspect of online journalism, still evolving.</p>
<p>When a story, column or even a reader response to a story is posted online then transferred to the publication&#8217;s archive, &#8220;it&#8217;s a matter of record,&#8221; said Robert Steele, a scholar of journalism ethics and values at the Poynter Institute in St. Petersburg, Fla. &#8220;To change it would change a piece of history.&#8221;</p>
<p>If editors start removing some stories or parts of stories from archives, readers will begin to wonder what else is missing, Steele said.</p>
<p>And yet Steele, who advises newsroom leaders on a variety of ethical issues, acknowledges that in the rapidly changing media landscape, there are no absolutes.</p>
<p>&#8220;If it can be proven that the material did not come from the person whose name is attached, that would be a reason to take something down,&#8221; he said. &#8220;If it is substantially inaccurate, that would be a reason to correct it and in a rare case take it down.&#8221;</p>
<p>Or, Steele added, if a source could make a convincing argument that the story&#8217;s accessibility online poses a &#8220;profound and immanent threat to their well-being,&#8221; that might be a case to consider altering or deleting it from the record. Though he emphasized that these would be rare exceptions.</p>
<p>In the rare case when an editor does change or delete a story from the archive, there is no guarantee the original version of the story won&#8217;t come up in a Google search. As Paul McAfee, director of interactive operations at the Press Enterprise newspaper in Riverside, Calif., explained: &#8220;The major search engines crawl the news Web sites on a regular basis. They could pull up an erroneous story and &#8216;cache&#8217; it in their archives. &#8220;Hopefully they will pick up the correction,&#8221; he said. Though he added that it&#8217;s likely that both the original and the updated version of the story will come up in a search.</p>
<p>There are formal request processes to have items removed from Google and the other search engines, but there is no guarantee their decision-makers will honor the request. Under federal law, &#8220;Internet entities that host other people&#8217;s content are not liable for that content.&#8221; Myers said.</p>
<p>While McAfee said policy at the Press Enterprise is to not alter any accurate news archive, he recently helped a reader who&#8217;d posted offensive comments on <a href=http://www.pe.com>pe.com</a>&#8216;s message board, then wanted the comments deleted.</p>
<p>&#8220;Someone wrote a comment that sounded really racist, then a few months later they saw the light and changed their opinion,&#8221; McAfee said. When the poster asked McAfee to remove the comments from the message board, he agreed to. Unlike its editorial content, postings on the publication&#8217;s electronic message board are eventually purged automatically, he said. Because they are generated by the public and not by the newspaper&#8217;s editorial department, these message boards are not subject the publication&#8217;s editorial policies, McAfee said.</p>
<p>&#8220;I wrote (the poster) back, &#8216;It&#8217;s off our site.&#8217; They wrote back &#8216;yes but it&#8217;s still cashed in Google.&#8217; The Google spiders picked it up, it was stuck in Google&#8217;s cache. The person asked me to intercede with Google. I sent them the Web address and a form for Google. I didn&#8217;t do it for them,&#8221; McAfee said. &#8220;We disclaim any responsibility for anything on our message boards.&#8221;</p>
<p>Letters to the editor, on the other hand, are different from message board postings when it comes to online archives, editors say.</p>
<p>&#8220;We&#8217;ve had many experiences where letter writers, who espouse some wild or provocative opinion, want the letter taken off the Web years later,&#8221; said Clint Brewer, executive editor of the City Paper in Nashville, Tenn., and the Society of Professional Journalists national president-elect. But letters are also part of the historical record, he said.</p>
<p>Brewer said that while the landscape has changed dramatically, at this point newsroom leaders have a long-standing set of standard for accuracy and preserving the historical record based on the print journalism model. &#8220;It&#8217;s not apples to apples (but) that&#8217;s a logical place to start,&#8221; he said.</p>
<p>McAfee said he hopes the visibility and permanence of the online record – and the fact that even stories subsequently edited for accuracy may live online alongside the uncorrected versions – will make journalists take their job of getting it right more seriously than ever.</p>
<p>Whitney believes such visibility and permanence will affect sources: &#8220;I think that the arrival of YouTube and Internet and the fact that images and text last forever means that actions have lasting consequences. It&#8217;s more important than it ever has been for people before they do something (to consider the) consequences.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ojr.org/070822Zwerling/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Are search engines stealing newspapers&#039; content?</title>
		<link>http://www.ojr.org/are-search-engines-stealing-newspapers-content/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=are-search-engines-stealing-newspapers-content</link>
		<comments>http://www.ojr.org/are-search-engines-stealing-newspapers-content/#comments</comments>
		<pubDate>Fri, 06 Apr 2007 13:47:41 +0000</pubDate>
		<dc:creator>Robert Niles</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google News]]></category>
		<category><![CDATA[news aggregators]]></category>

		<guid isPermaLink="false">http://www.ojr.org/?p=1311</guid>
		<description><![CDATA[Commentary: Who cares? They're driving traffic and smart newspaper employees ought to find ways to take advantage.]]></description>
				<content:encoded><![CDATA[<p>Sam Zell might be new to the newspaper business, but he&#8217;s already publicly embraced the “<a href="http://www.www.ojr.org/ojr/stories/070301niles/">Internet is a parasite on newspapers</a>&#8221; meme.</p>
<p>&#8220;If all the newspapers in America did not allow Google to steal their content for nothing, what would Google do, and how profitable would Google be?&#8221; the <a href="http://www.latimes.com/business/la-fi-zell6apr06,1,3684937.story">Los Angeles Times reported</a> Zell saying to Stanford University class this week. Zell has agreed to takeover Tribune Co., which owns The Times.</p>
<p>Zell&#8217;s comment is as ill-informed as outgoing ASNE president <a href="http://www.www.ojr.org/ojr/stories/070329niles/">Dave Zeeck&#8217;s were to his organization</a> last week. I&#8217;ve been searching the Web through Google, and reading Google News, for years, and can&#8217;t recall Google publishing complete newspaper stories under the Google brand.</p>
<p>Yes, Google hyperlinks page titles and publishes short snippets of those pages&#8217; content beneath them on its search engine result and Google News pages. Those links have helped millions of readers find newspaper stories that they would not have read otherwise.</p>
<p>Without newspaper content on Google News and in Google&#8217;s search engine result pages [SERPs], newspapers would face an even more dire future, as those millions of readers would find instead other, non-newspaper sources of news and information. And Google wouldn&#8217;t lose much at all. Google News doesn&#8217;t run ads. And I suspect that Google&#8217;s AdWords program would continue to haul in billions of dollars annually even without newspaper.coms in the SERPs.</p>
<p>Surely, not everyone in the newspaper industry shares Zell&#8217;s extreme view. For all the ignorance that certain newspaper managers exhibit in public forums, the newspaper industry employs many more sharp individuals with deep knowledge of how the Web works and how to make money from it. They&#8217;re found in the online departments of newspaper.coms and they deserve their chance to call the shots on how newspapers will approach the Internet.<a name=start></a></p>
<p>The Zeeck and Zell attitude won&#8217;t save newspapers, and will serve only to further isolate them from a new and growing generation of Web-savvy readers.</p>
<p>I e-mailed several newspaper.com managers to ask them what they thought of Zell&#8217;s comment, and how they think the newspaper industry ought to approach the search engines.</p>
<p><b>Chris Jennewein</b><br />
Vice President, Internet Operations, <a href="http://www.signonsandiego.com/">Union-Tribune Publishing Co.</a></p>
<blockquote><p>I think newspapers should welcome search engines because they drive traffic to our sites. Our business on the Internet is all about building audience, and audiences find our sites through search engines. Fighting the new reality of Internet search is ultimately self-defeating for our industry.</p></blockquote>
<p><b>Ken Sands</b><br />
Online publisher, <a href="http://www.SpokesmanReview.com">www.SpokesmanReview.com</a></p>
<blockquote><p>Well, first reaction is that Zell is talking primarily about Google News, not Google itself. He suggests Google wouldn&#8217;t have anything without the content from media, and that&#8217;s just not true. Google News aggregates links. It doesn&#8217;t steal the content so much as organize it in a way that&#8217;s meaningful for lots of people. Mainstream media could have done the same thing but didn&#8217;t have the vision, the organizational capabilities or the technical skill.</p>
<p>Google argues that it&#8217;s a partner of the MSM, sending readers to each site (for free!). I can&#8217;t argue with that. Almost 40 percent of the visitors to our site come through Google searches. Those are typically one-time, one-story readers, but they do account for a huge amount of our traffic. There&#8217;s a legitimate question about whether these inflated numbers are meaningful&#8230; not sure this does our local advertisers much good.</p>
<p>On balance, I&#8217;d say Google does more good than harm. The Google Ad Sense for Newspapers alpha project is a good example. They&#8217;ve sent advertising to print newspapers for several months without taking a cut of the action. When the program goes to beta, they will begin taking a small percentage. They seem to understand that their best way of making money is to make sure that everyone else makes money, too.</p>
<p>Will Google some day dominate the media world? Quite possibly. But for now at least, it&#8217;s sure hard to pass up their goodies.</p></blockquote>
<p><b>Steve Yelvington</b><br />
Internet strategist, <a href="http://www.morrisdigitalworks.com">Morris Communications</a></a></p>
<blockquote><p>Sam Zell&#8217;s comments make a lot of editors feel good, and editors need something to feel good about these days. But news search and aggregation are just a piddling part of Google&#8217;s portfolio of services. Google doesn&#8217;t even run advertising in its news channel, so claiming that newspaper content is behind Google&#8217;s profitability is just saber-rattling.</p>
<p>Google is a significant threat, but not because it&#8217;s &#8220;stealing&#8221; newspaper content. They&#8217;re rolling up a lot of local small business money that newspapers can and should be chasing. We&#8217;re not going to get that business by sitting around in our newspaper offices and wishing for the return of sideburns and bell-bottom trousers. We need to be out creating new solutions and coming up with better ideas.</p></blockquote>
<p>I&#8217;ll post additional comments as they hit my in-box. Or you can post your thoughts through the comment button below.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ojr.org/are-search-engines-stealing-newspapers-content/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Non-traditional sources cloud Google News results</title>
		<link>http://www.ojr.org/050519ulken/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=050519ulken</link>
		<comments>http://www.ojr.org/050519ulken/#comments</comments>
		<pubDate>Thu, 19 May 2005 14:38:33 +0000</pubDate>
		<dc:creator>Eric Ulken</dc:creator>
				<category><![CDATA[Editing]]></category>
		<category><![CDATA[Google News]]></category>
		<category><![CDATA[news aggregators]]></category>

		<guid isPermaLink="false">http://www.ojr.org/?p=548</guid>
		<description><![CDATA[Additional research suggests that the search engine's selection of online-only news sources to include in Google News skews its search results toward political extremes.]]></description>
				<content:encoded><![CDATA[<p>OJR readers may remember a September 2004 <a href="http://www.www.ojr.org/ojr/technology/1095977436.php">article</a> by contributing editor J.D. Lasica that suggested a political bias in the popular online news portal <a href="http://news.google.com/">Google News</a>.  Searching on the term “John Kerry,” Lasica cited several stories from “second-tier” online-only news and commentary sites that appeared to have a conservative tilt.  Among them were headlines such as “Swift Boat Veterans for Truth Expose John Kerry’s Lies.”</p>
<p>Lasica’s piece got me thinking about ways to measure bias in search results.  His observations became the basis for my recently completed master’s thesis, the findings of which may be of interest given the ongoing debate about the quality of Google News’s sources. While my analysis does not indicate an overall conservative or liberal slant, it does confirm Lasica’s suspicion that non-traditional news sources are injecting ideologically biased articles into Google News search results.  The data show that articles returned in Google News searches are significantly more likely to have an ideological bias than those returned in searches on Yahoo News. (See below for detailed study results.)</p>
<p>The study examines search results for evidence of bias.  By analyzing the content of articles returned in searches on the major-party presidential candidates in the days leading up to the 2004 election, it aims to assess the aggregator&#8217;s level of political bias.  The study looks at balance within these articles as an indicator of bias, using results from the same searches on Yahoo News as a benchmark.</p>
<p>Notably, almost all of the additional bias in articles returned by Google News searches can be attributed to the site’s use of non-traditional news sources.  In other words, if we consider only sources affiliated with old-media companies, the average bias scores for articles on Google News and Yahoo News are virtually identical.</p>
<p>Google News, still in beta three and a half years after its launch, tracks the top stories on some 4,500 English-language news sites, updating its index roughly every 15 minutes.  The ability to effectively search this huge collection of timely information has helped make Google News one of the Internet’s most popular news portals, drawing about 5.9 million visitors a month.</p>
<p>But its groundbreaking method of identifying top stories based on how frequently they appear on sites in its index – and doing so entirely without human intervention – put the portal in critics’ crosshairs from the beginning.</p>
<p>That its algorithms are able to automatically determine relative importance of stories and present a front page with top stories in different subject areas has been interpreted by some as an ominous sign that computers will someday make human editors obsolete.  At the same time, users have ridiculed bugs that cause the site to occasionally attach a photo to an unrelated article or elevate a relatively minor story to a prominent spot on its front page.</p>
<p>(It should be noted that my research concerns only the site’s search results and is unrelated to Google News’s practice of automatically ranking the top stories on its front page and section fronts.)</p>
<p>Because it uses no human editors, Google News has considered itself immune to bias.</p>
<p>“The algorithms do not understand which sources are right-leaning or left-leaning,” Google News inventor Krishna Bharat told Lasica last year. “They’re apolitical, which is good.”</p>
<p>But choosing which sites to index is perhaps as subjective an editorial decision as selecting the stories to play on the front page of a newspaper or website.</p>
<p>Google News does not share the list of sites it crawls, a practice that has resulted in a lot of speculation about its criteria for inclusion and the notion that there might be some ideological imbalance in its list of sources.</p>
<p>In an attempt to <a href="http://www.privateradio.org/blog/i/google-news/index.php">shed some light</a> on the question, one blogger has written a script that grabs the news portal’s front page regularly and logs all the sources that it finds. The count stood at 2,256 as of Wednesday night, indicating that about half of the 4,500 sources have been identified.</p>
<p>Along with the mainstream sites in the list are a number of relatively obscure, online-only news sources (some of which are best described as weblogs), including the opinion sites <a href="http://michnews.com/">MichNews.com</a> and <a href="http://useless-knowledge.com/">Useless-Knowledge.com</a>.</p>
<p>Earlier this year, Google News dropped several sites, including the white supremacist journal <a href="http://www.nationalvanguard.org/">National Vanguard</a>, from its index after users complained that hate speech was turning up in searches.</p>
<p>It seems the news portal has been making plenty of its own news lately – albeit unwittingly.  In March, Agence France-Presse charged in a lawsuit that Google was infringing its copyright by displaying AFP material on Google News pages.  Days later, Google announced it would <a href="http://www.eweek.com/article2/0,1759,1778139,00.asp">stop using AFP content</a>.  Since then, the Associated Press also has <a href="http://pqasb.pqarchiver.com/latimes/819950311.html?did=819950311&#038;FMT=ABS&#038;FMTS=FT&#038;date=Apr+11%2C+2005&#038;author=Chris+Gaither&#038;desc=Web+Giants+Go+With+Different+Angles+in+Competition+for+News+Audience%3B+Yahoo+licenses+feeds+of+stories+while+Google27s+software+finds%2C+selects+and+links+to+articles">expressed “concern”</a> about Google’s use of its material without payment.</p>
<p>And just this month we learned of a <a href="http://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&#038;Sect2=HITOFF&#038;p=1&#038;u=%2Fnetahtml%2FPTO%2Fsearch-bool.html&#038;r=1&#038;f=G&#038;l=50&#038;co1=AND&#038;d=PG01&#038;s1=20050060312.PGNR.&#038;OS=DN/20050060312&#038;RS=DN/20050060312">patent application</a> filed by Google scientists in 2003, laying claim to methods of “improving the ranking of news articles” based on the “quality” of the articles’ sources – an apparent admission that relevance alone is not a satisfactory measure of an article’s value.</p>
<p>Google’s patent application offers the following variables, among others, as possible measures of a source’s quality: the volume of traffic it receives, the amount of content it produces, the speed at which it responds to breaking news, the size of its editorial staff and the number of bureaus it maintains.  Any of these factors would appear to favor traditional media outlets.</p>
<p>If this is an admission that non-traditional sources are of lower quality, how does that square with Google News’s stated goal of increasing the <a href="http://news.google.com/intl/en_us/about_google_news.html">diversity of viewpoints</a> presented on its pages?</p>
<p>Google News currently does not distinguish opinion from fact in its search results (though it now attempts to identify press releases and satire).  Hence, editorials and other opinion pieces frequently appear alongside straight news stories in search results.  It is not clear that average users can make the distinction between the two, especially given the many online-only sources that peddle a confusing mixture of fact and opinion.</p>
<p>Ranking news stories based on some measure of quality may be a step in the right direction, but to maintain its credibility, Google News needs transparency – both in its selection criteria and its list of sources.</p>
<hr width=200 size=1 noshade>
<h2>Key findings of the study</h2>
<p>I was intrigued by the notion that a site without human editors might still be biased, and I wanted to test it scientifically.  To do this, I analyzed the content of articles returned in searches on “George W. Bush” and “John Kerry” in the weeks leading up to the 2004 election.  [More complete results and a detailed description of the research process are available in the <a href="http://ulken.com/thesis/googlenews-bias-study.pdf">full study</a> (PDF).]</p>
<p>I wrote a crawler script to retrieve the results from Google News and Yahoo News for the search terms “George W. Bush” and “John Kerry” at four-hour intervals.  The program run was during the two weeks preceding the Nov. 2 presidential election, resulting in a total of 80 “snapshots.”  Each snapshot contained four sets of search results: “George W. Bush” on Google News, “George W. Bush” on Yahoo News, “John Kerry” on Google News and “John Kerry” on Yahoo News.  The program also downloaded the full text of the top articles returned in each result list.</p>
<p>For each of five snapshots, chosen randomly, the first five articles from each of the four result lists were analyzed, ensuring an equal number of Bush and Kerry results and an equal number of Google News and Yahoo News results.  This resulted in a sample of 100 articles, which then were examined sentence-by-sentence.  Overall, 1,587 sentences were coded in one of five ways:</p>
<ul>
<li>Favorable to Bush</li>
<li>Unfavorable to Bush</li>
<li>Favorable to Kerry</li>
<li>Unfavorable to Kerry</li>
<li>Neutral</li>
</ul>
<p>Using the values for each sentence, two scores are calculated for each article, measuring the degree of the article’s overall favorability to each candidate. These favorability scores could take values of –1 (completely unfavorable) to 1 (completely favorable), with 0 being neutral. For instance, a Kerry favorability score of –0.3 for an article would indicate that, on balance, 30% the content of an article is unfavorable to John Kerry.</p>
<p>Two charts – one for Google News and the other for Yahoo News – provide a basic summary of the data. They show the two candidates’ favorability scores for each article, plotted against each other. This facilitates comparison of the overall favorability of the two portals’ search results.</p>
<p><b>Favorability plots by news portal</b></p>
<div align=center><img src="/ojr/images/549/favorability_google.gif" width=340 height=360 alt="Google"></p>
<p><img src="/ojr/images/549/favorability_yahoo.gif" width=340 height=360 alt="Yahoo"></div>
<p>Each data point represents an article, and its placement on the chart represents its favorability to the two candidates:</p>
<ul>
<li>Upper left quadrant: Article is favorable to Kerry and unfavorable to Bush</li>
<li>Upper right quadrant: Article is favorable to both</li>
<li>Lower right quadrant: Article is favorable to Bush and unfavorable to Kerry</li>
<li>Lower left quadrant: Article is unfavorable to both</li>
</ul>
<p>In other words, articles in the upper right and lower left are more balanced than those in the upper left and lower right. Articles closer to the center are more neutral. The circular boundary is a density ellipse drawn to make it easier to see patterns in the data.</p>
<p>To determine the direction of bias in a particular story, we compare favorability scores for Bush and Kerry.  Where they are similar, the article is more balanced.  Each article is assigned a balance score, which is the difference between the two favorability scores.  A balance score greater than 0 would indicate bias toward Kerry while a negative score shows bias toward Bush.  Both Google News and Yahoo News have average article balance scores that are very close to 0, indicating balanced search results.  In other words, both the Google News and Yahoo News searches returned articles that were, on the whole, equally favorable to both George W. Bush and John Kerry.  This is what we would expect to see of balanced search results at a time when public opinion was pretty evenly divided between the two candidates.</p>
<div align=center><img src="/ojr/images/549/balancescores.gif" width=340 height=325 alt="Balance scores"></div>
<p>However, the spread of articles’ balance scores reveals an important difference:  Articles returned by Google News tend to be significantly more biased in one direction or the other than articles from Yahoo News.</p>
<p>Besides being coded for favorability, articles were also classified by whether they came from an independent, online-only source (such as Salon.com) or a website affiliated with a traditional news source.  A traditional news source is defined as a wire service, newspaper, magazine, TV station, radio station, broadcast network or cable network.  (Content from one of these sources that is syndicated on a news aggregator such as Yahoo News is also considered traditional.)  Of the articles returned by Google News, 40% were from non-traditional news sources, while only 24% of the Yahoo News results came from non-traditional sources.</p>
<p>When articles from non-traditional sources are omitted from the comparison, there is no significant difference in the spread of the article balance scores between Google News and Yahoo News.  This indicates that virtually all of the difference in bias between articles returned by Google News and those returned by Yahoo News is attributable to Google’s use of non-traditional news sources.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ojr.org/050519ulken/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>