<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
		xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data &#187; Thomson Reuters</title>
	<atom:link href="http://cloudofdata.com/tag/thomson-reuters/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</description>
	<lastBuildDate>Fri, 10 Feb 2012 10:46:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud Computing, Semantic Web, Linked Data, Open Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.com/logo300x300.jpg" />
		<item>
		<title>Strata Conference 2011, Day 2 Keynotes</title>
		<link>http://cloudofdata.com/2011/02/strata-conference-2011-day-2-keynotes/</link>
		<comments>http://cloudofdata.com/2011/02/strata-conference-2011-day-2-keynotes/#comments</comments>
		<pubDate>Wed, 02 Feb 2011 18:13:07 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[BigData]]></category>
		<category><![CDATA[edd dumbill]]></category>
		<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[Mark Madsen]]></category>
		<category><![CDATA[Thomson Reuters]]></category>
		<category><![CDATA[Werner Vogels]]></category>
		<category><![CDATA[Windows Azure DataMarket]]></category>
		<category><![CDATA[Zane Adam]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1490</guid>
		<description><![CDATA[Day 2, and after yesterday&#8217;s tutorials the conference is really getting going. Here&#8217;s a stream of consciousness from the morning&#8217;s keynotes at this sold-out event. Conference chair Edd Dumbill is introducing things, talking about William Smith&#8216;s nineteenth century map of geological strata in the British Isles, the rise of industrialisation, and the move to towns. [...]]]></description>
			<content:encoded><![CDATA[<p>Day 2, and after yesterday&#8217;s tutorials the conference is really getting going.</p>
<p>Here&#8217;s a stream of consciousness from the morning&#8217;s keynotes at this sold-out event.</p>
<p><span id="more-1490"></span></p>
<p>Conference chair <a class="zem_slink" title="Edd Dumbill" rel="homepage" href="http://twitter.com/edd">Edd Dumbill</a> is introducing things, talking about <a href="http://en.wikipedia.org/wiki/William_Smith_(geologist)">William Smith</a>&#8216;s nineteenth century map of geological strata in the British Isles, the rise of industrialisation, and the move to towns. Edd suggests that a similar set of inflections are happening today in the world of data; &#8216;the start of something big.&#8217;</p>
<p>&#8220;In the same way that the industrial revolution changed what it meant to be human, the data revolution is changing what it means to be alive.&#8221;</p>
<p>The first of this morning&#8217;s keynotes; Hilary Mason from link shortener <a class="zem_slink" title="bit.ly" rel="homepage" href="http://bit.ly">bit.ly</a>.</p>
<p>Data and the people who work with data; &#8220;The state of the data union is strong.&#8221; Data scientists have an identity &#8211; a place to rally around &#8211; with Strata.</p>
<p>We have accomplished much, begging, borrowing and stealing from lots of domains. We have the tools. We have the capacity to spin up infrastructure in the Cloud. We have the algorithms to explore data, and to learn from it.</p>
<p>The most important thing we have now that we didn&#8217;t have before&#8230; is momentum. People are paying attention.</p>
<p>There are still challenges though. Timeliness of data is an issue, especially in real-time. We need to develop systems that can do robust analysis against a moving stream of data. We need to be able to store data in ways that let us operate on it in real-time. Hadoop&#8230; amazing &#8216;because I can run a query and get the result back before I forget why I submitted the query in the first place.&#8217; We need training. We need imagination, not more ad optimisation networks. We have a real opportunity to do something better.</p>
<p>Opportunities (expressed in context of bit.ly); Bit.ly gets lots of data from people shrinking web links. They learn a lot about people; what they like, what they want, what they&#8217;re doing. bit.ly also gets rich segmentation data; location, context, etc. bit.ly sees global data, for example clicks on bit.ly links from Egyptian domains.</p>
<p>Now that we have all this data, it offers a window on to the world. What can we do with it? Make the world a better place? What would <em>you</em> do with all of this data?</p>
<p>Next up, James Powell from <a class="zem_slink" title="Reuters" rel="homepage" href="http://reuters.com">Thomson Reuters</a> to talk about privacy and behavioural data in B2B contexts. Thomson Reuters gathers large amounts of global data, and filters it for customers. Time and context key; 700,000 updates a second through financial systems, 5,000,000 documents per day served through <a class="zem_slink" title="OpenCalais" rel="homepage" href="http://www.opencalais.com">Open Calais</a>, etc. Thomson Reuters interested in ways to filter information better.</p>
<p>Need to think about B2B implications of behavioural data, especially as we sell/exchange increasing volumes of data with partners. Consumers <em>reasonably</em> comfortable with giving up some personal data in return for a &#8216;better&#8217; product (Amazon recommendations, etc), that probably doesn&#8217;t scale to the enterprise. For example, Open Calais customers submitting large numbers of dummy queries to obfuscate what they&#8217;re really looking for&#8230;</p>
<p>Key problem that needs to be addressed is ambiguity; many systems in this space still rely upon implicit assumptions, whilst the enterprise is used to explicit contracts. Tension &#8211; or recipe for disaster?</p>
<p>Keys to success &#8211; need to treat behavioural data differently/better, and avoid the mistake of simply continuing consumer trends.</p>
<p>Next, Mark Madsen from Third Nature, talking about &#8216;the Mythology of Big Data.&#8217;</p>
<p>Lots of assumptions underlying conversations about Big Data. &#8216;Every technology carries within itself the seeds of its own destruction.&#8217; Code is a commodity; things that a lot of people have built profitable careers around have started to move down-market. Libraries, packages, etc make it easier for third parties to stitch things together rather than start from scratch.</p>
<p>The central myth underlying Big Data that&#8217;s erupted over the past 18-24 months; the myth of the gold rush. <em>Everyone</em> wants to be a data scientist. But just like the gold rush, success takes capital. It takes corporate engagement, and infrastructure. The &#8216;myth tells us you can go it alone&#8230; and you can&#8217;t.&#8217;</p>
<p>1950s-60s &#8211; data as product. 1970s-80s &#8211; data as byproduct. 1990s-2000s &#8211; data as assset. 2010- data as substrate (data as the basis for competition). &#8216;The real data revolution is in business structure and processes and how the use information.&#8217;</p>
<p>Using Big Data; the point isn&#8217;t necessarily about &#8216;Big.&#8217; Much valuable data inside an enterprise is only GB or TB in size. We get tied up in &#8216;big&#8217; way too much. It&#8217;s not really about data either; it&#8217;s about <em>applying</em> data. Without an application, it&#8217;s trivia.</p>
<p>Next, Amazon CTO <a class="zem_slink" title="Werner Vogels" rel="homepage" href="http://www.allthingsdistributed.com">Werner Vogels</a>. An overview of how <a class="zem_slink" title="Amazon Web Services" rel="homepage" href="http://aws.amazon.com/">Amazon Web Services</a> look at the data processing being done on their infrastructure by customers&#8230; Government, Finance, COmmerce, Pharma&#8230; all making use of tools. Plugging <em>The Fourth Paradigm</em> book from Microsoft Research (which is very good).</p>
<p>Vogels &#8211; big data is big data when your data sets become so large that you have to innovate to manage them. Customers view big data as collection and curation of data for competitive advantage&#8230; with the presumption that bigger is better. For recommendations etc, that is probably true.</p>
<p>There are a number of categories of data, where quality is far more important than quantity.</p>
<p>In the past, data tended to be collected to answer questions. Now, trend to collecting as much as possible before developing the questions you want answered, and the algorithms you will need to use for the analysis.</p>
<p>To do this, you should not be worried by data storage, data processing, etc &#8211; which is why you should embrace the scalable Cloud.</p>
<p>Data analysis pipeline; collect &#8211; store &#8211; organise &#8211; analyse &#8211; share.</p>
<p><a class="zem_slink" title="AWS Import/Export" rel="homepage" href="http://aws.amazon.com/importexport">AWS Import/Export</a> &#8211; &#8220;you shouldn&#8217;t underestimate the bandwidth of a FedEx box.&#8221; Indeed.</p>
<p>&#8220;This is Day 1 for Cloud infrastructure.&#8217;</p>
<p>Next up, Microsoft&#8217;s Zane Adam talking about data marketplaces. Windows Azure <a href="https://datamarket.azure.com/">DataMarket</a>; <a class="zem_slink" title="Data as a Service" rel="wikipedia" href="http://en.wikipedia.org/wiki/Data_as_a_Service">Data as a Service</a>, free or at cost. One stop shop for data (one of many one stop shops, unfortunately!) DataMarket <em>is</em> interesting&#8230; but this is far too much of a product pitch for the keynote track.</p>
<p>90 days since launch &#8211; 5,000+ subscriptions, 3 Million transactions to date. Given Microsoft&#8217;s presence and reach, aren&#8217;t those figures a bit low?</p>
<p>&#8220;There&#8217;s a lot of data out there&#8230; but it&#8217;s not all good.&#8221; A Data Marketplace gives customers access to good data. Does it? Do Microsoft vet every fact in a submitted data set? What would a single bad data set do to the marketplace&#8217;s brand recognition?</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=b94efb47-86af-442d-a0df-9515cd0e3707" alt="" /><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2011/02/strata-conference-2011-day-2-keynotes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Examining the Linked Data opportunity; the case of Higher Education</title>
		<link>http://cloudofdata.com/2009/08/examining-the-linked-data-opportunity-the-case-of-higher-education/</link>
		<comments>http://cloudofdata.com/2009/08/examining-the-linked-data-opportunity-the-case-of-higher-education/#comments</comments>
		<pubDate>Sun, 09 Aug 2009 15:48:14 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[BBC]]></category>
		<category><![CDATA[Higher Education]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[jisclinkeddata]]></category>
		<category><![CDATA[Joint Information Systems Committee]]></category>
		<category><![CDATA[Open Calais]]></category>
		<category><![CDATA[SemHE]]></category>
		<category><![CDATA[Thomson Reuters]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=751</guid>
		<description><![CDATA[Image via Wikipedia Regardless of where you stand on some of the questions of detail with respect to the Linked Data meme, it&#8217;s clear that significant enthusiasm is being marshalled behind both the concept and the opportunities that it promises. Dion Hinchcliffe looks at some of the means by which enterprise data can be more [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 310px;">
<dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/Image:York_central_hall.jpg"><img title="The :en:University of York's Central Hall, as ..." src="http://upload.wikimedia.org/wikipedia/commons/thumb/3/35/York_central_hall.jpg/300px-York_central_hall.jpg" alt="The :en:University of York's Central Hall, as ..." width="300" height="200" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://commons.wikipedia.org/wiki/Image:York_central_hall.jpg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Regardless of where you stand on some of the <a href="http://cloudofdata.com/2009/07/does-linked-data-need-rdf/">questions</a> of <a href="http://cloudofdata.com/2009/07/more-linked-data-and-rdf/">detail</a> with respect to the Linked Data meme, it&#8217;s clear that significant enthusiasm is being marshalled behind both the concept and the opportunities that it promises.</p>
<p><a class="zem_slink" title="Dion Hinchcliffe" rel="blog" href="http://hinchcliffeandco.com">Dion Hinchcliffe</a> looks at some of the means by which enterprise data can be more visible on (and useful to) the Web in a <a href="http://blogs.zdnet.com/Hinchcliffe/?p=650">ZDNet post</a> this week. The &#8216;Semantic Web &amp; Linked Data&#8217; are included, and Dion <a href="http://blogs.zdnet.com/Hinchcliffe/?p=650">writes</a>;</p>
<blockquote><p>&#8220;By far the most sophisticated and complex of the three approaches to open data presented here, [Linked Data is] highly suitable for certain applications that have rich data sets that need powerful means of processing and consumption. In particular, scientific, technical, medical, mapping, and certain government domains are highly suitable for this approach. It remains unclear if Linked Data will finally trigger the boom in the Semantic Web so use with care. However, definite consideration should be applied, given the potential of the approach to create data sets with extraordinarily high function. Businesses already managing their data with Semantic Web technologies will be the most likely candidates for adoption.&#8221;</p></blockquote>
<p>We&#8217;re certainly seeing plenty of talk — and some interesting beginnings — in the Government domain, and organisations such as <a class="zem_slink freebase/guid/9202a8c04000641f800000000572e521" title="Reuters" rel="homepage" href="http://reuters.com">Thomson Reuters</a> and the <a class="zem_slink freebase/guid/9202a8c04000641f800000000000b122" title="BBC" rel="homepage" href="http://www.bbc.co.uk/">BBC</a> are also taking compelling steps around the periphery of their core businesses.</p>
<p>Education offers another interesting set of opportunities, and Jason Ohler&#8217;s <a href="http://www.educause.edu/EDUCAUSE+Quarterly/EDUCAUSEQuarterlyMagazineVolum/TheSemanticWebinEducation/163437">piece</a> in <em>Educause Quarterly</em> (and a related <a href="http://blogs.talis.com/education/2008/10/15/jason-ohler-talks-with-talis-about-education-and-the-semantic-web/">podcast</a> I recorded with him whilst I was still at <a class="zem_slink" title="Talis Platform" rel="homepage" href="http://www.talis.com/platform/">Talis</a>) illustrates one view of that opportunity.</p>
<p>Here in the UK, the <a href="http://blogs.talis.com/education/2008/10/15/jason-ohler-talks-with-talis-about-education-and-the-semantic-web/">Joint Information Systems Committee</a> (JISC) is beginning to take note. They first funded a review of &#8216;<a href="http://www.jisc.ac.uk/whatwedo/services/techwatch/reports/horizonscanning/hs0502.aspx">Semantic Web Technologies</a>&#8216; back in 2005, then revisited the topic with &#8216;<a href="http://www.jisc.ac.uk/whatwedo/projects/semantictechnologies.aspx">Semantic Technologies in Learning and Teaching</a>&#8216; (and a related <a href="http://www.semhe.org/">workshop</a> in the south of France later this year). I&#8217;ll be recording a podcast with the manager of that project, Thanassis Tiropanis, later this month.</p>
<p>JISC have also asked me to conduct a short piece of work to look specifically at the opportunity presented to the Higher Education community by Linked Data, and this work will run over the next few months. I&#8217;m certainly keen to learn about concrete examples, and to hear reasoned arguments for and against in order to submit comprehensive findings and recommendations. So if you have something to say, please do <a href="http://cloudofdata.com/contact/">get in touch</a>.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://blog.aldobucchi.com/2009/07/web3-and-enterprise-linked-data.html">Web3 and Enterprise Linked Data. The Middleware Revolution</a> (aldobucchi.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.johnbreslin.com/blog/2009/06/24/open-government-and-linked-data-now-its-time-to-draft/">Open government and Linked Data; now it&#8217;s time to draft&#8230;</a> (johnbreslin.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">ReadWriteWeb Interview With Tim Berners-Lee, Part 1: Linked Data</a> (readwriteweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/linked_data_is_blooming_why_you_should_care.php">Linked Data is Blooming: Why You Should Care</a> (readwriteweb.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/89a22b3a-8ab7-428c-9b7f-806086758934/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=89a22b3a-8ab7-428c-9b7f-806086758934" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-info pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/08/examining-the-linked-data-opportunity-the-case-of-higher-education/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thomson Reuters turns to FedEx and DHL to boot-strap their Cloud of Data</title>
		<link>http://cloudofdata.com/2009/06/thomson-reuters-turns-to-fedex-and-dhl-to-boot-strap-their-cloud-of-data/</link>
		<comments>http://cloudofdata.com/2009/06/thomson-reuters-turns-to-fedex-and-dhl-to-boot-strap-their-cloud-of-data/#comments</comments>
		<pubDate>Tue, 16 Jun 2009 13:00:45 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[CNET Networks]]></category>
		<category><![CDATA[Huffington Post]]></category>
		<category><![CDATA[Open Calais]]></category>
		<category><![CDATA[Semantic Technology Conference]]></category>
		<category><![CDATA[semtech2009]]></category>
		<category><![CDATA[Thomson Reuters]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=684</guid>
		<description><![CDATA[Image by perspikace via Flickr Thomson Reuters&#8216; Open Calais team have clearly been busy, with several announcements at the Semantic Technology Conference here in San Jose. On 15 June the company rolled out version 4.1 of Open Calais, embracing Spanish language content and the notion of &#8216;social tags;&#8217; &#8220;OpenCalais is a great semantic data extraction [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 250px;">
<dt class="wp-caption-dt"><a href="http://www.flickr.com/photos/46138514@N00/3155604509"><img title="Thomson Reuters" src="http://farm4.static.flickr.com/3198/3155604509_ce660b000e_m.jpg" alt="Thomson Reuters" width="240" height="84" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image by <a href="http://www.flickr.com/photos/46138514@N00/3155604509">perspikace</a> via Flickr</dd>
</dl>
</div>
</div>
<p><a class="zem_slink" title="Reuters" rel="homepage" href="http://reuters.com">Thomson Reuters</a>&#8216; <a class="zem_slink" title="Open Calais" rel="homepage" href="http://OpenCalais.com">Open Calais</a> team have clearly been busy, with several announcements at the <a class="zem_slink" title="Semantic Technology Conference" rel="homepage" href="http://semantic-conference.com/">Semantic Technology Conference</a> here in San Jose.</p>
<p>On 15 June the company <a href="http://www.opencalais.com/blogs/tom/opencalais-release-41-available-today">rolled out</a> version 4.1 of Open Calais, embracing Spanish language content and the notion of &#8216;social tags;&#8217;</p>
<blockquote><p>&#8220;<span style="font-family: Arial, Helvetica, sans-serif; font-size: 12px; color: #696868;">OpenCalais is a great semantic data extraction engine. If you write an article about the relative merits of<a style="color: #0078ae; text-decoration: none;" href="http://d.opencalais.com/er/company/ralg-tr1r/48ccfe38-7c8c-3a18-8328-8327cf83b011.html">Porsche</a> and <a style="color: #0078ae; text-decoration: none;" href="http://d.opencalais.com/er/company/ralg-tr1r/f8a13a13-8dbc-3d7e-82b6-1d7968476cae.html">BMW</a> at the test track in <a style="color: #0078ae; text-decoration: none;" href="http://d.opencalais.com/er/geo/city/ralg-geo1/b4c905cc-ae95-d283-231d-12ff4c13bf20.html">Leipzig</a>, we’ll diligently identify Porsche and BMW as companies and Leipzig as a geography. We’ll create Linked Data URIs to represent these things and open up access to the<a style="color: #0078ae; text-decoration: none;" href="http://en.wikipedia.org/wiki/Linked_Data">Linked Data</a> ecosystem so you can enhance your article with other content assets.</span></p>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;">But… sometimes you just want a great description. The kind of tags a human would put on the article. Like “Car racing” or “Automobiles”. The kind of tag that would, for example, be very searchable and therefore …. SEO’able (that is definitely is not a word).</p>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;">In 4.1 we’re introducing OpenCalais Social Tags. Social Tags is our attempt to emulate how a human might tag the document. Social Tags does some fairly sophisticated analysis of your entire document and maps it to a knowledgebase based on Wikipedia and other assets. From that process we generate Social Tags.<span style="color: #000000; font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, fantasy; font-size: 13px;">&#8220;</span></p>
</blockquote>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;"><span style="color: #000000; font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, fantasy; font-size: 13px;">This morning, they followed through with a further pair of announcements. Firstly, CNET has been joined by <a class="zem_slink" title="The Huffington Post" rel="homepage" href="http://www.huffingtonpost.com/">The Huffington Post</a>, <a href="http://dailyme.com/">DailyMe</a> and the UK&#8217;s <a href="http://www.dailymail.co.uk/">Mail Online</a> in integrating Open Calais into their workflow.</span></p>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;"><span style="color: #000000; font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, fantasy; font-size: 13px;">One of the more interesting aspects of the earlier <a href="http://www.opencalais.com/press-releases/cbs-interactive-cnet-pioneers-adoption-thomson-reuters-opencalais-service">CNET announcement</a> was the contribution of data back into the pool; </span></p>
<blockquote>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;">&#8220;CNET joins Thomson Reuters as one of the first commercial media companies to publish core data assets for public, programmatic use on the open semantic Web. CNET will leverage OpenCalais&#8217; connection to the rapidly expanding &#8216;Linked Data cloud&#8217; to allow its original content &#8212; such as tech product reviews on laptops, TVs, smart phones, and digital cameras; news articles and blog posts from its CNET News editorial staff; and parts of its core technology product catalog &#8211; to be available for public use.&#8221;</p>
</blockquote>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;"><span style="color: #000000; font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, fantasy; font-size: 13px;">It will be interesting to see whether these latest media properties are able and willing to do something similar.</span></p>
<p style="color: #696868; font-family: Arial, Helvetica, sans-serif; font-size: 12px;"><span style="color: #000000; font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, fantasy; font-size: 13px;">The second element announced today in many ways mirrors Amazon&#8217;s recent <a href="http://aws.amazon.com/importexport/">Import/Export service</a>. Thomson Reuters, too, have recognised that it remains impractical to move large quantities of data over the network, and today announced their &#8216;Archive Express&#8217; which will process up to 20 million documents off a physical storage device within 24 hours, free of charge.</span></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/cnet_partners_with_thomson_reuters_on_linked_data.php"> CNET Partners with Thomson Reuters on Linked Data Initiative </a> (readwriteweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2009/02/linked-data-and-the-enterprise-a-viable-two-way-street/">Linked Data and the Enterprise: a viable two-way street</a> (cloudofdata.com)</li>
<li class="zemanta-article-ul-li"><a href="http://news.cnet.com/8301-1023_3-10250873-93.html?part=rss&amp;subj=news"> Semantic technology gains publishing foothold </a> (news.cnet.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/808d73ed-ef60-436d-a2af-79af40f84a3d/"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=808d73ed-ef60-436d-a2af-79af40f84a3d" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/06/thomson-reuters-turns-to-fedex-and-dhl-to-boot-strap-their-cloud-of-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Berkman Center unveils fascinating insight into media trends, with a little Semantic Web goodness from Calais under the hood</title>
		<link>http://cloudofdata.com/2009/03/berkman-center-unveils-fascinating-insight-into-media-trends-with-a-little-semantic-web-goodness-from-calais-under-the-hood/</link>
		<comments>http://cloudofdata.com/2009/03/berkman-center-unveils-fascinating-insight-into-media-trends-with-a-little-semantic-web-goodness-from-calais-under-the-hood/#comments</comments>
		<pubDate>Wed, 11 Mar 2009 13:20:43 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Barak Pridor]]></category>
		<category><![CDATA[Berkman Center for Internet & Society]]></category>
		<category><![CDATA[bloggers]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[ClearForest]]></category>
		<category><![CDATA[Ethan Zuckerman]]></category>
		<category><![CDATA[Harvard University]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[Media Cloud]]></category>
		<category><![CDATA[newspaper]]></category>
		<category><![CDATA[Open Calais]]></category>
		<category><![CDATA[Stephen Schultze]]></category>
		<category><![CDATA[Thomson Reuters]]></category>
		<category><![CDATA[TV News]]></category>
		<category><![CDATA[Yochai Benkler]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=386</guid>
		<description><![CDATA[Image via Wikipedia Harvard University&#8217;s Berkman Center for Internet &#38; Society unveiled their Media Cloud research tool today, bringing Semantic Web goodness from Thomson Reuters&#8217; Calais and affordably scalable Cloud oomph from Amazon Web Services together in powering exploration of a fascinating topic. As the press release notes, &#8220;Media Cloud was conceived by Berkman Fellow [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 212px;">
<dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/Image:Harvard_Wreath_Logo_1.svg"><img title="Harvard University" src="http://upload.wikimedia.org/wikipedia/en/thumb/3/3a/Harvard_Wreath_Logo_1.svg/202px-Harvard_Wreath_Logo_1.svg.png" alt="Harvard University" width="202" height="202" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://en.wikipedia.org/wiki/Image:Harvard_Wreath_Logo_1.svg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Harvard University&#8217;s <a href="http://cyber.law.harvard.edu/">Berkman Center for Internet &amp; Society</a> unveiled their <a href="http://www.mediacloud.org/">Media Cloud</a> research tool today, bringing Semantic Web goodness from Thomson Reuters&#8217; <a href="http://www.opencalais.com/">Calais</a> and affordably scalable Cloud oomph from <a href="http://aws.amazon.com/">Amazon Web Services</a> together in powering exploration of a fascinating topic.</p>
<p>As the <a href="http://www.opencalais.com/press-releases/harvard-berkman-centers-media-cloud-offer-insights-media-trends-opencalais">press release</a> notes,</p>
<blockquote><p>&#8220;Media Cloud was conceived by Berkman Fellow Ethan Zuckerman and Berkman Faculty Co-Director Yochai Benkler.  It was inspired by their debate over whether the blogosphere largely echoed traditional media or was instead a source for original news and democratic agenda-setting.</p>
<p>&#8216;While daily newspapers struggle for survival, political, niche and special interest blogs continue to capture consumer interest,&#8217; said Yochai Benkler, Faculty Co-Director of the Berkman Center. &#8216;In the midst of this upheaval, it is difficult to know where stories begin, who sets the agenda, and how these dramatic changes impact news coverage on the whole.  We created Media Cloud to help researchers and the public get quantitative answers to these challenging questions.&#8217;&#8221;</p></blockquote>
<p>The site itself <a href="http://www.mediacloud.org/about-2/" class="broken_link">provides more detail</a>, notably,</p>
<blockquote><p>&#8220;Print newspapers are declaring bankruptcy nationwide. High-profile blogs are proliferating. Media companies are exploring new production techniques and business models in a landscape that is increasingly dominated by the Internet. In the midst of this upheaval, it is difficult to know what is actually happening to the shape of our news. Beyond one-off anecdotes or painstaking manual content analysis, there are few ways to examine the emerging news ecosystem.&#8221;</p></blockquote>
<p>By systematically harvesting full-text content from both new and traditional news sources and passing it through Calais for entity extraction, the Berkman team is able to build a coherent and normalised pool of news that they &#8211; and others &#8211; can begin to interrogate in order to answer pressing questions as to the shifting news landscape. Today&#8217;s site is just the beginning, and the team has ambitious plans to increase the number of sources they harvest, to provide ever-richer visualisations on their site, and to expose public APIs to the underlying data in order that third parties can consume it for their own purposes.</p>
<p>Ahead of the launch I spoke with <a href="http://clearforest.com/AboutUs/ManagementTeam.asp#1" class="broken_link">Barak Pridor</a>, CEO of <a href="http://thomsonreuters.com/">Thomson Reuters</a>&#8216; <a href="http://clearforest.com/">ClearForest</a> subsidiary, and Berkman Fellow <a href="http://cyber.law.harvard.edu/people/SSchultze">Stephen Schultze</a>. Both stressed the beta nature of the site, and the fact that the team are still working to optimise a number of areas. That aside, I was impressed by some of the early results.</p>
<p>Barak pointed to the role Calais is playing in enabling the Berkman team to conduct comparisons within and between different news sources, extracting key entities (personal names, companies, places, etc) from the stream of text and normalising the various ways in which we refer to them (IBM, International Business Machines, etc).</p>
<p>Stephen discussed some of the background to the project, and was keen to emphasise the <em>platform</em> nature of their activity; although focussed on a destination web site today, the intention is to expose much of the data via a series of APIs that third parties will be able to consume. In the context of recent comparable moves from the <a href="http://open.blogs.nytimes.com/2009/02/25/announcing-the-times-newswire-api/"><em>New York Times</em></a> and Britain&#8217;s <a href="http://www.guardian.co.uk/media/2009/mar/10/guardian-open-platform"><em>Guardian</em></a>, this is clearly to be welcomed and will accelerate the innovation around this incomparable pool of content.</p>
<p>Whilst the site does not currently expose the full richness of the data being harvested, it is already possible to see a number of interesting <a href="http://www.mediacloud.org/visualizations/" class="broken_link">visualisations</a>.</p>
<p style="text-align: center;"><a title="bbc-wsj-fox-maps" href="http://www.mediacloud.org/visualizations/?tagset=13&amp;chart_is_log=true&amp;pivotterm=mugabe&amp;viz_type=map&amp;media_source[1]=BBC&amp;media_source[2]=Wall+Street+Journal&amp;media_source[3]=FOX+News&amp;media_id[1]=1094&amp;media_id[2]=1150&amp;media_id[3]=1092&amp;submit=Submit+Query" class="broken_link"><img class="attachment wp-att-388 aligncenter" style="margin: 8px;" src="http://cloudofdata.com/wp-content/uploads/2009/03/bbc-wsj-fox-maps.png" alt="bbc-wsj-fox-maps" width="378" height="660" /></a></p>
<p>The maps above, for example, compare the coverage devoted to all the world&#8217;s countries by three very different news organisations; the BBC, the Wall Street Journal and FOX News. The apparent non-coverage of Iran is surprising, and suggests an algorithm still in need of tweaking. That apart, it&#8217;s actually surprising to note the very similar spread of coverage. European snobbishness about the insularity of US news &#8211; and the farcical nature of FOX &#8216;news&#8217; &#8211; may not survive exposure to this data, although we <em>can</em>, of course, reassure ourselves that &#8216;coverage&#8217; doesn&#8217;t always equate to &#8216;insight,&#8217; &#8216;analysis&#8217; or &#8216;truth.&#8217; As more data become available, even those preconceived notions may face an overdue battering.</p>
<p>The system is already capable of addressing more complex questions, and exploring temporal aspects of the ways in which stories break, spread and relate to one another across different media. Stephen shared a number of evolving visualisations that made it possible for me to explore several trends at once, and I look forward to these tools arriving on the site.</p>
<p>The project is directly funded by the Berkman Center at present, and a desire to enrich and stabilise the platform means that significant addition of new news resources may be some months off. I, for one, look forward to seeing more non-US resources such as the UK&#8217;s <em>Financial Times</em> and <em>Guardian</em>, or their international equivalents.</p>
<p>The project&#8217;s APIs are also being finalised at this point and will be released in due course, along with the source code upon which Media Cloud runs. It will be interesting to see the balance between API access and software download at that point.</p>
<p>Media Cloud represents an excellent example of the uses to which the technologies I cover can be put. It&#8217;s not a Semantic Web project. It&#8217;s not a Cloud Computing project. It&#8217;s an intriguing exploration, that puts those technologies (invisibly) to work in helping to get the job done. That&#8217;s the way it should be.</p>
<p>And, as Stephen commented when asked what he was most interested to see,</p>
<blockquote><p>&#8220;the most amazing stuff will be what other folks use it for in asking and answering their own questions.&#8221;</p></blockquote>
<p>Indeed.</p>
<p>Thomson Reuters is already supporting this project by contributing Calais and some technical expertise. I wonder whether some of our more enlightened news organisations might like to help the Berkman get more data in there, faster? As more newspapers close each week, and as we (supposedly) become ever-more insular in the choices we make about the news we consume, there&#8217;s a real need to understand the ways in which the media are changing. Many eyes on this data set is one sure way to ensure that the commentary moving forward is informed and informative.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://crooksandliars.com/susie-madrak/lawsuit-determine-fair-use-blog-links">Lawsuit to Determine Fair Use for Blog Links, Headlines</a> (crooksandliars.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.paidcontent.org/entry/419-newspapers-suddenly-adapt-to-socal-media-nearly-60-percent-offer-user-g/">Newspapers Suddenly Adapt To Socal Media; Nearly 60 Percent Offer User-Gen Content</a> (paidcontent.org)</li>
<li class="zemanta-article-ul-li"><a href="http://www.mathewingram.com/work/2008/11/26/yes-twitter-is-a-source-of-journalism/">Yes, Twitter is a source of journalism</a> (mathewingram.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.xconomy.com/boston/2009/01/13/boston-journalists-launch-globalpostcom-alternative-to-traditional-medias-shrinking-international-coverage/">Boston Journalists Launch GlobalPost.com, Alternative to Traditional Media&#8217;s Shrinking International Coverage</a> (xconomy.com)</li>
<li class="zemanta-article-ul-li"><a href="http://blogs.harvardbusiness.org/now-new-next/2009/02/whats-the-best-business-model.html">What&#8217;s the Best Business Model for Newspapers?</a> (blogs.harvardbusiness.org)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/10d91dc7-a83e-4660-a8c1-65a1851c4ec2/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=10d91dc7-a83e-4660-a8c1-65a1851c4ec2" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/03/berkman-center-unveils-fascinating-insight-into-media-trends-with-a-little-semantic-web-goodness-from-calais-under-the-hood/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Linked Data and the Enterprise: a viable two-way street</title>
		<link>http://cloudofdata.com/2009/02/linked-data-and-the-enterprise-a-viable-two-way-street/</link>
		<comments>http://cloudofdata.com/2009/02/linked-data-and-the-enterprise-a-viable-two-way-street/#comments</comments>
		<pubDate>Wed, 11 Feb 2009 11:48:31 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[BBC]]></category>
		<category><![CDATA[DBpedia]]></category>
		<category><![CDATA[JP Rangaswami]]></category>
		<category><![CDATA[Open Calais]]></category>
		<category><![CDATA[San Diego]]></category>
		<category><![CDATA[Semantic Web Gang]]></category>
		<category><![CDATA[TED]]></category>
		<category><![CDATA[Thomson Reuters]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[TTI/Vanguard]]></category>
		<category><![CDATA[World Wide Web]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=331</guid>
		<description><![CDATA[In a pair of blog posts yesterday, Andreas Blumauer of Austria&#8217;s Semantic Web Company touched on an area that has been absorbing my attention recently, and raised some questions worth exploring here. I am travelling to San Diego next week to speak about the importance of evolving Enterprise attitudes to data. Borrowing some nice turns [...]]]></description>
			<content:encoded><![CDATA[<p><a title="Sunrise over San Diego" href="http://www.flickr.com/photos/nolasknab/392994798/"><img class="attachment wp-att-335 alignright" style="margin: 6px;" src="http://cloudofdata.com/wp-content/uploads/2009/02/392994798_a88124299d_m.jpg" alt="Sunrise over San Diego" width="240" height="160" /></a>In a pair of <a href="http://ablvienna.wordpress.com/2009/02/10/linked-data-for-enterprises-a-one-way-scenario/">blog</a> <a href="http://blog.semantic-web.at/2009/02/10/linked-data-in-enterprises-some-ideas-for-business-models/">posts</a> yesterday, <a href="http://ablvienna.wordpress.com/">Andreas Blumauer</a> of Austria&#8217;s <a href="http://www.semantic-web.at/">Semantic Web Company</a> touched on an area that has been absorbing my attention recently, and raised some questions worth exploring here.</p>
<p>I am travelling to <a class="zem_slink" title="San Diego, California" rel="wikipedia" href="http://en.wikipedia.org/wiki/San_Diego%2C_California">San Diego</a> next week to <a href="http://www.ttivanguard.com/conference/2009/cloud.html">speak</a> about the importance of evolving Enterprise attitudes to data. Borrowing some nice turns of phrase from <a class="zem_slink" title="Tim Berners-Lee" rel="homepage" href="http://www.w3.org/People/Berners-Lee/">Sir Tim Berners-Lee</a>&#8216;s recent <a href="http://conferences.ted.com/TED2009/program/">TED talk</a> and <a class="zem_slink" title="JP Rangaswami" rel="homepage" href="http://www.confusedofcalcutta.com">JP Rangaswami</a>&#8216;s <a href="http://cloudofdata.com/2009/02/looking-back-at-powered-by-cloud-conference/">keynote</a> to <a href="http://www.poweredbycloud.com/">Powered by Cloud</a>, amongst other things I&#8217;ll be suggesting that they &#8216;stop hugging their data&#8217; and move &#8216;from data centre to data centric.&#8217;</p>
<p>The <a href="http://linkeddata.org/">Linked Data initiative</a>, which began in March of 2007 as a <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData">community project</a> supported by <a href="http://www.w3.org/">W3C</a>&#8216;s Semantic Web Education &amp; Outreach (<a href="http://www.w3.org/2001/sw/sweo/">SWEO</a>) Interest Group (of which I was a member), has been a huge success. Described by Berners-Lee as &#8216;<a href="http://blogs.zdnet.com/semantic-web/?p=165">the Web done right</a>,&#8217; the notion of Linked Data rests upon the acceptance of <a href="http://www.w3.org/DesignIssues/LinkedData">four simple principles</a>, yet opens the door to previously unanticipated re-use of data scattered across the Web.</p>
<p>The most rapid adoption has, unsurprisingly, been seen in terms of liberally licensed data already visible on the Web in some form. <a href="http://dbpedia.org/About">DBpedia</a>, for example, is a community effort to extract structured information from <a href="http://en.wikipedia.org/">Wikipedia</a> and expose the individual facts for use across the Web. There have also been examples — as always justified by hacker mentality, &#8216;academic freedom,&#8217; the imprimatur of &#8216;research,&#8217; or the expectation that the perpetrators are &#8216;too small&#8217; to be noticed — in which data have been appropriated to the cause without due care and attention to the rights of the data owner, but these isolated cases should certainly not detract from the value of the broader effort.</p>
<div class="zemanta-img zemanta-action-dragged" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 212px;">
<dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/Image:Linking-Open-Data-diagram_2008-03-31.png"><img title="Diagram for the LOD datasets" src="http://upload.wikimedia.org/wikipedia/en/thumb/5/55/Linking-Open-Data-diagram_2008-03-31.png/202px-Linking-Open-Data-diagram_2008-03-31.png" alt="Diagram for the LOD datasets" width="202" height="158" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://en.wikipedia.org/wiki/Image:Linking-Open-Data-diagram_2008-03-31.png">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Public Interest data from organisations such as the <a href="http://www.bbc.co.uk/">BBC</a> has also begun to appear in the &#8216;<a href="http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2008-09-18.html">Linked Data Cloud</a>&#8216; (click on individual data sets for more),  and the frequency and strength of reciprocal links between participating resources grows rapidly.</p>
<p>Enterprise data is effectively invisible to this Cloud, which brings me back to Andreas&#8217; <a href="http://ablvienna.wordpress.com/2009/02/10/linked-data-for-enterprises-a-one-way-scenario/">first post</a>. In it, he asks;</p>
<blockquote><p>&#8220;Since the [Linked Data] cloud is kind of the basic infrastructure which drives the whole process &#8211; this layer should remain a freely accessible one. But how could new business models be built on top of it (and constantly spend money on maintaining and extending the underlying infrastructure)?</p>
<p>Where could enterprises start using Linked Data? Only by retrieving data from the &#8216;outside&#8217; and mash it up with the &#8216;inside&#8217; &#8211; only one way?&#8221;</p></blockquote>
<p>I can certainly see cases in which cautious corporates will be willing to <em>consume</em> without <em>contributing</em> in return, and there&#8217;s clearly work to do in demonstrating the value that they could gain from more balanced participation; participation that should never mean unwillingly &#8216;giving away&#8217; competitive advantage or sensitive data.</p>
<p>We have an annoying tendency to view data in our databases as an indivisible mass, vigorously and unthinkingly applying the same (expensive) protections to an uninteresting and low-value factoid of underlying context as we do to the core attributes of our next big lead.</p>
<p>Andreas concludes this post by suggesting something very similar to JP Rangaswami&#8217;s notion of &#8216;data centric&#8217;;</p>
<blockquote><p>&#8220;Information has no &#8216;place&#8217; anymore, energy can&#8217;t be shipped around the world. We should rethink the meaning of a &#8216;data store&#8217; and information will flow without flooding us. Linked Data might become the essence.&#8221;</p></blockquote>
<p>Andreas&#8217; <a href="http://blog.semantic-web.at/2009/02/10/linked-data-in-enterprises-some-ideas-for-business-models/">second post</a> followed after he&#8217;d listened to the <a href="http://semanticgang.talis.com/2009/01/16/january-2009-the-semantic-web-gang-discusses-calais-40-linked-data-and-google/" class="broken_link">most recent episode</a> of the <a href="http://semanticgang.talis.com/" class="broken_link">Semantic Web Gang</a>, which I Chair. During the show, recorded last month, we discussed the <a href="http://blogs.zdnet.com/semantic-web/?p=249">latest release</a> from <a href="http://www.thomsonreuters.com/">Thomson Reuters</a>&#8216; <a href="http://www.opencalais.com/">Open Calais</a> activity, which sees it embrace Linked Data&#8217;s principles whilst continuing to run and grow a viable global business.</p>
<p>Andreas extrapolates from the conversation to suggest that a viable business model for the data-curating Enterprise might be to expose timely and accurate enrichments to the Linked Data ecosystem; enrichments that customers might pay a premium to access more quickly or in more convenient forms than are available for free. He also sees a market for application builders that optimise the flow of information, and both of these are certainly possible.</p>
<p>The Linked Data — the Data Web — opportunity is far greater, though, and too little attention is being devoted to it by Linked Data&#8217;s advocates as they concentrate their efforts on big public datasets of the sort Berners-Lee discussed in Long Beach last week. Big public data sets <em>are</em> important, and Berners-Lee is right to suggest that more Open and Linked access to the outputs of scholarship will help in our efforts to tackle many of the world&#8217;s ills. There&#8217;s as much value locked up inside our commercial enterprises too, though, and yet the rationale that will ultimately lead to us unlocking this is quite different.</p>
<p>It is that rationale which we need to get right, almost certainly without mentioning &#8216;RDF&#8217;, &#8216;Semantic Web,&#8217; or even &#8216;Open.&#8217;</p>
<p>And if you&#8217;re in Southern California next week too, why not come and say &#8216;Hi&#8217;&#8230;?</p>
<p>&#8216;<a href="http://www.flickr.com/photos/nolasknab/392994798/">Sunrise over San Diego</a>&#8216; <em>image © Alon Banks</em>, 2007</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.boingboing.net/2009/02/04/ted2009-tim-bernersl.html">TED2009: Tim Berners-Lee</a> (boingboing.net)</li>
<li class="zemanta-article-ul-li"><a href="http://ourownsystem.com/2009/02/05/linked-data-will-change-health-care/" class="broken_link">Linked data will change health care</a> (ourownsystem.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.sciam.com/article.cfm?id=semantic-web-in-actio">The Semantic Web in Action</a> (sciam.com)</li>
<li class="zemanta-article-ul-li"><a href="http://thenumerati.net/index.cfm?postID=255">Tim Berners-Lee: Linked data</a> (thenumerati.net)</li>
<li class="zemanta-article-ul-li"><a href="http://blogs.talis.com/nodalities/2009/01/building-coherence-at-bbccouk.php">Building coherence at bbc.co.uk</a> (blogs.talis.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/calais_4_linked_data.php">Calais 4.0 Released: Linked Data Meets the Commercial Web</a> (readwriteweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://ivanherman.wordpress.com/2008/11/14/calais-release-4-and-the-linking-data-cloud/">Calais Release 4 and the Linking Data cloud&#8230;</a> (ivanherman.wordpress.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/b5c4308a-faf5-425c-aa2f-bcd0a17471d2/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=b5c4308a-faf5-425c-aa2f-bcd0a17471d2" alt="Reblog this post [with Zemanta]" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/02/linked-data-and-the-enterprise-a-viable-two-way-street/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>Semantic Web Gang podcast for January talks about Linked Data and Thomson Reuters</title>
		<link>http://cloudofdata.com/2009/01/semantic-web-gang-podcast-for-january-talks-about-linked-data-and-thomson-reuters/</link>
		<comments>http://cloudofdata.com/2009/01/semantic-web-gang-podcast-for-january-talks-about-linked-data-and-thomson-reuters/#comments</comments>
		<pubDate>Fri, 16 Jan 2009 16:03:50 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Calais]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[Semantic Web Gang]]></category>
		<category><![CDATA[Thomson Reuters]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=251</guid>
		<description><![CDATA[One of the podcasts I do is called the Semantic Web Gang. It&#8217;s a monthly round-table, which I chair (in the loosest sense of the word!). The first show of the year was recorded yesterday, and just went up on the Semantic Web Gang site. Have a listen, to hear the Gang and I talk [...]]]></description>
			<content:encoded><![CDATA[<p>One of the podcasts I do is called the <a href="http://semanticgang.talis.com/" class="broken_link">Semantic Web Gang</a>. It&#8217;s a monthly round-table, which I chair (in the loosest sense of the word!).</p>
<p>The first show of the year was recorded yesterday, and <a href="http://semanticgang.talis.com/2009/01/16/january-2009-the-semantic-web-gang-discusses-calais-40-linked-data-and-google/" class="broken_link">just went up on the Semantic Web Gang site</a>. Have a listen, to hear the Gang and I talk about Thomson Reuters&#8217; Calais web service, Linked Data, and the possibility that Google might be getting all semantic on us.</p>
<p>For those interested in the Thomson Reuters activity, last year <a href="http://blogs.talis.com/nodalities/2008/03/barak_pridor_talks_about_clear.php">I also recorded an interview</a> with the CEO of Thomson Reuters&#8217; Clearforest subsidiary, Barak Pridor.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/calais_4_linked_data.php">Calais 4.0 Released: Linked Data Meets the Commercial Web</a></li>
<li class="zemanta-article-ul-li"><a href="http://blogs.talis.com/nodalities/2009/01/enabling-the-linked-data-ecosystem.php">Enabling the Linked Data Ecosystem</a></li>
<li class="zemanta-article-ul-li"><a href="http://blog.larkc.eu/?p=591">OpenCalais will link to Linked Open Data identifiers</a></li>
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2008/09/24/thomson-reuters-calais-goes-commercial-offering-a-starting-point-for-semantic-web-startups/">Thomson Reuters&#8217; Calais goes commercial, offering a starting point for semantic web startups</a></li>
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/blog/2008/12/03/what-one-information-junkie-wants/" class="broken_link">What one information junkie wants</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.iwr.co.uk/information-world-review/news/2229778/open-calais-carves-enterprise-4312980">Open Calais carves enterprise channel</a></li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/b73c559f-0682-4f21-993e-f211e8e8a723/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=b73c559f-0682-4f21-993e-f211e8e8a723" alt="Reblog this post [with Zemanta]" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/01/semantic-web-gang-podcast-for-january-talks-about-linked-data-and-thomson-reuters/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Even in an Architecture of Participation, Thomson Reuters believes Content can be King</title>
		<link>http://cloudofdata.com/2009/01/even-in-an-architecture-of-participation-thomson-reuters-believes-content-can-be-king/</link>
		<comments>http://cloudofdata.com/2009/01/even-in-an-architecture-of-participation-thomson-reuters-believes-content-can-be-king/#comments</comments>
		<pubDate>Wed, 14 Jan 2009 13:05:06 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[DBpedia]]></category>
		<category><![CDATA[GeoNames]]></category>
		<category><![CDATA[The World Factbook]]></category>
		<category><![CDATA[Thomson Reuters]]></category>
		<category><![CDATA[ZDNet]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=236</guid>
		<description><![CDATA[I write about Thomson Reuters&#8216; release of Calais 4.0 over on ZDNet today, and wanted to use this post to explore some of the broader context within which Calais should increasingly be considered. As well as linking to &#8216;usual suspects&#8217; in the Linked Data space such as the CIA Factbook, GeoNames, DBpedia, Musicbrainz and the [...]]]></description>
			<content:encoded><![CDATA[<p><a title="Casa Battló" href="http://www.flickr.com/photos/grassvalleylarry/12344114/"><img class="attachment wp-att-237 alignright" style="margin: 7px;" src="http://cloudofdata.com/wp-content/uploads/2009/01/12344114_cbb1cee5c5_m.jpg" alt="Casa Battló" width="240" height="156" /></a>I <a href="http://blogs.zdnet.com/semantic-web/?p=249">write</a> about <a href="http://www.thomsonreuters.com/">Thomson Reuters</a>&#8216; release of Calais 4.0 <a href="http://blogs.zdnet.com/semantic-web/?p=249">over on ZDNet today</a>, and wanted to use this post to explore some of the broader context within which <a href="http://www.opencalais.com/">Calais</a> should increasingly be considered.</p>
<p>As well as linking to &#8216;usual suspects&#8217; in the <a class="zem_slink" title="Linked Data" rel="homepage" href="http://linkeddata.org/">Linked Data</a> space such as the <a class="zem_slink" title="The World Factbook" rel="wikipedia" href="http://en.wikipedia.org/wiki/The_World_Factbook">CIA Factbook</a>, <a class="zem_slink" title="GeoNames" rel="wikipedia" href="http://en.wikipedia.org/wiki/GeoNames">GeoNames</a>, <a class="zem_slink" title="DBpedia" rel="homepage" href="http://dbpedia.org/About">DBpedia</a>, Musicbrainz and the like, Thomson Reuters is taking the additional step of providing access to parts of their information empire. Already recognised as one of the premiere sources for financial and company information on the global stage, the company is choosing to embed Thomson Reuters services such as stock tickers, information on corporate Boards, etc.</p>
<p>In a world in which everyone can participate, and in which citizen journalism, crowd sourcing and the rest are leading to powerful and wide-ranging disruptions to traditional media, Thomson Reuters is betting that authoritative, timely and branded data has value, and that opportunities exist to build that value by giving some previously expensive information away for free in order to jump-start a range of new and lucrative services.</p>
<p>Freebase employees once talked about a desire to become the canonical source of certain facts on the web; <em>the</em> go-to place for a class of information. Might Thomson Reuters have similar ambitions in the business information market, and might the undeniable (and free) value of the Calais web services be the hook that encourages a whole swathe of developers building applications for this market to perpetuate &#8211; and strengthen &#8211; the company&#8217;s reach?</p>
<p>This isn&#8217;t cynical, this isn&#8217;t devious, this isn&#8217;t &#8216;wrong,&#8217; and there is absolutely no suggestion that the currently free services will ever incur charges for use. Instead, it&#8217;s an interesting example of a titan of Old Media thinking creatively about ways in which it can continue to have value in a changing world.</p>
<p><a href="http://www.flickr.com/photos/grassvalleylarry/12344114/">Casa Battló</a> <em>© <a href="http://www.flickr.com/people/grassvalleylarry/">Larry Miller</a> 2003, and shared on <a href="http://www.flickr.com/">Flickr</a> under <a href="http://creativecommons.org/licenses/by/2.0/deed.en_GB">Creative Commons License</a>.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://semanticbusiness.blogspot.com/2008/12/updated-profile-calais.html">Updated Profile: Calais</a></li>
<li class="zemanta-article-ul-li"><a href="http://ivanherman.wordpress.com/2008/11/14/calais-release-4-and-the-linking-data-cloud/">Calais Release 4 and the Linking Data cloud&#8230;</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.semanticfocus.com/blog/entry/title/cross-pollinating-dbpedia-and-freebase/">Cross-Pollinating DBpedia and Freebase</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.semanticfocus.com/blog/entry/title/freebase-officially-linked-data-with-release-of-rdf-service/">Freebase Officially Linked Data with Release of RDF Service</a></li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/14a2bb98-acfd-40dd-a211-d2dfbfa3331c/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=14a2bb98-acfd-40dd-a211-d2dfbfa3331c" alt="Reblog this post [with Zemanta]" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/01/even-in-an-architecture-of-participation-thomson-reuters-believes-content-can-be-king/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

