<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
		xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data &#187; Cloud of Data</title>
	<atom:link href="http://cloudofdata.com/tag/cloud-of-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</description>
	<lastBuildDate>Thu, 17 May 2012 15:04:40 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud Computing, Semantic Web, Linked Data, Open Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.com/logo300x300.jpg" />
		<item>
		<title>Amazon Public Data Sets bring the Cloud of Data closer</title>
		<link>http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/</link>
		<comments>http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/#comments</comments>
		<pubDate>Tue, 16 Dec 2008 15:32:08 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[PaaS]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[Amazon S3]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Amazon.com]]></category>
		<category><![CDATA[Cloud of Data]]></category>
		<category><![CDATA[Google App Engine]]></category>
		<category><![CDATA[Licensing]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Open Data Commons]]></category>
		<category><![CDATA[Public Domain]]></category>
		<category><![CDATA[Talis]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=199</guid>
		<description><![CDATA[Image via CrunchBase, source unknown It began, as so many things do these days, with an idle tweet. On 21 November, Amazon Web Services&#8216; Deepak Singh pointed to a new page describing the company&#8217;s &#8216;Public Data Sets on Amazon Web Services.&#8217; Lidija Davis covered the news for ReadWriteWeb two days later and on 4 December [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img">
<div>
<dl class="wp-caption alignright" style="margin: 1em; float: right; display: block; width: 210px;">
<dt class="wp-caption-dt"><a href="http://www.crunchbase.com/company/amazon"><img title="Image representing Amazon as depicted in Crunc..." src="http://www.crunchbase.com/assets/images/resized/0000/3898/3898v1-max-450x450.jpg" alt="Image representing Amazon as depicted in Crunc..." width="200" height="89" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://www.crunchbase.com">CrunchBase</a>, source unknown</dd>
</dl>
</div>
</div>
<p>It began, as so many things do these days, with an idle <a href="http://en.wikipedia.org/wiki/Twitter">tweet</a>.</p>
<p>On 21 November, <a href="http://aws.amazon.com/">Amazon Web Services</a>&#8216; <a href="http://mndoci.com/blog/about/" class="broken_link">Deepak Singh</a> <a href="http://twitter.com/mndoci/status/1016646762">pointed</a> to a new page describing the company&#8217;s &#8216;<a href="http://aws.amazon.com/publicdatasets/">Public Data Sets on Amazon Web Services</a>.&#8217;</p>
<p><a href="http://www.readwriteweb.com/about_Lidija.php" class="broken_link">Lidija Davis</a> <a href="http://www.readwriteweb.com/archives/amazon_web_services_seeks_publ.php">covered the news</a> for ReadWriteWeb two days later and on 4 December Amazon issued its <a href="http://phx.corporate-ir.net/phoenix.zhtml?c=176060&amp;p=irol-newsArticle&amp;ID=1232302&amp;highlight=">formal press release</a>, prompting a flurry of coverage from Mike Arrington at <a href="http://www.techcrunch.com/2008/12/04/amazon-launches-public-data-sets-to-ease-research/">TechCrunch</a>, Larry Dignan at <a href="http://blogs.zdnet.com/BTL/?p=11081">ZDNet</a>, Krishnan Subramanian at <a href="http://www.cloudave.com/link/amazon-tries-to-lure-scientific-community-into-the-clouds">CloudAve</a>, and many others.</p>
<p>Alongside broader discussion of this move, members of the <a href="http://www.w3.org/">W3C</a>-backed <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData">Linking Open Data project</a> delved into the synergies via their <a href="http://lists.w3.org/Archives/Public/public-lod/">public mailing list</a> and Linking Open Data enthusiast <a class="zem_slink" title="Kingsley Idehen" rel="crunchbase" href="http://www.crunchbase.com/person/kingsley-idehen">Kingsley Idehen</a>&#8216;s <a href="http://www.openlinksw.com/">company</a> issued a <a href="http://www.earthtimes.org/articles/show/openlink-bolsters-semantic-web-vision,648977.shtml" class="broken_link">Press Release</a> suggesting ways in which their products might fit within this shifting data landscape.</p>
<p>So what have Amazon done, what does it mean, and how does it &#8216;bring the Cloud of Data closer&#8217; as the title of this post suggests?</p>
<p>Amazon&#8217;s <a href="http://aws.amazon.com/publicdatasets/">web page</a> describes their offer quite succinctly;</p>
<blockquote><p>&#8220;Public Data Sets on <span class="caps">AWS</span> provides a centralized repository of public data sets that can be seamlessly integrated into <span class="caps">AWS</span> cloud-based applications.  <span class="caps">AWS</span> is hosting the public data sets at no charge for the community, and like all <span class="caps">AWS</span> services, users pay only for the compute and storage they use for their own applications.&#8221;</p></blockquote>
<p>As Krishnan noted in his post,</p>
<blockquote><p>&#8220;By doing this, Amazon is helping research community save money on storage and  bandwidth costs associated with assessing these public data from any EC2  instances they use in their research. When the data in question is in hundreds  of terabytes or petabytes, we are talking about huge cost savings here.&#8221;</p></blockquote>
<p>In addition, OpenLink&#8217;s <a href="http://www.earthtimes.org/articles/show/openlink-bolsters-semantic-web-vision,648977.shtml" class="broken_link">press release</a> gives an indication of the efficient manner in which services and data <em>already hosted by Amazon</em> can be plugged together as needed;</p>
<blockquote><p>&#8220;As a vital contribution to the momentum behind the burgeoning Web of Linked Data, [OpenLink's product] Virtuoso provides a simple deployment mechanism for highly integrated knowledge bases emerging from the Linking Open Data community. For example, it is now possible to deploy personal or service-specific renditions of <a href="http://dbpedia.org/About">DBpedia</a> within 1.5 hours, compared to an 8 &#8211; 22 hour effort when performed from scratch.&#8221;<br />
(my links)</p></blockquote>
<p>By offering free hosting for public data, then, Amazon are doing the wider community a huge service. Much of the data there today is reasonably readily available from other sources, so the biggest immediate benefits are those of speed and cost outlined above by Krishnan and OpenLink. For existing or potential users of Amazon&#8217;s Web Services to power their applications, this is yet another reason to consider Amazon.</p>
<p><a class="zem_slink" title="Harvard Medical School" rel="homepage" href="http://hms.harvard.edu/">Harvard Medical School</a>&#8216;s Dr. Peter Tonellato was quoted in Amazon&#8217;s release, and he is unlikely to be alone;</p>
<blockquote><p>&#8220;<span class="ccbnTxt">Public Data Sets on AWS will enable me and many of my colleagues to       collaborate with each other by sharing our commonly used data sets,       research environments and tools. We can set up a controlled environment in       minutes, run our computational analysis for a couple of hours, and shut       down the environment. Our results are completely repeatable. I only pay       for the compute time I use, and more importantly I can spend more time       focusing on research, not downloading and setting up computational       infrastructure.</span>&#8220;</p></blockquote>
<p>The bigger long-term contribution of this Amazon initiative may actually lie with data that are difficult or impossible to find online today. In a previous existence at the <a href="http://ads.ahds.ac.uk/">Archaeology Data Service</a> (ADS), for example, my colleagues and I were always being contacted by individuals and organisations with data that they <em>wanted</em> to see online; individuals and organisations that lacked the skills, resources or mandate to mount and maintain the data themselves. How many of those organisations will <a href="http://aws.amazon.com/publicdatasets/#3">beat a path to Amazon&#8217;s door</a> now&#8230; and what sort of resource might we see emerge as a result?</p>
<p>However&#8230;</p>
<p>Krishnan concludes his post with a reality-check, commenting;</p>
<blockquote><p>&#8220;this data stored on AWS servers are useful only if the researchers use Amazon  EC2 for their computing needs&#8230; even if  they could tap into it from external platforms, it doesn’t mean much if these public datasets are  accessible using some kind of API from their original source itself.&#8221;</p></blockquote>
<p>In other words, much (most? all?) of the advantage Amazon is offering evaporates if developers then have to pull the hosted data off Amazon&#8217;s servers and into their own applications running locally or via a competing Cloud provider such as Google.</p>
<p>Although the way in which it is recognised and monetised is finally shifting, data is still valuable, and Amazon (and others) clearly recognise the benefits of enticing users to entrust data to <em>their</em> offering, whilst (almost) imperceptibly making it that little bit more painful to use the data somewhere else.</p>
<p>Kingsley Idehen is <a href="http://www.earthtimes.org/articles/show/openlink-bolsters-semantic-web-vision,648977.shtml" class="broken_link">quoted</a> as saying,</p>
<blockquote><p>&#8220;The Web&#8217;s potential as a globally distributed information space that plugs into disparate databases has never been in question. What has remained unclear is how a federated Web of linked databases would be delivered in a manner consistent with the Web&#8217;s core architecture, without compromising its simplicity.&#8221;</p></blockquote>
<p>It is in moving us toward this open vision that Amazon&#8217;s offering (although undoubtedly an important step along the way) is ultimately lacking. For that, we may well require the open and linked approach of Semantic Web offerings from companies such as <a href="http://www.talis.com/platform/">Talis</a> and Kingsley&#8217;s OpenLink. These recognise the futility of expecting all data to migrate to a single service provider, whilst still ensuring that those on the &#8216;inside&#8217; may gain the benefits of proximity on the network, pre-computation of certain indices, etc. Amazon and its services clearly have a place within that emerging ecosystem, but it is a place that they will need to share with others.</p>
<p>The worthwhile philanthropic aspects of Amazon&#8217;s announcement apart, the company is certainly doing its part to evangelise the benefits of moving data to the Cloud, and this is to be wholeheartedly welcomed.</p>
<p>CIOs are recognising the benefits of Cloud-based computation, and their resistance to the loss of control implied by individual cost centres&#8217; embracing of SaaS solutions such as Salesforce is diminishing. The proposition of accessing <em>data</em> in the Cloud, at will, is even more profound, and the benefits to be gained require careful and compelling explanation in the face of inevitable fears regarding issues such as data integrity.</p>
<p>Showing everyone the benefits to be gained in sharing disparate <em>public</em> data sets is one more step along the way to widespread acceptance of the value in easing restrictions over access to more sensitive resources.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/link/the-evolution-of-an-all-encompassing-world-of-clouds">The evolution of an all encompassing world of clouds</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/amazon_web_services_bigger_than_amazon.php">Amazon Web Services: Bigger Than Amazon</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.xconomy.com/seattle/2008/12/04/public-data-sets-go-on-amazons-cloud/">Public Data Goes on Amazon&#8217;s Cloud</a></li>
<li class="zemanta-article-ul-li"><a href="http://blogs.ft.com/techblog/2008/12/the-amazon-cloud-no-longer-a-mid-altantic-kludge/">The Amazon Cloud: no longer a mid-Altantic kludge</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/link/amazon-tries-to-lure-scientific-community-into-the-clouds">Amazon Tries to Lure Scientific Community into the Clouds</a></li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/8e52f4ea-d3e4-4217-8ac8-03193483b71e/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=8e52f4ea-d3e4-4217-8ac8-03193483b71e" alt="Reblog this post [with Zemanta]" /></a></div>
<div class="al2fb_like_button"><div id="fb-root"></div><script type="text/javascript">
(function(d, s, id) {
  var js, fjs = d.getElementsByTagName(s)[0];
  if (d.getElementById(id)) return;
  js = d.createElement(s); js.id = id;
  js.src = "//connect.facebook.net/en_US/all.js#xfbml=1&appId=133647763430045";
  fjs.parentNode.insertBefore(js, fjs);
}(document, "script", "facebook-jssdk"));
</script>
<fb:like href="http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/" layout="standard" show_faces="true" width="450" action="like" font="arial" colorscheme="light" ref="AL2FB"></fb:like></div>]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>

