<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
		xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data &#187; Open Data Commons</title>
	<atom:link href="http://cloudofdata.com/tag/open-data-commons/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</description>
	<lastBuildDate>Fri, 10 Feb 2012 10:46:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud Computing, Semantic Web, Linked Data, Open Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.com/logo300x300.jpg" />
		<item>
		<title>Open is good &#8211; but encouragement better than mandate</title>
		<link>http://cloudofdata.com/2012/02/open-is-good-but-encouragement-better-than-mandate/</link>
		<comments>http://cloudofdata.com/2012/02/open-is-good-but-encouragement-better-than-mandate/#comments</comments>
		<pubDate>Mon, 06 Feb 2012 14:13:23 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[1OdataLicenseEU]]></category>
		<category><![CDATA[Andrés Nin]]></category>
		<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[epsi]]></category>
		<category><![CDATA[epsiplatform]]></category>
		<category><![CDATA[neelie kroes]]></category>
		<category><![CDATA[Open Data Commons]]></category>
		<category><![CDATA[open licence]]></category>
		<category><![CDATA[open license]]></category>
		<category><![CDATA[psi directive]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1801</guid>
		<description><![CDATA[Openness is undeniably cool right now, at least if you move in the slightly odd circles that I do. Openly available scientific papers are disrupting the world of scholarly publishing (which may not be all good, but that&#8217;s a post for another day). Openly available university courses are finally beginning to work out how to [...]]]></description>
			<content:encoded><![CDATA[<div class="wp-caption alignright" style="width: 310px"><a href="http://commons.wikipedia.org/wiki/File:Open_Data_stickers.jpg"><img class="zemanta-img-inserted zemanta-img-configured" title="English: Open Data stickers" src="http://cloudofdata.com/wp-content/uploads/2011/07/300px-Open_Data_stickers5.jpg" alt="English: Open Data stickers" width="300" height="225" /></a><p class="wp-caption-text">Image via Wikipedia</p></div>
<p>Openness is undeniably cool right now, at least if you move in the slightly odd circles that I do. Openly available scientific papers are disrupting the world of scholarly publishing (which may not be all good, but that&#8217;s a post for another day). Openly available university courses are finally beginning to work out how to offer meaningful accreditation to students. Openly accessible data from government agencies around the world bulks out almost every data marketplace, and anchors many an analysis. Openly available code for cloud infrastructure or networking is challenging the hold of the tech world&#8217;s giants. Everywhere you look, &#8216;incumbents&#8217; are apparently being &#8216;challenged&#8217; and &#8216;disrupted&#8217; by the power of open.</p>
<p>The truth, of course, is a little more complex and a lot more nuanced, as business models shift and evolve just like they always have. In sustainable systems, some people still need to be rewarded (often through being paid) for their effort. And in sustainable systems, <em>paying</em> someone can often be a pretty straightforward means of ensuring that you have a throat to choke if something breaks; big companies adopting open source often seek a proper financial relationship with someone who installs and maintains the &#8216;free&#8217; software or hardware they&#8217;re depending upon.</p>
<p>One area of openness that I&#8217;ve been involved with for about ten years is that of open licensing for both creative works and data. And it&#8217;s come a very long way.</p>
<p>Here in Europe, for example, the (badly flawed) 2003 <a href="http://en.wikipedia.org/wiki/PSI_Directive">Public Sector Information Directive</a> is under review, and there&#8217;s every likelihood that the replacement will make a number of sensible moves toward greater openness, transparency, and reusability for publicly funded data. As <a href="http://epsiplatform.eu/content/single-eu-open-data-license-campaign">the EPSI Platform site notes</a> today, Andrés Nin proposes going a step further than the European Commission is currently contemplating, by <a href="http://actuable.es/peticiones/say-to-neeliekroeseu-we-want-single-opendata-licence-in-the">instituting a common open license across Europe</a>;</p>
<blockquote><p>&#8220;The creation of a single public information re-use space in Europe requires much more, it requires a common European OpenData license applicable to all data generated by European public administrations.&#8221;</p></blockquote>
<p>I would certainly welcome a <em>model license</em> that European member states might be enabled to use. I&#8217;d also welcome — and support — vigorous efforts to dissuade individual member states or ministries from their usual practice of tweaking and otherwise modifying perfectly good documents in order to demonstrate how &#8216;special&#8217; or &#8216;different&#8217; their circumstances apparently are. When will they all realise that they are neither as special nor as different as they like to think?</p>
<p>But — and it&#8217;s a big but — it seems unwise, premature, and unhelpful to even begin to suggest that such a license might be mandated across Europe. It isn&#8217;t required, and attempts to develop a single document that everyone could accept would be an unhelpful distraction that would result in something so bureaucratic, so ringed in opt-outs and prevarications, as to be utterly worthless. It would also, in all likelihood, be one of those exercises in which the process very quickly subsumed the point. A prime candidate for, in the words of an old boss, being too busy to be effective.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.guardian.co.uk/science/2012/jan/27/academic-publishers-enemies-science-wrong&amp;a=72496211&amp;rid=76056481-0aaf-4346-84b0-0ed02aeddf27&amp;e=c5c38559b96c2a50e9bb649290e600df">Branding academic publishers &#8216;enemies of science&#8217; is offensive and wrong</a> (guardian.co.uk)</li>
<li class="zemanta-article-ul-li"><a href="http://opendotdotdot.blogspot.com/2011/12/open-data-europe-starts-to-get-it.html">Open Data: Europe Starts to Get It</a> (opendotdotdot.blogspot.com)</li>
<li class="zemanta-article-ul-li"><a href="http://thenextweb.com/eu/2011/12/12/open-data-in-europe-gets-a-huge-boost-from-new-eu-rules/">Open Data in Europe gets a huge boost from new EU rules</a> (thenextweb.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=76056481-0aaf-4346-84b0-0ed02aeddf27" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2012/02/open-is-good-but-encouragement-better-than-mandate/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Licensing of Linked Data</title>
		<link>http://cloudofdata.com/2009/10/licensing-of-linked-data/</link>
		<comments>http://cloudofdata.com/2009/10/licensing-of-linked-data/#comments</comments>
		<pubDate>Mon, 26 Oct 2009 16:07:43 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[International Semantic Web Conference]]></category>
		<category><![CDATA[ISWC]]></category>
		<category><![CDATA[ISWC2009]]></category>
		<category><![CDATA[Jordan Hatcher]]></category>
		<category><![CDATA[Leigh Dodds]]></category>
		<category><![CDATA[licencing]]></category>
		<category><![CDATA[Licensing]]></category>
		<category><![CDATA[Open Data Commons]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[Tim O'Reilly]]></category>
		<category><![CDATA[Tom Heath]]></category>
		<category><![CDATA[web2summit]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=850</guid>
		<description><![CDATA[As part of a workshop at this year&#8217;s International Semantic Web Conference (ISWC), former colleague Leigh Dodds prepared an interesting diagram on the ways in which resources comprising the Linked Data Cloud are currently licensed. For various reasons, I was unable to make it to Virginia for the event, but a scan through the presentations [...]]]></description>
			<content:encoded><![CDATA[<p>As part of a <a href="http://iswc2009.semanticweb.org/wiki/index.php/ISWC_2009_Tutorials/Legal_and_Social_Frameworks_for_Sharing_Data_on_the_Web">workshop</a> at this year&#8217;s <a href="http://iswc2009.semanticweb.org/">International Semantic Web Conference</a> (ISWC), former colleague <a class="zem_slink" title="Leigh Dodds" rel="blog" href="http://www.ldodds.com/blog">Leigh Dodds</a> prepared an interesting diagram on the ways in which resources comprising the <a class="zem_slink freebase/guid/9202a8c04000641f8000000005714541" title="Linked Data" rel="wikipedia" href="http://en.wikipedia.org/wiki/Linked_Data">Linked Data</a> Cloud are currently licensed.</p>
<p style="text-align: center;"><a href="http://www.flickr.com/photos/ldodds/4043803502/"><img class="size-full wp-image-851 aligncenter" title="4043803502_7df222bedb" src="http://cloudofdata.com/wp-content/uploads/2009/10/4043803502_7df222bedb.jpg" alt="4043803502_7df222bedb" /></a></p>
<p>For various reasons, I was unable to make it to Virginia for the event, but a scan through the <a href="http://iswc2009.semanticweb.org/wiki/index.php/ISWC_2009_Tutorials/Legal_and_Social_Frameworks_for_Sharing_Data_on_the_Web#Slides">presentations</a> from Leigh, <a class="zem_slink" title="Tom Heath" rel="blog" href="http://tomheath.com/">Tom Heath</a> (another former colleague), <a class="zem_slink" title="Jordan Hatcher" rel="blog" href="http://jordanhatcher.com">Jordan Hatcher</a> (with whom I worked on earlier iterations of the <a href="http://www.opendatacommons.org/">Open Data Commons</a> license), and <a class="zem_slink freebase/guid/9202a8c04000641f8000000000065d43" title="Creative Commons" rel="homepage" href="http://creativecommons.org/">Creative Commons</a>&#8216; Kaitlin Thaney, it looks like they did a great job covering the bases on this critically important aspect of the evolving Data Web.</p>
<p>If we are to encourage the sorts of break-out use of data that <a class="zem_slink freebase/guid/9202a8c04000641f800000000003b0aa" title="Tim Berners-Lee" rel="wikipedia" href="http://en.wikipedia.org/wiki/Tim_Berners-Lee">Tim Berners-Lee</a> and <a href="http://en.wikipedia.org/wiki/Tim_O'Reilly">Tim O&#8217;Reilly</a> were <a href="http://www.youtube.com/watch?v=KY5skobffk0">discussing</a> at last week&#8217;s <a href="http://www.web2summit.com/web2009">Web 2.0 Summit</a>, we need to move past the current <em>laissez-faire</em> approaches adopted by too many and ensure that there are licensing regimes in place to <em>enable</em>, <em>facilitate</em> and <em>encourage</em> widespread re-use.</p>
<p style="text-align: center;"><a class="zem_olink" title="Linking Open Data: An Emerging Practice Area for the Semantic Web" href="http://phaneron.rickmurphy.org/?p=34"><iframe title="YouTube video player" class="youtube-player" type="text/html" width="425" height="344" src="http://www.youtube.com/embed/KY5skobffk0" frameborder="0" allowFullScreen="true"> </iframe></a></p>
<p>As Leigh&#8217;s work demonstrates, we have a long way to go. Only around a third of the current Linked Data projects are explicitly licensing their content, and several of those may be mis-licensing by applying copyright-based licenses such as those from Creative Commons to data not covered by copyright legislation.</p>
<p>With the current state of data licensing, and the reliance of <a href="http://creativecommons.org/choose/zero">CC0</a> and Open Data Commons&#8217; <a href="http://www.opendatacommons.org/licenses/pddl/">Public Domain Dedication and License</a> upon the <a class="zem_slink freebase/guid/9202a8c04000641f800000000002e27e" title="Public domain" rel="wikipedia" href="http://en.wikipedia.org/wiki/Public_domain">public domain</a> as a means of overcoming territorial differences, will we find <em>enterprise</em> use of Linked Data increasingly relying upon the more cumbersome tool of contract law&#8230; to the detriment of a free and flexible exchange of ideas?</p>
<p><em>&#8216;<a href="http://www.flickr.com/photos/ldodds/4043803502/">Linked Open Data Rights Survey</a>&#8216; image by Leigh Dodds, shared on Flickr under <a href="http://creativecommons.org/licenses/by-nc-nd/2.0/deed.en">a Creative Commons license</a>.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.opencontentlawyer.com/2009/10/26/iswc-linked-data-and-the-law/">ISWC linked data and the law</a> (opencontentlawyer.com)</li>
<li class="zemanta-article-ul-li"><a href="http://ivan-herman.name/2009/10/26/iswc2009-i/">Iswc2009 I.</a> (ivan-herman.name)</li>
<li class="zemanta-article-ul-li"><a href="http://sciencecommons.org/weblog/archives/2009/09/09/cc0-endorsed-nature/">CC0 endorsed in Nature opinion piece</a> (sciencecommons.org)</li>
<li class="zemanta-article-ul-li"><a href="http://phaneron.rickmurphy.org/?p=36">Linked Data: Interpretants and Interpretation</a> (phaneron.rickmurphy.org)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/204379dc-1f9a-41dd-ae94-42b603fe42ff/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=204379dc-1f9a-41dd-ae94-42b603fe42ff" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-info pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/10/licensing-of-linked-data/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>John Wilbanks talks about Creative Commons, Data, Science and more</title>
		<link>http://cloudofdata.com/2009/06/john-wilbanks-talks-about-creative-commons-data-science-and-more/</link>
		<comments>http://cloudofdata.com/2009/06/john-wilbanks-talks-about-creative-commons-data-science-and-more/#comments</comments>
		<pubDate>Mon, 01 Jun 2009 13:03:40 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[John Wilbanks]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Open Data Commons]]></category>
		<category><![CDATA[Science Commons]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=653</guid>
		<description><![CDATA[Image by mecredis via Flickr My latest podcast is with John Wilbanks, the VP at Creative Commons with responsibility for their Science Commons project. John has a varied background that includes founding a bio-informatics startup, Harvard&#8217;s Berkman Center, the World Wide Web Consortium and the US Congress. In his current role at Science Commons, he [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 250px;">
<dt class="wp-caption-dt"><a href="http://www.flickr.com/photos/97285730@N00/518672515"><img title="John Wilbanks, Science Commons" src="http://farm1.static.flickr.com/241/518672515_36ebcf9e70_m.jpg" alt="John Wilbanks, Science Commons" width="240" height="160" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image by <a href="http://www.flickr.com/photos/97285730@N00/518672515">mecredis</a> via Flickr</dd>
</dl>
</div>
</div>
<p>My latest podcast is with <a class="zem_slink" title="John Wilbanks" rel="wikipedia" href="http://en.wikipedia.org/wiki/John_Wilbanks">John Wilbanks</a>, the VP at <a class="zem_slink" title="Creative Commons" rel="homepage" href="http://creativecommons.org/">Creative Commons</a> with responsibility for their <a class="zem_slink" title="Science Commons" rel="homepage" href="http://sciencecommons.org/">Science Commons</a> project.</p>
<p>John has a varied background that includes founding a bio-informatics startup, Harvard&#8217;s Berkman Center, the World Wide Web Consortium and the US Congress.</p>
<p>In his current role at Science Commons, he is working to ensure that the outputs of publicly funded science become more available; both for other scientists to use, and for the wider public. The successes of the Open Access movement have led to greater visibility for scientific <em>papers</em>, but the data upon which those papers depend still tends to be difficult to locate.</p>
<p></p>
<p>We discuss initiatives at Science Commons and elsewhere, and consider some of the barriers to a change in approach.</p>
<p><em>Production of this podcast was supported by </em><a class="zem_slink" title="Talis Group" rel="homepage" href="http://www.talis.com/"><em>Talis</em></a><em>, and </em><a href="http://blogs.talis.com/xiphos/2009/06/01/john-wilbanks-talks-about-open-data-and-science-commons/"><em>show notes</em></a><em> are available on their <a href="http://blogs.talis.com/xiphos/">Xiphos</a> </em><em>blog.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.cbc.ca/technology/story/2009/03/20/tech-090320-mit-open-access.html%3Fref%3Drss&amp;a=3884641&amp;rid=639a2846-e0e9-40d0-996b-3e8a83f8afb8&amp;e=f108c321e662d9768a7856a9c7b27cf6">MIT opens access to its research articles</a> (cbc.ca)</li>
<li class="zemanta-article-ul-li"><a href="http://sciencecommons.org/weblog/archives/2009/05/12/wilbanks-cse-talk/"> Wilbanks&#8217; talk on &#8216;Knowledge Interoperability&#8217; from CSE now online </a> (sciencecommons.org)</li>
<li class="zemanta-article-ul-li"><a href="http://opendotdotdot.blogspot.com/2009/03/open-science-closed-source.html">Open Science, Closed Source</a> (opendotdotdot.blogspot.com)</li>
<li class="zemanta-article-ul-li"><a href="http://joi.ito.com/weblog/2009/02/28/sage---open-acc.html">Sage &#8211; Open Access Data from Merck</a> (joi.ito.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/81258ef5-1065-4599-9869-b2bb405a1372/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=81258ef5-1065-4599-9869-b2bb405a1372" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/06/john-wilbanks-talks-about-creative-commons-data-science-and-more/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
			<enclosure url="http://cloudofdata.com/podpress_trac/feed/653/0/twt20090529-JohnWilbanks.mp3" length="45885149" type="audio/mpeg" />
		<itunes:duration>0:47:48</itunes:duration>
		<itunes:subtitle>



Image by mecredis via Flickr



My latest podcast is with John Wilbanks, the VP at Creative Commons with responsibility for their Science Commons project.
John has a varied background that includes founding a bio-informatics startup, Harvard[...]</itunes:subtitle>
		<itunes:summary>



Image by mecredis via Flickr



My latest podcast is with John Wilbanks, the VP at Creative Commons with responsibility for their Science Commons project.
John has a varied background that includes founding a bio-informatics startup, Harvard&#8217;s Berkman Center, the World Wide Web Consortium and the US Congress.
In his current role at Science Commons, he is working to ensure that the outputs of publicly funded science become more available; both for other scientists to use, and for the wider public. The successes of the Open Access movement have led to greater visibility for scientific papers, but the data upon which those papers depend still tends to be difficult to locate.

We discuss initiatives at Science Commons and elsewhere, and consider some of the barriers to a change in approach.
Production of this podcast was supported by Talis, and show notes are available on their Xiphos blog.
Related articles by Zemanta

MIT opens access to its research articles (cbc.ca)
 Wilbanks&#8217; talk on &#8216;Knowledge Interoperability&#8217; from CSE now online  (sciencecommons.org)
Open Science, Closed Source (opendotdotdot.blogspot.com)
Sage &#8211; Open Access Data from Merck (joi.ito.com)

</itunes:summary>
		<itunes:keywords>Podcast</itunes:keywords>
		<itunes:author>Paul Miller</itunes:author>
		<itunes:explicit>no</itunes:explicit>
		<itunes:block>no</itunes:block>
	</item>
		<item>
		<title>Synergies in Big Data ?</title>
		<link>http://cloudofdata.com/2009/01/synergies-in-big-data/</link>
		<comments>http://cloudofdata.com/2009/01/synergies-in-big-data/#comments</comments>
		<pubDate>Fri, 16 Jan 2009 18:35:18 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[Digging into Data Challenge]]></category>
		<category><![CDATA[Joint Information Systems Committee]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[National Endowment for the Humanities]]></category>
		<category><![CDATA[National Science Foundation]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Open Data Commons]]></category>
		<category><![CDATA[Social Sciences and Humanities Research Council]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://1155249679</guid>
		<description><![CDATA[Image via Wikipedia A short post, sparked by two related items that arrived by Twitter and email almost simultaneously. Via Twitter, TechnologyReview reports that it&#8217;s getting easier to visualise massive data sets without the traditional supercomputer. Via email, the UK&#8217;s Joint Information Systems Committee (JISC) announces its contribution of £200,000 to a joint funding call [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img">
<div>
<dl class="wp-caption alignright" style="margin: 1em; float: right; display: block; width: 212px;">
<dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/Image:Processor_board_cray-1_hg.jpg"><img title="Processor board of a CRAY YMP vector computer ..." src="http://upload.wikimedia.org/wikipedia/commons/thumb/8/81/Processor_board_cray-1_hg.jpg/202px-Processor_board_cray-1_hg.jpg" alt="Processor board of a CRAY YMP vector computer ..." width="202" height="126" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://commons.wikipedia.org/wiki/Image:Processor_board_cray-1_hg.jpg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>A short post, sparked by two related items that arrived by Twitter and email almost simultaneously.</p>
<p>Via Twitter, <em>TechnologyReview</em> <a href="http://www.technologyreview.com/computing/21976/?a=f">reports</a> that it&#8217;s getting easier to visualise massive data sets without the traditional supercomputer.</p>
<p>Via email, the UK&#8217;s <a href="http://www.jisc.ac.uk/">Joint Information Systems Committee</a> (JISC) announces its contribution of £200,000 to a joint funding call with the United States&#8217; <a href="http://www.nsf.gov/">National Science Foundation</a> (NSF) and <a href="http://www.neh.gov/">National Endowment for the Humanities</a> (NEH), and Canada&#8217;s <a href="http://www.sshrc.ca/">Social Sciences and Humanities Research Council</a> (SSHRC). The scope of the call? <a href="http://www.diggingintodata.org/">The challenge of working with Big Data</a>. I&#8217;m very keen to see the bids to that particular call&#8230; and to see the innovation that answering its challenges will (hopefully!) spark.</p>
<p>Add to that <a href="http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/">my post last month</a> about Amazon&#8217;s Public Data sets, and  the signs are getting ever-stronger that there are real opportunities here.</p>
<p>It&#8217;s getting easier to manage massive data sets. Those data sets are increasingly addressable <em>over the Web</em>. There&#8217;s growing interest in working across and between data sets, and there <a href="http://blogs.talis.com/nodalities/2007/12/talis_and_creative_commons_lau.php">are even licenses to ensure that researchers can do what they need to</a>.</p>
<p>Interesting times, indeed.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/d16b94e3-9023-4183-a814-ad2d205bc307/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=d16b94e3-9023-4183-a814-ad2d205bc307" alt="Reblog this post [with Zemanta]" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/01/synergies-in-big-data/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Amazon Public Data Sets bring the Cloud of Data closer</title>
		<link>http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/</link>
		<comments>http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/#comments</comments>
		<pubDate>Tue, 16 Dec 2008 15:32:08 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[PaaS]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[Amazon S3]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Amazon.com]]></category>
		<category><![CDATA[Cloud of Data]]></category>
		<category><![CDATA[Google App Engine]]></category>
		<category><![CDATA[Licensing]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Open Data Commons]]></category>
		<category><![CDATA[Public Domain]]></category>
		<category><![CDATA[Talis]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=199</guid>
		<description><![CDATA[Image via CrunchBase, source unknown It began, as so many things do these days, with an idle tweet. On 21 November, Amazon Web Services&#8216; Deepak Singh pointed to a new page describing the company&#8217;s &#8216;Public Data Sets on Amazon Web Services.&#8217; Lidija Davis covered the news for ReadWriteWeb two days later and on 4 December [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img">
<div>
<dl class="wp-caption alignright" style="margin: 1em; float: right; display: block; width: 210px;">
<dt class="wp-caption-dt"><a href="http://www.crunchbase.com/company/amazon"><img title="Image representing Amazon as depicted in Crunc..." src="http://www.crunchbase.com/assets/images/resized/0000/3898/3898v1-max-450x450.jpg" alt="Image representing Amazon as depicted in Crunc..." width="200" height="89" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://www.crunchbase.com">CrunchBase</a>, source unknown</dd>
</dl>
</div>
</div>
<p>It began, as so many things do these days, with an idle <a href="http://en.wikipedia.org/wiki/Twitter">tweet</a>.</p>
<p>On 21 November, <a href="http://aws.amazon.com/">Amazon Web Services</a>&#8216; <a href="http://mndoci.com/blog/about/" class="broken_link">Deepak Singh</a> <a href="http://twitter.com/mndoci/status/1016646762">pointed</a> to a new page describing the company&#8217;s &#8216;<a href="http://aws.amazon.com/publicdatasets/">Public Data Sets on Amazon Web Services</a>.&#8217;</p>
<p><a href="http://www.readwriteweb.com/about_Lidija.php">Lidija Davis</a> <a href="http://www.readwriteweb.com/archives/amazon_web_services_seeks_publ.php">covered the news</a> for ReadWriteWeb two days later and on 4 December Amazon issued its <a href="http://phx.corporate-ir.net/phoenix.zhtml?c=176060&amp;p=irol-newsArticle&amp;ID=1232302&amp;highlight=">formal press release</a>, prompting a flurry of coverage from Mike Arrington at <a href="http://www.techcrunch.com/2008/12/04/amazon-launches-public-data-sets-to-ease-research/">TechCrunch</a>, Larry Dignan at <a href="http://blogs.zdnet.com/BTL/?p=11081">ZDNet</a>, Krishnan Subramanian at <a href="http://www.cloudave.com/link/amazon-tries-to-lure-scientific-community-into-the-clouds">CloudAve</a>, and many others.</p>
<p>Alongside broader discussion of this move, members of the <a href="http://www.w3.org/">W3C</a>-backed <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData">Linking Open Data project</a> delved into the synergies via their <a href="http://lists.w3.org/Archives/Public/public-lod/">public mailing list</a> and Linking Open Data enthusiast <a class="zem_slink" title="Kingsley Idehen" rel="crunchbase" href="http://www.crunchbase.com/person/kingsley-idehen">Kingsley Idehen</a>&#8216;s <a href="http://www.openlinksw.com/">company</a> issued a <a href="http://www.earthtimes.org/articles/show/openlink-bolsters-semantic-web-vision,648977.shtml">Press Release</a> suggesting ways in which their products might fit within this shifting data landscape.</p>
<p>So what have Amazon done, what does it mean, and how does it &#8216;bring the Cloud of Data closer&#8217; as the title of this post suggests?</p>
<p>Amazon&#8217;s <a href="http://aws.amazon.com/publicdatasets/">web page</a> describes their offer quite succinctly;</p>
<blockquote><p>&#8220;Public Data Sets on <span class="caps">AWS</span> provides a centralized repository of public data sets that can be seamlessly integrated into <span class="caps">AWS</span> cloud-based applications.  <span class="caps">AWS</span> is hosting the public data sets at no charge for the community, and like all <span class="caps">AWS</span> services, users pay only for the compute and storage they use for their own applications.&#8221;</p></blockquote>
<p>As Krishnan noted in his post,</p>
<blockquote><p>&#8220;By doing this, Amazon is helping research community save money on storage and  bandwidth costs associated with assessing these public data from any EC2  instances they use in their research. When the data in question is in hundreds  of terabytes or petabytes, we are talking about huge cost savings here.&#8221;</p></blockquote>
<p>In addition, OpenLink&#8217;s <a href="http://www.earthtimes.org/articles/show/openlink-bolsters-semantic-web-vision,648977.shtml">press release</a> gives an indication of the efficient manner in which services and data <em>already hosted by Amazon</em> can be plugged together as needed;</p>
<blockquote><p>&#8220;As a vital contribution to the momentum behind the burgeoning Web of Linked Data, [OpenLink's product] Virtuoso provides a simple deployment mechanism for highly integrated knowledge bases emerging from the Linking Open Data community. For example, it is now possible to deploy personal or service-specific renditions of <a href="http://dbpedia.org/About">DBpedia</a> within 1.5 hours, compared to an 8 &#8211; 22 hour effort when performed from scratch.&#8221;<br />
(my links)</p></blockquote>
<p>By offering free hosting for public data, then, Amazon are doing the wider community a huge service. Much of the data there today is reasonably readily available from other sources, so the biggest immediate benefits are those of speed and cost outlined above by Krishnan and OpenLink. For existing or potential users of Amazon&#8217;s Web Services to power their applications, this is yet another reason to consider Amazon.</p>
<p><a class="zem_slink" title="Harvard Medical School" rel="homepage" href="http://hms.harvard.edu/">Harvard Medical School</a>&#8216;s Dr. Peter Tonellato was quoted in Amazon&#8217;s release, and he is unlikely to be alone;</p>
<blockquote><p>&#8220;<span class="ccbnTxt">Public Data Sets on AWS will enable me and many of my colleagues to       collaborate with each other by sharing our commonly used data sets,       research environments and tools. We can set up a controlled environment in       minutes, run our computational analysis for a couple of hours, and shut       down the environment. Our results are completely repeatable. I only pay       for the compute time I use, and more importantly I can spend more time       focusing on research, not downloading and setting up computational       infrastructure.</span>&#8220;</p></blockquote>
<p>The bigger long-term contribution of this Amazon initiative may actually lie with data that are difficult or impossible to find online today. In a previous existence at the <a href="http://ads.ahds.ac.uk/">Archaeology Data Service</a> (ADS), for example, my colleagues and I were always being contacted by individuals and organisations with data that they <em>wanted</em> to see online; individuals and organisations that lacked the skills, resources or mandate to mount and maintain the data themselves. How many of those organisations will <a href="http://aws.amazon.com/publicdatasets/#3">beat a path to Amazon&#8217;s door</a> now&#8230; and what sort of resource might we see emerge as a result?</p>
<p>However&#8230;</p>
<p>Krishnan concludes his post with a reality-check, commenting;</p>
<blockquote><p>&#8220;this data stored on AWS servers are useful only if the researchers use Amazon  EC2 for their computing needs&#8230; even if  they could tap into it from external platforms, it doesn’t mean much if these public datasets are  accessible using some kind of API from their original source itself.&#8221;</p></blockquote>
<p>In other words, much (most? all?) of the advantage Amazon is offering evaporates if developers then have to pull the hosted data off Amazon&#8217;s servers and into their own applications running locally or via a competing Cloud provider such as Google.</p>
<p>Although the way in which it is recognised and monetised is finally shifting, data is still valuable, and Amazon (and others) clearly recognise the benefits of enticing users to entrust data to <em>their</em> offering, whilst (almost) imperceptibly making it that little bit more painful to use the data somewhere else.</p>
<p>Kingsley Idehen is <a href="http://www.earthtimes.org/articles/show/openlink-bolsters-semantic-web-vision,648977.shtml">quoted</a> as saying,</p>
<blockquote><p>&#8220;The Web&#8217;s potential as a globally distributed information space that plugs into disparate databases has never been in question. What has remained unclear is how a federated Web of linked databases would be delivered in a manner consistent with the Web&#8217;s core architecture, without compromising its simplicity.&#8221;</p></blockquote>
<p>It is in moving us toward this open vision that Amazon&#8217;s offering (although undoubtedly an important step along the way) is ultimately lacking. For that, we may well require the open and linked approach of Semantic Web offerings from companies such as <a href="http://www.talis.com/platform/">Talis</a> and Kingsley&#8217;s OpenLink. These recognise the futility of expecting all data to migrate to a single service provider, whilst still ensuring that those on the &#8216;inside&#8217; may gain the benefits of proximity on the network, pre-computation of certain indices, etc. Amazon and its services clearly have a place within that emerging ecosystem, but it is a place that they will need to share with others.</p>
<p>The worthwhile philanthropic aspects of Amazon&#8217;s announcement apart, the company is certainly doing its part to evangelise the benefits of moving data to the Cloud, and this is to be wholeheartedly welcomed.</p>
<p>CIOs are recognising the benefits of Cloud-based computation, and their resistance to the loss of control implied by individual cost centres&#8217; embracing of SaaS solutions such as Salesforce is diminishing. The proposition of accessing <em>data</em> in the Cloud, at will, is even more profound, and the benefits to be gained require careful and compelling explanation in the face of inevitable fears regarding issues such as data integrity.</p>
<p>Showing everyone the benefits to be gained in sharing disparate <em>public</em> data sets is one more step along the way to widespread acceptance of the value in easing restrictions over access to more sensitive resources.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/link/the-evolution-of-an-all-encompassing-world-of-clouds">The evolution of an all encompassing world of clouds</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/amazon_web_services_bigger_than_amazon.php">Amazon Web Services: Bigger Than Amazon</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.xconomy.com/seattle/2008/12/04/public-data-sets-go-on-amazons-cloud/">Public Data Goes on Amazon&#8217;s Cloud</a></li>
<li class="zemanta-article-ul-li"><a href="http://blogs.ft.com/techblog/2008/12/the-amazon-cloud-no-longer-a-mid-altantic-kludge/">The Amazon Cloud: no longer a mid-Altantic kludge</a></li>
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/link/amazon-tries-to-lure-scientific-community-into-the-clouds">Amazon Tries to Lure Scientific Community into the Clouds</a></li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/8e52f4ea-d3e4-4217-8ac8-03193483b71e/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=8e52f4ea-d3e4-4217-8ac8-03193483b71e" alt="Reblog this post [with Zemanta]" /></a></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2008/12/amazon-public-data-sets-bring-the-cloud-of-data-closer/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>

