<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
		xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data &#187; Panton Principles</title>
	<atom:link href="http://cloudofdata.com/tag/panton-principles/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</description>
	<lastBuildDate>Thu, 17 May 2012 15:04:40 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud Computing, Semantic Web, Linked Data, Open Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.com/logo300x300.jpg" />
		<item>
		<title>Repositories in the Cloud? Why on earth not?!</title>
		<link>http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/</link>
		<comments>http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/#comments</comments>
		<pubDate>Sun, 21 Feb 2010 18:05:42 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Academic publishing]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Andy Powell]]></category>
		<category><![CDATA[Archives]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[Colleges and Universities]]></category>
		<category><![CDATA[Eduserv]]></category>
		<category><![CDATA[Higher Education]]></category>
		<category><![CDATA[infochimps]]></category>
		<category><![CDATA[Institutional repository]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[Open access]]></category>
		<category><![CDATA[Panton Principles]]></category>
		<category><![CDATA[repcloud]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software as a service]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=932</guid>
		<description><![CDATA[To be honest, I&#8217;ve never fully understood Higher Education&#8217;s penchant for building &#8216;institutional repositories.&#8217; These frequently under-populated aggregations of academic papers produced by &#8216;research active&#8217; employees of a particular university appear aligned almost exclusively to vaguely expressed institutional imperatives, and seem largely unrelated to either the selfish aspirations of the contributing authors or the tangible [...]]]></description>
			<content:encoded><![CDATA[<p>To be honest, I&#8217;ve never fully understood Higher Education&#8217;s penchant for building &#8216;<a class="zem_slink freebase/en/institutional_repository" title="Institutional repository" rel="wikipedia" href="http://en.wikipedia.org/wiki/Institutional_repository">institutional repositories</a>.&#8217; These frequently under-populated aggregations of academic papers produced by &#8216;research active&#8217; employees of a particular university appear aligned almost exclusively to vaguely expressed institutional imperatives, and seem largely unrelated to either the selfish aspirations of the contributing authors or the tangible relationships they painstakingly construct with others across their chosen discipline. The &#8216;repository&#8217; all too often appears a bureaucratic solution to a problem that the supposed beneficiaries do not recognise; a technological aberration that sits outside the conversational flow of the Web to which it is only tenuously attached.</p>
<p>Furthermore, &#8216;<a class="zem_slink freebase/en/open_access" title="Open access (publishing)" rel="wikipedia" href="http://en.wikipedia.org/wiki/Open_access_%28publishing%29">Open Access</a>&#8216; and &#8216;Repository&#8217; typically go hand in hand. If you support Open Access you need a repository, and if you question the role of repositories you&#8217;re in the pocket of evil publishers who want to lock up everything ever written and lease reading rights back to the employers of those who wrote the stuff in the first place.</p>
<p>Nonsense.</p>
<p>Open Access is an important component of today&#8217;s scholarly ecosystem. It&#8217;s not the only answer, and it&#8217;s not perfect, but it <em>does</em> have a significant part to play. Institutions have a role in preserving, disseminating and exploiting the work of their employees, but these are very different tasks that may benefit from different solutions. In too many cases, the repository is by default seen as a preservation mechanism <em>and</em> a dissemination vehicle, and as such it may fail to cost-effectively achieve either aim.</p>
<p>There are some large, well known, and research-intensive institutions where it might be possible to make a compelling argument for projecting a strong institutional image around a single &#8216;home&#8217; for all of that research output. Never mind, for a moment, that so much research today is the result of inter-institutional collaboration, or that the eminent researcher might wish to take &#8216;their&#8217; research publications with them as they move from Oxford to Harvard to York during their glittering career.</p>
<p>Alongside those institutions sit a plethora of others where research of equal quality is also being conducted; there just, maybe, isn&#8217;t quite as much of it. Bombarded by &#8216;advice&#8217; and funding, and desperate to keep up with the <a class="zem_slink freebase/en/russell_group" title="Russell Group" rel="wikipedia" href="http://en.wikipedia.org/wiki/Russell_Group">Russell Group</a>, ever-more institutions blindly join the repository cult and wonder why their new toys do not fill to overflowing with the jewels of scholarly erudition.</p>
<p>As research becomes increasingly data-rich, the whole cycle looks set to repeat. The recently released <a href="http://pantonprinciples.org/">Panton Principles</a> for <a class="zem_slink freebase/en/open_data" title="Open Data" rel="wikipedia" href="http://en.wikipedia.org/wiki/Open_Data">Open Data</a> in Science are to be welcomed, but I&#8217;ll bet the institutional response will all too often be the commissioning of a &#8216;data repository&#8217; to sit alongside the &#8216;publication repository&#8217; they already don&#8217;t use.</p>
<p>All of which is a rather long-winded way of introducing the fact that Eduserv&#8217;s <a class="zem_slink" title="Andy Powell" rel="twitter" href="http://twitter.com/andypowe11">Andy Powell</a> has asked me to facilitate a breakout afternoon on &#8216;Policy Issues&#8217; at the <a href="http://www.eduserv.org.uk/events/repcloud" class="broken_link">Repositories in the Cloud</a> event <a href="http://www.eduserv.org.uk/research">Eduserv</a> and <a class="zem_slink freebase/en/joint_information_systems_committee" title="Joint Information Systems Committee" rel="wikipedia" href="http://en.wikipedia.org/wiki/Joint_Information_Systems_Committee">JISC</a> are holding in London on Tuesday.</p>
<blockquote><p>&#8220;This free event, organised jointly by Eduserv and the JISC, will bring together software developers, repository managers, service providers, funding and advisory bodies to discuss the potential policy and technical issues associated with <strong>cloud computing</strong> and the delivery of <strong>repository services</strong> in UK HEIs.&#8221;</p></blockquote>
<p>In a post on 11 February, <a href="http://efoundations.typepad.com/efoundations/2010/02/repositories-and-the-cloud-tell-us-your-views.html">Andy invited participants to share some of their views</a> ahead of the meeting, and on 19 February <a href="http://efoundations.typepad.com/efoundations/2010/02/in-the-clouds.html">he wrote about some of his own thoughts</a>.</p>
<p>Like Andy, I struggled somewhat to nail down a coherent set of thoughts about the issue of pushing today&#8217;s repositories into the Cloud. On one level, I wonder whether the vast majority of institutions with small (and relatively low traffic) repositories would see much of a tangible efficiency gain or cost saving by moving off an in-house computer to rent an equivalent <a class="zem_slink freebase/en/virtual_machine" title="Virtual machine" rel="wikipedia" href="http://en.wikipedia.org/wiki/Virtual_machine">Virtual Machine</a> from Amazon, Rackspace, or any of their competitors. If we&#8217;re talking about IT systems within a typical university, there are others (email, calendaring, pools of compute resource for research jobs, etc) that appear more immediately compelling for the shift Cloud-ward. Which is not to say that there isn&#8217;t a clear opportunity for someone trusted to step into this space and offer a <a class="zem_slink freebase/en/software_as_a_service" title="Software as a service" rel="wikipedia" href="http://en.wikipedia.org/wiki/Software_as_a_service">SaaS</a> repository to which institutions might affordably subscribe. Eduserv? Mimas? Edina? The British Library? The National Archives? Duraspace? Any could, and if we&#8217;re not ready for something more then at least one probably should.</p>
<p>However, a bolder reconsideration of what repositories <em>are</em> and what they&#8217;re <em>for</em> might very well lead to something interesting, sustainable, and perfectly suited for benefitting from Cloud Computing&#8217;s strengths.</p>
<p>Why does a paper have to be &#8216;deposited&#8217; in a repository? Why does a single paper with three authors from three institutions have to be deposited in three separate institutional repositories? Why does that same paper have to be deposited – separately – in the subject repository favoured by scholars in the relevant discipline? Why does the institution&#8217;s very reasonable desire to protect, preserve, promote and disseminate its excellence mean that it has to run systems in perpetuity that preserve and permit access? Why do we address the fundamentally different (perhaps even contradictory) problems of access and preservation in the same system? Why can&#8217;t the individual researcher easily assemble a view across their publication history, regardless of the institution within which they happened to reside as they wrote each paper? Why don&#8217;t the assemblages of papers reflect personal, professional and disciplinary relationships, alongside (or instead of) the contractual accident of employee-employer relationships? Why isn&#8217;t the wealth of metadata implicit to any publication (authors, subjects, dates, citations, and more) available and actionable, both inside the repository and far beyond it across the Web? Why isn&#8217;t there a tight and active association between the paper and the data from which its findings were derived (something for which <em><a href="http://intarch.ac.uk/">Internet Archaeology</a></em> was demonstrating utility a very long time ago)?</p>
<p>Scholarly papers principally comprise text, augmented by the occasional static image. They&#8217;re not big, and they don&#8217;t tend to change very fast. In many ways, they represent a fairly easy problem set with which to work. As more and more data becomes key to research in a growing number of subject areas, the problems are set to become far larger and far more difficult. For individual universities to even consider replicating the process by which they all ended up with their repositories of text surely seems madness in this data-rich environment. Even with levels of uptake as low as those seen in too many text repositories, the issues of data management, curation, access and dissemination are too great to be sensibly solved in the institutional machine room. Services like <a href="http://infochimps.org/">InfoChimps</a> and Amazon&#8217;s own <a href="http://aws.amazon.com/publicdatasets/">Public Data Sets</a> offering show some of the ways that we might begin to work with data at scale. Might we, for example, come to recognise as Amazon has that it&#8217;s actually cheaper and quicker to entrust large data sets to FedEx rather than transmit them over the Internet?</p>
<p>&#8216;The answer&#8217; might be some central service for the community, funded by JISC like the Arts &amp; Humanities Data Service (AHDS) of old. Or it might be something different, something nimbler, more responsive, more flexible to individual, institutional, and disciplinary requirements, and something more scalable to new disciplines; institutional support for and use of <em>existing</em> Cloud infrastructures extending far beyond UK Higher Education, aligned with a clear understanding of the separation between preservation and access.</p>
<p>I certainly don&#8217;t have all the answers, but I do believe that simply asking whether or not we should move existing repositories to the Cloud is to miss the point. Rather, we should ask what role the Cloud might play in addressing the business requirements to which the institutional repository was our initial – faltering – response. The answer might very well be &#8216;None,&#8217; but I doubt it.</p>
<p>I look forward to Tuesday&#8217;s discussion. I&#8217;m not going there to push my personal view that individual institutions frequently shouldn&#8217;t be building, running or populating their own repositories at all. I&#8217;m going there to facilitate the discussion those in the room want to have, and to learn from their experiences and their perspectives.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://scholarlykitchen.sspnet.org/2010/01/07/citation-advantage-for-mandated-open-access-articles/">Does a Citation Advantage Exist for Mandated Open Access Articles?</a> (scholarlykitchen.sspnet.org)</li>
<li class="zemanta-article-ul-li"><a href="http://hangingtogether.org/?p=770">Scholarly content and the cliff edge: the place of subject &#8216;repositories&#8217;</a> (hangingtogether.org)</li>
<li class="zemanta-article-ul-li"><a href="http://www.downes.ca/cgi-bin/page.cgi?post=51742">Scholarly Communications must be Scalable</a> (downes.ca)</li>
<li class="zemanta-article-ul-li"><a href="http://opendotdotdot.blogspot.com/2010/02/beyond-open-access-open-publishing.html">Beyond Open Access: Open Publishing</a> (opendotdotdot.blogspot.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.scienceblog.com/cms/57-college-presidents-declare-support-public-access-publicly-funded-research-us-25470.html" class="broken_link">57 college presidents declare support for public access to publicly funded research in the US</a> (scienceblog.com)</li>
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.guardian.co.uk/education/2010/feb/11/academics-in-aspic-says-mandelson&amp;a=12898526&amp;rid=f65ff066-66fd-42d9-bc76-113bd6066317&amp;e=5236f562a8baffa164e8623f52cd7d44">Mandelson says academics are &#8216;set in aspic&#8217;</a> (guardian.co.uk)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/f65ff066-66fd-42d9-bc76-113bd6066317/"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=f65ff066-66fd-42d9-bc76-113bd6066317" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-info pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="al2fb_like_button"><div id="fb-root"></div><script type="text/javascript">
(function(d, s, id) {
  var js, fjs = d.getElementsByTagName(s)[0];
  if (d.getElementById(id)) return;
  js = d.createElement(s); js.id = id;
  js.src = "//connect.facebook.net/en_US/all.js#xfbml=1&appId=133647763430045";
  fjs.parentNode.insertBefore(js, fjs);
}(document, "script", "facebook-jssdk"));
</script>
<fb:like href="http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/" layout="standard" show_faces="true" width="450" action="like" font="arial" colorscheme="light" ref="AL2FB"></fb:like></div>]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

