<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data &#187; JISC</title>
	<atom:link href="http://cloudofdata.com/tag/jisc/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</description>
	<lastBuildDate>Fri, 10 Feb 2012 10:46:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud Computing, Semantic Web, Linked Data, Open Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.com/logo300x300.jpg" />
		<item>
		<title>In a world of niche Clouds, how do you define a useful niche?</title>
		<link>http://cloudofdata.com/2010/12/in-a-world-of-niche-clouds-how-do-you-define-a-useful-niche/</link>
		<comments>http://cloudofdata.com/2010/12/in-a-world-of-niche-clouds-how-do-you-define-a-useful-niche/#comments</comments>
		<pubDate>Tue, 14 Dec 2010 13:08:20 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[IaaS]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Andy Powell]]></category>
		<category><![CDATA[Data center]]></category>
		<category><![CDATA[Eduserv]]></category>
		<category><![CDATA[FleSSR]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[Joint Information Systems Committee]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[VMware]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1393</guid>
		<description><![CDATA[There are a couple of interesting posts on the blog of the UK&#8217;s FLESSR project, detailing their efforts to work out how feasible it might be to offer a new Cloud service to universities. More on that in a moment. I don&#8217;t think I&#8217;ve ever really been convinced by the argument that everything will end [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://geekandpoke.typepad.com/geekandpoke/2008/05/simply-explaine.html" target="_blank"><img class="alignright size-medium wp-image-1396" style="margin: 0px; border: 0px initial initial;" title="Simply Explained - Cloud Computing" src="http://cloudofdata.com/wp-content/uploads/2010/12/cloud-explained-300x214.jpg" alt="" width="300" height="214" /></a>There are a couple of interesting posts on the blog of the UK&#8217;s FLESSR project, detailing their efforts to work out how feasible it might be to offer a new Cloud service to universities. More on that in a moment.</p>
<p>I don&#8217;t think I&#8217;ve ever really been convinced by the argument that <em>everything</em> will end up in the data centres of <a class="zem_slink" title="Amazon EC2" rel="homepage" href="http://aws.amazon.com/ec2/">Amazon</a>.</p>
<p>The straightforward provision of commodity Cloud Computing is an important &#8211; and growing &#8211; area, and one that will continue to expand as interfaces become simpler, FUD is challenged, and prices maintain their relentless march towards the bottom. <em>Everyone</em> has <em>something</em> they could usefully, sensibly, and cost-effectively run in a commodity Cloud such as those offered by <a href="http://aws.amazon.com/">Amazon</a>, <a class="zem_slink" title="Rackspace" rel="homepage" href="http://www.rackspace.com">Rackspace</a>, <a href="http://www.flexiant.com/">Flexiant</a>, and others. In <em>this</em> space, basic stability, security and reliability combine with a compelling &#8211; and diminishing &#8211; pricing proposition to create commodity services targeted squarely at lowest-common-denominator functionality. Here, market forces may (inevitably?) lead to an eventual reduction in the number of providers. Cost, although not the only consideration, is both important and compelling. Although markets like competition, there may even be a single winner here, one day.</p>
<p>Layered all around the basic, routine, grunt-work computation that these commodity public clouds handle so well, many organisations find themselves having to cope with a wide range of <em>other</em> use cases and data sets. Some require specialist hardware (like the <a class="zem_slink" title="Graphics processing unit" rel="wikipedia" href="http://en.wikipedia.org/wiki/Graphics_processing_unit">GPUs</a> that Amazon has <a href="http://aws.typepad.com/aws/2010/11/new-ec2-instance-type-the-cluster-gpu-instance.html">recently begun selling access to</a>). Some demand particular regulatory and legislative hoops to be jumped through. Some have quirky requirements around latency in data transfer or speed of in-CPU processing. Some have <em>lots</em> of data, and issues with regard to getting the stuff from one location to another with a sensible balance between transfer cost and time.</p>
<p>All of these are certainly capable of being addressed in the Cloud, but the economics and the business rationale begin to shift. For the data owner, cost may no longer be quite so significant a factor. Reliability may matter more, or speed, or the audit trail. For the Cloud provider, these requirements no longer look like the lowest common denominator. It&#8217;s not cost-effective to provide these capabilities to <em>everyone</em> and still keep the price low. It becomes more sensible to segment, to divide, and to create bespoke offerings of various kinds. Some of these services require such specific things in terms of network topology, physical building layout, and staff expertise that it may even become counter-productive to have these services in the same building as the commodity Cloud. Here, there&#8217;s plenty of room for new entrants, plenty of scope for competition, and plenty of opportunity to differentiate in terms of price, location, support, and a host of other factors. This segment of the Cloud is only just getting started.</p>
<p>In these contexts, we see compelling arguments made for on-premise private clouds, off-premise private clouds, hybrid clouds, community clouds and the rest. Some of the arguments made in favour of private and hybrid certainly are part of the FUD we see in this space, but beneath the noise, the security scares, and the vested interests of SysAdmins and sellers of data centre components, there lies a grain of truth. Not everything is most sensibly run on a cheap VM, rented from Amazon (or Rackspace, or whoever) with your credit card, and physically located half way round the planet.</p>
<p>Unfortunately, it can be difficult to make sensible decisions about which type of cloud works best in each situation, and large swathes of the market are doing everything in their power to add to the confusion.</p>
<p>Having accepted that the basic offering from a public cloud provider is not the solution for my particular requirements, where do I turn next?</p>
<p>Do I listen to the (convincing) pitch from a vendor of &#8216;community cloud&#8217; solutions for my domain? If I&#8217;m in Healthcare, they come with HIPAA and European Data Protection Directive compliance, and all sorts of other accreditations. For dealing with sensitive patient data, this may be just what I need&#8230; but does the wily salesman <em>also</em> persuade me to run staff email and the hospital volleyball club website on this over-specified (and expensive) infrastructure?</p>
<p>Do I listen to the (convincing) pitch from a vendor of virtualisation software? If I&#8217;ve got a reasonably sized data centre with some life left in it, I may see the value of virtualising all of that expensive hardware, and running current applications in house more efficiently. But instead of gradually reducing my in-house costs, do I continue to add more machines as current ones reach end of life, or as new requirements come along?</p>
<p>Do I listen to the (convincing) pitch from my co-location facility, which happily sells me a &#8216;private cloud&#8217; that may fail to deliver some of the economies of scale so central to the main Cloud proposition?</p>
<p>Do I listen to the horror stories, stick my head in the sand, and simply keep ordering servers until every single one of my competitors undercuts my costs and I go out of business?</p>
<p>These, and more, are certainly possible. But let&#8217;s return to that UK project I mentioned right at the start.</p>
<p>Flexible Services for the Support of Research (<a href="http://flessr.blogspot.com/">FleSSR</a>) is</p>
<blockquote><p>&#8220;a new cloud pilot project looking at utilising hybrid private-public IaaS cloud infrastructure to provide computational and data services to the academic research community. The project is a collaboration between the Oxford e-Research Center, IT Service @ University of Reading, e-Science Centre @ STFC, Eduserv, EoverI, Eucalyptus INC and Canonical Ltd.&#8221;</p></blockquote>
<p>The ten-month project is funded by the Joint Information Systems Committee (<a href="http://www.jisc.ac.uk">JISC</a>), an organisation that supports the innovative use of IT across UK universities.</p>
<p>Now, to a degree, the project&#8217;s mindset must be influenced by its partners. IT staff at Reading and STFC are incumbents with turf to protect (or new vistas to discover, map, and claim). Eduserv has a new data centre that they&#8217;d like to fill with willing clients. It would be easy to be cynical, but knowing some of the people involved, I see no real reason to be. It is perfectly reasonable to suggest that a &#8216;community&#8217; the size of UK Higher Education would realise value in replicating less (not nothing) at every university campus across the country, and bringing much of that together in some sort of Cloud. That Cloud might use public infrastructure, or it might be served up from an organisation such as Eduserv, which is known to the community, aware of the community&#8217;s requirements, quirks and foibles, and (importantly) not-for-profit (and therefore cheaper?).</p>
<p>Personally, I&#8217;d always rather presumed that an organisation like Eduserv (or JISC itself) would act on behalf of the community to procure a competitive price on access to the resources of Amazon, Rackspace, or one of the others. I&#8217;m not convinced that <em>most</em> UK research computation needs any sort of special treatment that couldn&#8217;t be met from Amazon&#8217;s Dublin data centre&#8230; unless the community itself can somehow beat &#8211; and continue to beat &#8211; Amazon on price. Somewhat surprisingly, that&#8217;s exactly what some calculations in <a href="http://flessr.blogspot.com/2010/12/costs-of-storage-in-cloud.html">two</a> <a href="http://flessr.blogspot.com/2010/12/costs-of-building-storage-for-cloud.html">posts</a> by Eduserv&#8217;s Andy Powell suggest could happen. By including network costs and other charges over and above the basic storage cost, Andy finds Amazon, Rackspace and Dropbox to be more expensive than anticipated, and posits that Eduserv (connected to every UK university free of charge via JISC&#8217;s high speed <a href="http://www.ja.net/">JANET</a> service, and constrained in the ways it can generate profit from services sold to universities by its charitable status) might actually work out cheaper.</p>
<p>There&#8217;s a lot of work to do in terms of fleshing out the assumptions behind some of Andy&#8217;s figures, but the whole industry certainly benefits when people conduct exercises like these out in the open, for all to see. If Andy has made mistakes, the vendors should be quick to jump in and correct them. If his assumptions miss the mark, public debate can redress the balance.</p>
<p>The Cloud is not all about price. But more transparency around the true cost of computing in the Cloud &#8211; and in your data centre &#8211; means that we can all make more informed decisions.</p>
<p>Thanks for sharing, Andy &#8211; and hopefully readers will be willing and able to look over your calculations and share their own views.</p>
<p><strong>Note</strong>: <em>this post was conceived and written in the United Kingdom. By reading this post you agree to comply with UK usage, and will henceforth pronounce the word &#8216;niche&#8217; from the title as &#8216;neesh,&#8217; not &#8216;nitch.&#8217; Or maybe not.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.zdnet.com/blog/btl/rackspace-launches-managed-cloud-services/42436">Rackspace launches managed cloud services</a> (zdnet.com)</li>
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2010/12/06/cloud-computing-public-private-hybrid-demistified/">Are hybrid clouds the path to cloud-computing nirvana?</a> (venturebeat.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.rackspacecloud.com/blog/2010/12/14/test/" class="broken_link">We&#8217;ll Take Care of Your Cloud, While You Manage Your Business</a> (rackspacecloud.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/8675/trust-is-key-for-cloud-success-and-what-can-we-do-about-it/">Trust Is Key For Cloud Success And What Can We Do About It?</a> (cloudave.com)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/12/in-a-world-of-niche-clouds-how-do-you-define-a-useful-niche/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Do we need a Registry&#8230; to register technologists&#8217; numerous contradictory uses of the term?</title>
		<link>http://cloudofdata.com/2010/10/do-we-need-a-registry-to-register-technologists-numerous-contradictory-uses-of-the-term/</link>
		<comments>http://cloudofdata.com/2010/10/do-we-need-a-registry-to-register-technologists-numerous-contradictory-uses-of-the-term/#comments</comments>
		<pubDate>Fri, 15 Oct 2010 11:08:17 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[ckan]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[Joint Information Systems Committee]]></category>
		<category><![CDATA[registry]]></category>
		<category><![CDATA[uddi]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1189</guid>
		<description><![CDATA[The act of registration seems ingrained in all of us, especially where dealing with &#8216;authority.&#8217; That seemingly random agglomeration of letters and numbers on the back and front of our cars? A registration number. We visit the Registry Office in order to make births, marriages and deaths official. Indeed, in the dark days before it [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://carscoop.blogspot.com/2009/06/name-plate-bond-007-license-plate-up.html"><img class="alignright size-medium wp-image-1191" title="A car registration" src="http://cloudofdata.com/wp-content/uploads/2010/10/name-plate-bond-007-license-plate-up-300x173.jpg" alt="" width="300" height="173" /></a>The act of <em>registration</em> seems ingrained in all of us, especially where dealing with &#8216;authority.&#8217; That seemingly random agglomeration of letters and numbers on the back and front of our cars? A <em>registration number</em>. We visit the <em>Registry Office</em> in order to make births, marriages and deaths official. Indeed, in the dark days before it was <a href="http://your.asda.com/asda-wedding-couple" class="broken_link">permissible to marry</a> between the frozen peas and the fish fingers, the Registry Office was the place to which the atheistic, agnostic, and otherwise religion-averse turned in order to conduct the wedding ceremony itself.</p>
<p>As with so many words, <em>registry</em> is one that the technology sector coopted and quickly began to render almost meaningless by adding layer upon layer of obscure nuance. For Windows users (at least back when I last used Windows), the Registry was the place you went to disable whatever was responsible for slowing your machine to a painful crawl. On the web, registry shares some of the attributes of library, directory, repository, and similar concepts; it is typically a place into which some resource, function, or package is registered for later discovery and use by others. Personally, I&#8217;ve always tended to presume that registries are differentiated from those similar terms by being intended wholly or predominantly for use by machines rather than people; a registry is a place for <em>software</em> to find the data it needs, isn&#8217;t it?</p>
<p>Registries crop up regularly in discussion of curated online resources, typically when someone says &#8220;We need a registry for <em>x</em>&#8221; or (increasingly), &#8220;The Web is the registry for <em>y</em>.&#8221; To synthesise some of the different perspectives here, and to really look at whether or not the Web could be the registry, the UK&#8217;s Joint Information Systems Committee (<a href="http://www.jisc.ac.uk/">JISC</a>) has asked me to conduct a short desk-based project which is now well underway.</p>
<p>In the education/ cultural heritage/ government contexts within which JISC operates, there are plenty of &#8216;registries&#8217; to look at; the venerable <a href="http://iesr.ac.uk/">Information Environment Service Registry</a>, the <a href="http://www.ckan.net/">CKAN</a> <a href="http://data.gov.uk/data">instance</a> sitting behind <a href="http://data.gov.uk/">data.gov.uk</a>, America&#8217;s new <a href="http://www.learningregistry.org/">National Learning Registry</a>, Australia&#8217;s <a href="http://www.apsr.edu.au/orca/index.htm">Registry of Research Collections</a> and more. But what do they have in common, how are they differentiated, and where are the similar examples from the business world? Since the decline in enthusiasm for standards like <a href="http://en.wikipedia.org/wiki/Universal_Description_Discovery_and_Integration">UDDI</a>, are formal registries in decline?</p>
<p>I&#8217;m certainly not seeing mainstream uses of the term that seem &#8216;wrong&#8217;, but there&#8217;s clearly scope to develop some sort of simple taxonomy to capture the various functions for which someone might propose a registry-based solution. Having done that, I can then finish by looking at the extent to which the architecture of the Web can reliably and sustainably support those functions.</p>
<p>So&#8230; what do <em>you</em> think a registry is, and do you have any great examples of that use case? Is it best done in some centralised and organised fashion, or can resources spread out across the web simply self-assemble as required?</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.seerc.org/fusion/semanticregistry/">FUSION Semantic Registry</a> (seerc.org)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/10/do-we-need-a-registry-to-register-technologists-numerous-contradictory-uses-of-the-term/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Final version of &#8216;Linked Data Horizon Scan&#8217; now available online</title>
		<link>http://cloudofdata.com/2010/02/final-version-of-linked-data-horizon-scan-now-available-online/</link>
		<comments>http://cloudofdata.com/2010/02/final-version-of-linked-data-horizon-scan-now-available-online/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 12:05:13 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[dev8d]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[jisclinkeddata]]></category>
		<category><![CDATA[Joint Information Systems Committee]]></category>
		<category><![CDATA[LinkedData]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=938</guid>
		<description><![CDATA[Last year, I published a draft version of the Linked Data Horizon Scan that I had been commissioned to write for the UK&#8217;s Joint Information Systems Committee (JISC). The final version of that report is available today, both in commentable form via JISC&#8217;s JISCPress tool and for download as a PDF. JISC&#8217;s associated call to [...]]]></description>
			<content:encoded><![CDATA[<p>Last year, I published a draft version of the <em><a href="http://cloudofdata.com/2009/12/draft-report-explores-linked-data-potential-in-uk-universities/">Linked Data Horizon Scan</a></em> that I had been commissioned to write for the UK&#8217;s Joint Information Systems Committee (<a href="http://www.jisc.ac.uk/">JISC</a>).</p>
<p>The final version of that report is available today, both <a href="http://linkeddata.jiscpress.org/">in commentable form via JISC&#8217;s JISCPress tool</a> and for <a href="http://cloudofdata.s3.amazonaws.com/FINAL-201001-LinkedDataHorizonScan.pdf">download</a> as a PDF.</p>
<p>JISC&#8217;s associated call to fund Linked Data projects will be available imminently, and £750,000 is available to share between successful bidders. Watch <a href="http://www.jisc.ac.uk/fundingopportunities.aspx">the JISC funding pages</a>, and get ready to bid. If you&#8217;re not inside a UK university yourself, then find a friendly partner who is… as projects have to be led by a University. And good luck!</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2009/12/draft-report-explores-linked-data-potential-in-uk-universities/">DRAFT report explores Linked Data potential in UK Universities</a> (cloudofdata.com)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/02/final-version-of-linked-data-horizon-scan-now-available-online/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Repositories in the Cloud? Why on earth not?!</title>
		<link>http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/</link>
		<comments>http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/#comments</comments>
		<pubDate>Sun, 21 Feb 2010 18:05:42 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Academic publishing]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Andy Powell]]></category>
		<category><![CDATA[Archives]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[Colleges and Universities]]></category>
		<category><![CDATA[Eduserv]]></category>
		<category><![CDATA[Higher Education]]></category>
		<category><![CDATA[infochimps]]></category>
		<category><![CDATA[Institutional repository]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[Open access]]></category>
		<category><![CDATA[Panton Principles]]></category>
		<category><![CDATA[repcloud]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software as a service]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=932</guid>
		<description><![CDATA[To be honest, I&#8217;ve never fully understood Higher Education&#8217;s penchant for building &#8216;institutional repositories.&#8217; These frequently under-populated aggregations of academic papers produced by &#8216;research active&#8217; employees of a particular university appear aligned almost exclusively to vaguely expressed institutional imperatives, and seem largely unrelated to either the selfish aspirations of the contributing authors or the tangible [...]]]></description>
			<content:encoded><![CDATA[<p>To be honest, I&#8217;ve never fully understood Higher Education&#8217;s penchant for building &#8216;<a class="zem_slink freebase/en/institutional_repository" title="Institutional repository" rel="wikipedia" href="http://en.wikipedia.org/wiki/Institutional_repository">institutional repositories</a>.&#8217; These frequently under-populated aggregations of academic papers produced by &#8216;research active&#8217; employees of a particular university appear aligned almost exclusively to vaguely expressed institutional imperatives, and seem largely unrelated to either the selfish aspirations of the contributing authors or the tangible relationships they painstakingly construct with others across their chosen discipline. The &#8216;repository&#8217; all too often appears a bureaucratic solution to a problem that the supposed beneficiaries do not recognise; a technological aberration that sits outside the conversational flow of the Web to which it is only tenuously attached.</p>
<p>Furthermore, &#8216;<a class="zem_slink freebase/en/open_access" title="Open access (publishing)" rel="wikipedia" href="http://en.wikipedia.org/wiki/Open_access_%28publishing%29">Open Access</a>&#8217; and &#8216;Repository&#8217; typically go hand in hand. If you support Open Access you need a repository, and if you question the role of repositories you&#8217;re in the pocket of evil publishers who want to lock up everything ever written and lease reading rights back to the employers of those who wrote the stuff in the first place.</p>
<p>Nonsense.</p>
<p>Open Access is an important component of today&#8217;s scholarly ecosystem. It&#8217;s not the only answer, and it&#8217;s not perfect, but it <em>does</em> have a significant part to play. Institutions have a role in preserving, disseminating and exploiting the work of their employees, but these are very different tasks that may benefit from different solutions. In too many cases, the repository is by default seen as a preservation mechanism <em>and</em> a dissemination vehicle, and as such it may fail to cost-effectively achieve either aim.</p>
<p>There are some large, well known, and research-intensive institutions where it might be possible to make a compelling argument for projecting a strong institutional image around a single &#8216;home&#8217; for all of that research output. Never mind, for a moment, that so much research today is the result of inter-institutional collaboration, or that the eminent researcher might wish to take &#8216;their&#8217; research publications with them as they move from Oxford to Harvard to York during their glittering career.</p>
<p>Alongside those institutions sit a plethora of others where research of equal quality is also being conducted; there just, maybe, isn&#8217;t quite as much of it. Bombarded by &#8216;advice&#8217; and funding, and desperate to keep up with the <a class="zem_slink freebase/en/russell_group" title="Russell Group" rel="wikipedia" href="http://en.wikipedia.org/wiki/Russell_Group">Russell Group</a>, ever more institutions blindly join the repository cult and wonder why their new toys do not fill to overflowing with the jewels of scholarly erudition.</p>
<p>As research becomes increasingly data-rich, the whole cycle looks set to repeat. The recently released <a href="http://pantonprinciples.org/">Panton Principles</a> for <a class="zem_slink freebase/en/open_data" title="Open Data" rel="wikipedia" href="http://en.wikipedia.org/wiki/Open_Data">Open Data</a> in Science are to be welcomed, but I&#8217;ll bet the institutional response will all too often be the commissioning of a &#8216;data repository&#8217; to sit alongside the &#8216;publication repository&#8217; they already don&#8217;t use.</p>
<p>All of which is a rather long-winded way of introducing the fact that Eduserv&#8217;s <a class="zem_slink" title="Andy Powell" rel="twitter" href="http://twitter.com/andypowe11">Andy Powell</a> has asked me to facilitate a breakout afternoon on &#8216;Policy Issues&#8217; at the <a href="http://www.eduserv.org.uk/events/repcloud" class="broken_link">Repositories in the Cloud</a> event <a href="http://www.eduserv.org.uk/research">Eduserv</a> and <a class="zem_slink freebase/en/joint_information_systems_committee" title="Joint Information Systems Committee" rel="wikipedia" href="http://en.wikipedia.org/wiki/Joint_Information_Systems_Committee">JISC</a> are holding in London on Tuesday.</p>
<blockquote><p>&#8220;This free event, organised jointly by Eduserv and the JISC, will bring together software developers, repository managers, service providers, funding and advisory bodies to discuss the potential policy and technical issues associated with <strong>cloud computing</strong> and the delivery of <strong>repository services</strong> in UK HEIs.&#8221;</p></blockquote>
<p>In a post on 11 February, <a href="http://efoundations.typepad.com/efoundations/2010/02/repositories-and-the-cloud-tell-us-your-views.html">Andy invited participants to share some of their views</a> ahead of the meeting, and on 19 February <a href="http://efoundations.typepad.com/efoundations/2010/02/in-the-clouds.html">he wrote about some of his own thoughts</a>.</p>
<p>Like Andy, I struggled somewhat to nail down a coherent set of thoughts about the issue of pushing today&#8217;s repositories into the Cloud. On one level, I wonder whether the vast majority of institutions with small (and relatively low traffic) repositories would see much of a tangible efficiency gain or cost saving by moving off an in-house computer to rent an equivalent <a class="zem_slink freebase/en/virtual_machine" title="Virtual machine" rel="wikipedia" href="http://en.wikipedia.org/wiki/Virtual_machine">Virtual Machine</a> from Amazon, Rackspace, or any of their competitors. If we&#8217;re talking about IT systems within a typical university, there are others (email, calendaring, pools of compute resource for research jobs, etc) that appear more immediately compelling for the shift Cloud-ward. Which is not to say that there isn&#8217;t a clear opportunity for someone trusted to step into this space and offer a <a class="zem_slink freebase/en/software_as_a_service" title="Software as a service" rel="wikipedia" href="http://en.wikipedia.org/wiki/Software_as_a_service">SaaS</a> repository to which institutions might affordably subscribe. Eduserv? Mimas? Edina? The British Library? The National Archives? Duraspace? Any could, and if we&#8217;re not ready for something more then at least one probably should.</p>
<p>However, a bolder reconsideration of what repositories <em>are</em> and what they&#8217;re <em>for</em> might very well lead to something interesting, sustainable, and perfectly suited for benefitting from Cloud Computing&#8217;s strengths.</p>
<p>Why does a paper have to be &#8216;deposited&#8217; in a repository? Why does a single paper with three authors from three institutions have to be deposited in three separate institutional repositories? Why does that same paper have to be deposited – separately – in the subject repository favoured by scholars in the relevant discipline? Why does the institution&#8217;s very reasonable desire to protect, preserve, promote and disseminate its excellence mean that it has to run systems in perpetuity that preserve and permit access? Why do we address the fundamentally different (perhaps even contradictory) problems of access and preservation in the same system? Why can&#8217;t the individual researcher easily assemble a view across their publication history, regardless of the institution within which they happened to reside as they wrote each paper? Why don&#8217;t the assemblages of papers reflect personal, professional and disciplinary relationships, alongside (or instead of) the contractual accident of employee-employer relationships? Why isn&#8217;t the wealth of metadata implicit to any publication (authors, subjects, dates, citations, and more) available and actionable, both inside the repository and far beyond it across the Web? Why isn&#8217;t there a tight and active association between the paper and the data from which its findings were derived (something for which <em><a href="http://intarch.ac.uk/">Internet Archaeology</a></em> was demonstrating utility a very long time ago)?</p>
<p>Scholarly papers principally comprise text, augmented by the occasional static image. They&#8217;re not big, and they don&#8217;t tend to change very fast. In many ways, they represent a fairly easy problem set with which to work. As more and more data becomes key to research in a growing number of subject areas, the problems are set to become far larger and far more difficult. For individual universities even to consider replicating the process by which they all ended up with their repositories of text surely seems madness in this data-rich environment. Even with levels of uptake as low as those seen in too many text repositories, the issues of data management, curation, access and dissemination are too great to be sensibly solved in the institutional machine room. Services like <a href="http://infochimps.org/">InfoChimps</a> and Amazon&#8217;s own <a href="http://aws.amazon.com/publicdatasets/">Public Data Sets</a> offering show some of the ways that we might begin to work with data at scale. Might we, for example, come to recognise, as Amazon has, that it&#8217;s actually cheaper and quicker to entrust large data sets to FedEx than to transmit them over the Internet?</p>
<p>&#8216;The answer&#8217; might be some central service for the community, funded by JISC like the Arts &amp; Humanities Data Service (AHDS) of old. Or it might be something different, something nimbler, more responsive, more flexible to individual, institutional, and disciplinary requirements, and something more scalable to new disciplines; institutional support for and use of <em>existing</em> Cloud infrastructures extending far beyond UK Higher Education, aligned with a clear understanding of the separation between preservation and access.</p>
<p>I certainly don&#8217;t have all the answers, but I do believe that simply asking whether or not we should move existing repositories to the Cloud is to miss the point. Rather, we should ask what role the Cloud might play in addressing the business requirements to which the institutional repository was our initial – faltering – response. The answer might very well be &#8216;None,&#8217; but I doubt it.</p>
<p>I look forward to Tuesday&#8217;s discussion. I&#8217;m not going there to push my personal view that individual institutions frequently shouldn&#8217;t be building, running or populating their own repositories at all. I&#8217;m going there to facilitate the discussion those in the room want to have, and to learn from their experiences and their perspectives.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://scholarlykitchen.sspnet.org/2010/01/07/citation-advantage-for-mandated-open-access-articles/">Does a Citation Advantage Exist for Mandated Open Access Articles?</a> (scholarlykitchen.sspnet.org)</li>
<li class="zemanta-article-ul-li"><a href="http://hangingtogether.org/?p=770">Scholarly content and the cliff edge: the place of subject &#8216;repositories&#8217;</a> (hangingtogether.org)</li>
<li class="zemanta-article-ul-li"><a href="http://www.downes.ca/cgi-bin/page.cgi?post=51742">Scholarly Communications must be Scalable</a> (downes.ca)</li>
<li class="zemanta-article-ul-li"><a href="http://opendotdotdot.blogspot.com/2010/02/beyond-open-access-open-publishing.html">Beyond Open Access: Open Publishing</a> (opendotdotdot.blogspot.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.scienceblog.com/cms/57-college-presidents-declare-support-public-access-publicly-funded-research-us-25470.html" class="broken_link">57 college presidents declare support for public access to publicly funded research in the US</a> (scienceblog.com)</li>
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.guardian.co.uk/education/2010/feb/11/academics-in-aspic-says-mandelson&amp;a=12898526&amp;rid=f65ff066-66fd-42d9-bc76-113bd6066317&amp;e=5236f562a8baffa164e8623f52cd7d44">Mandelson says academics are &#8216;set in aspic&#8217;</a> (guardian.co.uk)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/02/repositories-in-the-cloud-why-on-earth-not/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Examining the Linked Data opportunity; the case of Higher Education</title>
		<link>http://cloudofdata.com/2009/08/examining-the-linked-data-opportunity-the-case-of-higher-education/</link>
		<comments>http://cloudofdata.com/2009/08/examining-the-linked-data-opportunity-the-case-of-higher-education/#comments</comments>
		<pubDate>Sun, 09 Aug 2009 15:48:14 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[BBC]]></category>
		<category><![CDATA[Higher Education]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[jisclinkeddata]]></category>
		<category><![CDATA[Joint Information Systems Committee]]></category>
		<category><![CDATA[Open Calais]]></category>
		<category><![CDATA[SemHE]]></category>
		<category><![CDATA[Thomson Reuters]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=751</guid>
		<description><![CDATA[Regardless of where you stand on some of the questions of detail with respect to the Linked Data meme, it&#8217;s clear that significant enthusiasm is being marshalled behind both the concept and the opportunities that it promises. Dion Hinchcliffe looks at some of the means by which enterprise data can be more [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 310px;">
<dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/Image:York_central_hall.jpg"><img title="The University of York's Central Hall, as ..." src="http://upload.wikimedia.org/wikipedia/commons/thumb/3/35/York_central_hall.jpg/300px-York_central_hall.jpg" alt="The University of York's Central Hall, as ..." width="300" height="200" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://commons.wikipedia.org/wiki/Image:York_central_hall.jpg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Regardless of where you stand on some of the <a href="http://cloudofdata.com/2009/07/does-linked-data-need-rdf/">questions</a> of <a href="http://cloudofdata.com/2009/07/more-linked-data-and-rdf/">detail</a> with respect to the Linked Data meme, it&#8217;s clear that significant enthusiasm is being marshalled behind both the concept and the opportunities that it promises.</p>
<p><a class="zem_slink" title="Dion Hinchcliffe" rel="blog" href="http://hinchcliffeandco.com">Dion Hinchcliffe</a> looks at some of the means by which enterprise data can be more visible on (and useful to) the Web in a <a href="http://blogs.zdnet.com/Hinchcliffe/?p=650">ZDNet post</a> this week. The &#8216;Semantic Web &amp; Linked Data&#8217; are included, and Dion writes:</p>
<blockquote><p>&#8220;By far the most sophisticated and complex of the three approaches to open data presented here, [Linked Data is] highly suitable for certain applications that have rich data sets that need powerful means of processing and consumption. In particular, scientific, technical, medical, mapping, and certain government domains are highly suitable for this approach. It remains unclear if Linked Data will finally trigger the boom in the Semantic Web so use with care. However, definite consideration should be applied, given the potential of the approach to create data sets with extraordinarily high function. Businesses already managing their data with Semantic Web technologies will be the most likely candidates for adoption.&#8221;</p></blockquote>
<p>We&#8217;re certainly seeing plenty of talk — and some interesting beginnings — in the Government domain, and organisations such as <a class="zem_slink freebase/guid/9202a8c04000641f800000000572e521" title="Reuters" rel="homepage" href="http://reuters.com">Thomson Reuters</a> and the <a class="zem_slink freebase/guid/9202a8c04000641f800000000000b122" title="BBC" rel="homepage" href="http://www.bbc.co.uk/">BBC</a> are also taking compelling steps around the periphery of their core businesses.</p>
<p>Education offers another interesting set of opportunities, and Jason Ohler&#8217;s <a href="http://www.educause.edu/EDUCAUSE+Quarterly/EDUCAUSEQuarterlyMagazineVolum/TheSemanticWebinEducation/163437">piece</a> in <em>Educause Quarterly</em> (and a related <a href="http://blogs.talis.com/education/2008/10/15/jason-ohler-talks-with-talis-about-education-and-the-semantic-web/">podcast</a> I recorded with him whilst I was still at <a class="zem_slink" title="Talis Platform" rel="homepage" href="http://www.talis.com/platform/">Talis</a>) illustrates one view of that opportunity.</p>
<p>Here in the UK, the <a href="http://blogs.talis.com/education/2008/10/15/jason-ohler-talks-with-talis-about-education-and-the-semantic-web/">Joint Information Systems Committee</a> (JISC) is beginning to take note. They first funded a review of &#8216;<a href="http://www.jisc.ac.uk/whatwedo/services/techwatch/reports/horizonscanning/hs0502.aspx">Semantic Web Technologies</a>&#8217; back in 2005, then revisited the topic with &#8216;<a href="http://www.jisc.ac.uk/whatwedo/projects/semantictechnologies.aspx">Semantic Technologies in Learning and Teaching</a>&#8217; (and a related <a href="http://www.semhe.org/">workshop</a> in the south of France later this year). I&#8217;ll be recording a podcast with the manager of that project, Thanassis Tiropanis, later this month.</p>
<p>JISC have also asked me to conduct a short piece of work to look specifically at the opportunity presented to the Higher Education community by Linked Data, and this work will run over the next few months. I&#8217;m certainly keen to learn about concrete examples, and to hear reasoned arguments for and against in order to submit comprehensive findings and recommendations. So if you have something to say, please do <a href="http://cloudofdata.com/contact/">get in touch</a>.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://blog.aldobucchi.com/2009/07/web3-and-enterprise-linked-data.html">Web3 and Enterprise Linked Data. The Middleware Revolution</a> (aldobucchi.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.johnbreslin.com/blog/2009/06/24/open-government-and-linked-data-now-its-time-to-draft/">Open government and Linked Data; now it&#8217;s time to draft&#8230;</a> (johnbreslin.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/interview_with_tim_berners-lee_part_1.php">ReadWriteWeb Interview With Tim Berners-Lee, Part 1: Linked Data</a> (readwriteweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/linked_data_is_blooming_why_you_should_care.php">Linked Data is Blooming: Why You Should Care</a> (readwriteweb.com)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2009/08/examining-the-linked-data-opportunity-the-case-of-higher-education/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

