<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data</title>
	<atom:link href="http://cloudofdata.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Cloud Computing, Big Data, Linked Data, Open Data, more</description>
	<lastBuildDate>Wed, 15 May 2013 15:46:44 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Big Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud, Computing, Semantic, Web, Linked, Data, Open, Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.zippykid.netdna-cdn.com/logo300x300.jpg" />
		<item>
		<title>Unpicking the multi-cloud at GigaOM Structure</title>
		<link>http://cloudofdata.com/2013/05/unpicking-the-multi-cloud-at-gigaom-structure/</link>
		<comments>http://cloudofdata.com/2013/05/unpicking-the-multi-cloud-at-gigaom-structure/#comments</comments>
		<pubDate>Wed, 15 May 2013 15:46:43 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[IaaS]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Ben Kepes]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[david linthicum]]></category>
		<category><![CDATA[GigaOM]]></category>
		<category><![CDATA[GigaOM Pro]]></category>
		<category><![CDATA[gigaom research]]></category>
		<category><![CDATA[hybrid cloud]]></category>
		<category><![CDATA[jo maitland]]></category>
		<category><![CDATA[mapping session]]></category>
		<category><![CDATA[mission bay]]></category>
		<category><![CDATA[multi-cloud]]></category>
		<category><![CDATA[multicloud]]></category>
		<category><![CDATA[private cloud]]></category>
		<category><![CDATA[public cloud]]></category>
		<category><![CDATA[Rightscale]]></category>
		<category><![CDATA[San Francisco]]></category>
		<category><![CDATA[structure]]></category>
		<category><![CDATA[structure 2013]]></category>
		<category><![CDATA[structureconf]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3375</guid>
		<description><![CDATA[Last month, RightScale&#8217;s State of the Cloud report got me thinking about the rise of multi-cloud solutions. Next month, I&#8217;ll be moderating a Mapping Session at GigaOM&#8217;s Structure event to work ...]]></description>
				<content:encoded><![CDATA[<div id="attachment_3379" class="wp-caption alignright" style="width: 310px"><a href="http://www.acc-missionbayconferencecenter.com/"><img class="size-medium wp-image-3379 " src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/Building_Gallery_5-300x225.jpg" alt="Image © Mission Bay Conference Center" width="300" height="225" /></a><p class="wp-caption-text">Image © Mission Bay Conference Center</p></div>
<p>Last month, <a href="http://cloudofdata.com/2013/04/survey-lifts-covers-on-cloud-promiscuity-good-thing-bad-thing-or-who-cares/">RightScale&#8217;s <em>State of the Cloud</em> report got me thinking about the rise of multi-cloud solutions</a>. Next month, I&#8217;ll be moderating <a href="http://structure2013mappingsession.eventbrite.com/">a Mapping Session at GigaOM&#8217;s Structure event</a> to work out how, where, when, why and if this trend is going to prove significant.</p>
<p>Hybrid clouds, in which one public cloud and one private cloud are used together, are becoming increasingly common solutions to a range of business challenges. RightScale&#8217;s figures suggest growing interest in something more complex and, potentially, more interesting; <em>multi-cloud</em>. In a multi-cloud arrangement, customers build solutions combining one or more public clouds with one or more private clouds. This has the potential to significantly increase complexity, without necessarily delivering a comparable increase in value.</p>
<p><a href="http://cloudofdata.com/2013/04/survey-lifts-covers-on-cloud-promiscuity-good-thing-bad-thing-or-who-cares/">In my post last month, I suggested that many of these multi-cloud deployments were essentially accidental</a>. A quick email exchange with RightScale shows that their survey respondents would appear to disagree; multi-cloud, for them, is a conscious business decision.</p>
<p>I&#8217;m intrigued, and so were GigaOM. So we&#8217;re putting on a <a href="http://structure2013mappingsession.eventbrite.com/">Mapping Session</a> at next month&#8217;s <a href="http://event.gigaom.com/structure/">Structure conference</a> to explore the issue further. I&#8217;ll be joined at the front of the room by fellow <a href="http://pro.gigaom.com/analysts/">GigaOM Pro Analysts</a> <a class="zem_slink" title="Diversity Limited" href="http://diversity.net.nz" rel="homepage" target="_blank">Ben Kepes</a> and <a href="http://www.davidlinthicum.com/">David Linthicum</a>, as well as GigaOM Research Director <a href="http://event.gigaom.com/structure/speakers/#jo_maitland">Jo Maitland</a>.</p>
<p>A GigaOM Mapping Session is more like a workshop than a regular panel. Although there are analysts at the front of the room, they&#8217;re really there to stimulate and guide a conversation with every single person in the room. We don&#8217;t have all the answers. We&#8217;re there to explore the topic, and to work out what — if anything — it might mean. The perspectives of customers, practitioners, suppliers and investors are an integral part of the process. When it works well, everyone comes away with a broader perspective than when they entered the room. Ideas are born, perceptions are sharpened, the germs of deals are done, and directions for future GigaOM research are painted. Mapping Sessions are free for registered Structure attendees, but <a href="http://structure2013mappingsession.eventbrite.com/">separate registration is required</a>. Numbers are capped, to ensure plenty of opportunity for discussion. If you&#8217;ve got perspectives to share, <a href="http://structure2013mappingsession.eventbrite.com/">please do register</a> to join us on 20 June in San Francisco.</p>
<p>Multi-cloud clearly throws up a host of intriguing issues for cloud vendors, their customers, and the ecosystem of cloud management-type providers (like survey author RightScale). What, if anything, do cloud vendors need to do in order to encourage, support, or hinder multi-cloud adoption? What&#8217;s the value proposition behind multi-cloud for customers, and what do the providers of cloud management services need to do in order to capitalise upon an emerging trend? Is this a long-term trend, or a short-lived opportunity?</p>
<p>Many of those stakeholders will be in the room, speaking, listening, and interacting. Do join us, and them.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=d6634700-2590-4cad-a4a1-3cd58a9c76c0" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/05/unpicking-the-multi-cloud-at-gigaom-structure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Discussing Virtual Machine interoperability with the Open Data Center Alliance</title>
		<link>http://cloudofdata.com/2013/05/discussing-virtual-machine-interoperability-with-the-open-data-center-alliance/</link>
		<comments>http://cloudofdata.com/2013/05/discussing-virtual-machine-interoperability-with-the-open-data-center-alliance/#comments</comments>
		<pubDate>Wed, 15 May 2013 14:49:09 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[IaaS]]></category>
		<category><![CDATA[Data center]]></category>
		<category><![CDATA[data centre]]></category>
		<category><![CDATA[forecast13]]></category>
		<category><![CDATA[forecast2013]]></category>
		<category><![CDATA[hypervisor]]></category>
		<category><![CDATA[interoperability]]></category>
		<category><![CDATA[ODCA]]></category>
		<category><![CDATA[Open Data Center Alliance]]></category>
		<category><![CDATA[Virtual machine]]></category>
		<category><![CDATA[virtual machine interoperability]]></category>
		<category><![CDATA[vm]]></category>
		<category><![CDATA[vm interoperability]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3263</guid>
		<description><![CDATA[The Open Data Center Alliance (ODCA) is holding its Forecast event in San Francisco in June, and I&#8217;ve been invited to moderate the panel discussing Virtual Machine Interoperability. As moderator, I&#8217;ll ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/34128007@N04/4342708059/" target="_blank"><img class="alignleft size-medium wp-image-3341" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/4342708059_ced54c886b_b-300x161.jpg" alt="4342708059_ced54c886b_b" width="300" height="161" /></a>The <a href="http://www.opendatacenteralliance.org/">Open Data Center Alliance</a> (ODCA) is holding its <a href="http://www.opendatacenteralliance.org/forecast2013">Forecast</a> event in San Francisco in June, and I&#8217;ve been invited to moderate the panel discussing Virtual Machine Interoperability. As moderator, I&#8217;ll be far more interested in facilitating insights from panel and audience than in wittering on about what <em>I</em> think, so I wanted to use this blog post to begin getting some of the issues clear in my mind. What is VM interoperability, and why does it matter?</p>
<p>From time to time, I write about <a href="http://cloudofdata.com/tag/open-data/">Open Data</a>. This has nothing to do with that. The Open Data Center Alliance is interested in <em>data centres</em>, not <em>data</em>. The Alliance was established back in 2010 with Intel driving things forward, and now <a href="http://www.opendatacenteralliance.org/membership">claims over 300 member organisations</a>, including the likes of BMW, Lockheed Martin, Microsoft, Deutsche Bank and Marriott Hotels.</p>
<p>According to <a href="http://www.opendatacenteralliance.org/aboutus">the ODCA website</a>,</p>
<blockquote><p>we came together to deliver a unified voice for emerging data center and cloud computing requirements. Our mission is to speed the migration to cloud computing by enabling the solution and service ecosystem to address IT requirements with the highest level of interoperability and standards.</p></blockquote>
<p>Much of the Alliance&#8217;s work involves identifying customer requirements and capturing these in a series of <a href="http://www.opendatacenteralliance.org/ourwork/usagemodels">usage models</a>. In theory, prospective customers can modify these usage models in defining their own requirements, and suppliers can tap new business by aligning their offerings to the models. I&#8217;ve not seen much evidence that this is happening at scale yet, but the Alliance site does <a href="http://www.opendatacenteralliance.org/ourwork/usagemodels">state</a> that</p>
<blockquote><p>we anticipate quick industry response to the requirements, initial POCs of solutions beginning in 2012. This could accelerate over $50 billion in cloud service investments and is expected to save $25 billion through IT operational efficiency due to cloud adoption.</p></blockquote>
<p>One of those usage models is concerned with <a href="http://www.opendatacenteralliance.org/docs/Virtual_Machine_(VM)_Interoperability_in_a_Hybrid_Cloud_Environment_Rev1.2.pdf">Virtual Machine Interoperability in a Hybrid Cloud Environment</a> (pdf), and last month it was augmented by the release of a <a href="http://www.opendatacenteralliance.org/docs/VM_Interop_PoC_White_Paper.pdf">Proof of Concept document</a> (pdf) which</p>
<blockquote><p>outlines testing criteria and procedures for documenting how hypervisor and VM solutions from both ODCA members and non-members interoperated in real-world enterprise cloud scenarios.</p></blockquote>
<p><a href="http://www.opendatacenteralliance.org/newsroom/mediaresources/drivingvminteroperability">Quoted in an April press release</a> to mark publication of the PoC document, ODCA executive director Marvin Wheeler commented that</p>
<blockquote><p>true interoperability of hypervisor and virtual machine solutions between clouds is critical to advancing enterprise ready cloud implementations. We encourage global IT managers and cloud solution and service providers to join us at the Forecast 2013 VM interoperability panel on June 17 to take part in the collaborative discussions that will help drive the next phase of VM interoperability in the enterprise cloud.</p></blockquote>
<p>The basic premise is a simple one; a <a href="http://en.wikipedia.org/wiki/Virtual_machine">virtual machine</a> started on one <a href="http://en.wikipedia.org/wiki/Hypervisor">hypervisor</a> or class of physical server should run equally well when moved to run on a different hypervisor or physical server. Further, there is a presumption that there is a credible business requirement for this capability. As enterprise users of cloud increasingly find themselves adopting a hybrid approach spanning their own data centre, co-location facilities, hosting sites and diverse public clouds, the likelihood of them needing to run their standard Windows or Linux virtual machines on top of more than one hypervisor certainly increases. It&#8217;s less clear, though, that an inability to reliably move a Linux VM from the KVM hypervisor in your own data centre to the Xen hypervisor at a cloud provider is causing significant business pain today; it&#8217;s often reasonably straightforward, for example, to simply select a different cloud provider able to support your chosen KVM hypervisor.</p>
<p>But even if a lack of VM interoperability isn&#8217;t presenting an <em>insurmountable</em> barrier to business right now, it&#8217;s still one more thing to think about when pulling a set of disparate services together. If we can cost-effectively and pragmatically remove or diminish its ability to complicate, then that&#8217;s presumably a good thing.</p>
<p>ODCA&#8217;s PoC work took the usage requirements the organisation had already defined, and applied them to a specific set of documented tests. As IDG&#8217;s Joab Jackson notes in <a href="http://www.infoworld.com/d/cloud-computing/virtual-machines-inching-toward-portability-217331">his InfoWorld piece</a>, the results were not great.</p>
<blockquote><p>Running through all the different possible combinations of hypervisors and OSes, the researchers found that 13 test cases resulted in warnings, and 19 test cases failed entirely. Only in two cases did the VM work flawlessly across two different hypervisors. In both of these cases, a VM created with Xen worked without troubles on a Microsoft hyper-V environment — in one case running Ubuntu and in the other case running Windows Server.</p></blockquote>
<p>The researchers do note that they set the bar for success pretty high, and that several of the &#8216;warning&#8217; states would still result in a functional VM. It does appear clear, though, that customers with a real need for multi-VM interoperability face significant challenges in efficiently managing workloads across different virtualisation infrastructures.</p>
<p>And that brings us to the panel at Forecast, which can hopefully help quantify the true scale of the problem&#8230; and indicate some of the ways in which vendors are working to fix the broken bits. In terms of the panel discussion itself, I look forward to delving into at least the following:</p>
<ul>
<li>how big a problem is the current state of VM interoperability?</li>
<li>do the ODCA VM Interoperability use cases prioritise the right things?</li>
<li>what do customers really need, and are they getting it?</li>
<li>how are vendors really responding to requirements such as those defined by ODCA?</li>
</ul>
<p>If you have opinions to share, perspectives to offer, facts with which to illuminate, or questions to ask, I do hope you&#8217;ll join us.</p>
<p>ODCA has given me a couple of free passes to Forecast. If you want to come along <em>and we know one another</em>, <a href="http://cloudofdata.com/contact/">then please do get in touch</a>.</p>
<p><em>Note: ODCA is covering my flight to San Francisco, and accommodation during the event.</em></p>
<p><em><a href="http://www.flickr.com/photos/34128007@N04/4342708059/">Image</a> of event venue, San Francisco&#8217;s Westin St. Francis, by Flickr user &#8216;<a href="http://www.flickr.com/people/34128007@N04/">prayitno</a>&#8216;</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.datacenterknowledge.com/archives/2013/04/26/hypervisor-interoperability-and-you-odca-tackles-hypervisors-in-new-report/" target="_blank">Open Data Center Alliance Tackles Cloud Lock-In</a> (datacenterknowledge.com)</li>
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.infoworld.com/d/cloud-computing/virtual-machines-inching-toward-portability-217331&amp;a=163529860&amp;rid=106fc7a1-73b2-440c-9f94-295813fadbea&amp;e=4776fe8ba9daff59e4af7d3e968ae7cd" target="_blank">Virtual machines inching toward portability</a> (infoworld.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.cloudave.com/28504/5-key-essentials-of-cloud-workloads-migration/" target="_blank">5 Key Essentials of Cloud Workloads Migration</a> (cloudave.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=106fc7a1-73b2-440c-9f94-295813fadbea" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/05/discussing-virtual-machine-interoperability-with-the-open-data-center-alliance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Getting it right with data attribution</title>
		<link>http://cloudofdata.com/2013/05/getting-it-right-with-data-attribution/</link>
		<comments>http://cloudofdata.com/2013/05/getting-it-right-with-data-attribution/#comments</comments>
		<pubDate>Fri, 10 May 2013 16:33:55 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[attribution]]></category>
		<category><![CDATA[citation]]></category>
		<category><![CDATA[data attribution]]></category>
		<category><![CDATA[data citation]]></category>
		<category><![CDATA[data commons]]></category>
		<category><![CDATA[data licence]]></category>
		<category><![CDATA[data license]]></category>
		<category><![CDATA[data market]]></category>
		<category><![CDATA[data reuse]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[data use]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Dataset]]></category>
		<category><![CDATA[Leigh Dodds]]></category>
		<category><![CDATA[pragmatism]]></category>
		<category><![CDATA[Terms & Conditions]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3259</guid>
		<description><![CDATA[There have always, it seems, been people for whom attribution and citation really matter. Some of them passionately engage in arguments that last months or years, debating the merits of ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/colinsite/3997333611/" target="_blank"><img class="alignleft size-medium wp-image-3303" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/3997333611_2565fc9a4d_b-300x225.jpg" alt="3997333611_2565fc9a4d_b" width="300" height="225" /></a>There have always, it seems, been people for whom attribution and citation <em>really</em> matter. Some of them passionately engage in arguments that last months or years, debating the merits of comma placement in written citations for the work of others. Bizarre, right?</p>
<p>But, as we all become increasingly dependent upon data sourced from third parties, aspects of this rather esoteric pastime are beginning to matter to a far broader audience. Products, recommendations, decisions and entire businesses are being constructed on top of data sourced from trusted partners, from new data brokers, from crowdsourced communities, or simply plucked from across the open web. Without an understanding of where that data came from, and how it was collected, interpreted or maintained, all of those products, recommendations, decisions and businesses stand upon very shaky foundations indeed.</p>
<p>Data attribution is increasingly important, but it will be essential to make sure that the rules, tools and norms which emerge are both lightweight and pragmatic. Now is not the time to get heavy-handed and pedantic about where the comma goes.</p>
<p>Former colleague <a title="https://twitter.com/ldodds" href="https://twitter.com/ldodds">Leigh Dodds</a> recently offered a useful discussion of the <a title="http://blog.ldodds.com/2013/04/30/how-do-we-attribute-data/" href="http://blog.ldodds.com/2013/04/30/how-do-we-attribute-data/">rationale behind data attribution</a>. Early on, he describes the related (and, often, sloppily interchangeable) notions of <em>attribution</em> and <em>citation</em>;</p>
<blockquote><p>It might also be useful to distinguish between:</p>
<ul>
<li><strong>Attribution</strong> — highlighting the creator/publisher of some data to acknowledge their efforts, conferring reputation</li>
<li><strong>Citation</strong> — providing a link or reference to the data itself, in order to communicate provenance or drive discovery</li>
</ul>
</blockquote>
<p>This distinction is important in some circumstances, but it can also be useful to consider a simpler, more selfish, but ultimately more scalable justification. Attribution (and citation) of data quite simply provides an audit trail, enabling you, your bosses, your investors, or your customers, to know more about the data upon which actions are based.</p>
<h2>Creators want credit, consumers want to trust</h2>
<p>Much of the serious consideration of attribution comes from a relatively small cadre of data <em>owners</em> or creators, who (understandably, perhaps) want credit for their hard work. Perhaps they want to prove use in order to secure future funding or advancement, or perhaps they simply want to track where their data ends up. Through a series of licenses, contracts and Terms &amp; Conditions statements, these creators have done much in codifying the ways that data should be referred to. Leigh discusses some of the licensing terms in his post, but it&#8217;s probably fair to say that none of them have really caught on outside a few rather narrowly scoped groups of co-dependent developers and data providers.</p>
<p>Data owners&#8217; requirements for credit run the gamut, from loosely phrased requests for a link back to their website all the way past this rather excessive example quoted by Leigh to end up as lengthy tomes of draconian legalese;</p>
<blockquote><p>The attribution must be no smaller than 70% of the size of the largest bit of information used, or 7px, whichever is larger. If you are making the information available via your own API you need to make sure your users comply with all these conditions.</p></blockquote>
<p>All too often, that sort of self-defeating prescription is enough to send prospective users back to Google, in search of a less demanding alternative.</p>
<p>For consumers of data, or for those wondering where the data behind a product or decision came from, things are rather simpler. On the whole, those on this side of the divide are simply looking for a pointer which enables them to learn more. Is my company&#8217;s multi-million dollar change of direction based upon detailed data from stable Governments, credible banks and respected analysts, or did the person responsible use some numbers the found on their friend&#8217;s blog?</p>
<p>Carefully crafted rules regarding attribution&#8217;s wording, placement, colour, size and typeface are an irrelevance, probably deserving to be ignored or ridiculed.</p>
<p>Far better, and far more likely of success, to simply encourage users and re-users of data to sensibly point back (however they like) to their principal sources.</p>
<h2>Remembering all the ancestors is a bit daft</h2>
<p>Data set A is modified and added to in order to create data set A1. Data set A1 is modified and added to in order to create data set A2. Data set B is modified and added to in order to create data set B1. Data set B1 is modified and added to in order to create data set B2. Data Set C modifies and extends data sets A2 and B2. It seems reasonable to acknowledge the contribution made to C by A2 and B2, but some would argue (loudly) that A, B, A1 and B1 also need to be acknowledged in C. This is one aspect of &#8216;attribution stacking&#8217;, and attribution stacking is, quite simply, stupid.</p>
<p>If I am the creator of data set C, I am selecting A2 and B2 because they are the right data sets for my purpose. That selection will be based upon a range of criteria, including the scope and coverage of the data. The selection will also be based upon my impression of the brands responsible for A2 and B2, and that impression (implicitly or explicitly) will include some awareness of the processes they use to select, validate and manage the data <em>they</em> use. It&#8217;s for <em>them</em> to carefully select, validate and provide attribution for A1 and B1, not for me. And it&#8217;s A1 and B1&#8242;s job to do the same for A and B, not me.</p>
<p>Things get even worse in some open source data projects, where all the individual contributors expect to be acknowledged. <em>Inside</em> the project (and on its website, etc), that&#8217;s fine and sensible. Outside, though? It&#8217;s ridiculous. So if data set A were created by individuals Aa, Ab, Ac, Ad and all their friends right up to Az, under some licenses there would be an expectation that every single one of those individuals be acknowledged by name in any mention of data sets A, A1, A2 or C. A massive administrative burden for any downstream users of the data set, and of no real benefit to anyone whatsoever. This desire for glory really does need to be challenged, if it is not to stifle free and fair downstream use and reuse of the data. Within the project building A, it may be vital to know that user Aa is a bit sloppy, or that user Ad has a nasty habit of making the data say what she thinks it <em>should</em> rather than what it actually <em>does</em>. But it is the responsibility of the project behind A to put processes and procedures in place to address these issues, and to ensure that all of its participants receive appropriate credit within the project for their contribution. By the time we reach A1 or A2, though, those internal details no longer matter. A1 chose to use A because those processes <em>exist</em>. After an initial evaluation of those processes and their implementation, A1 can — and should — simply trust them, rather than endlessly second-guessing them.</p>
<h2>Tracking and Trust are different</h2>
<p>Ultimately, the motivations of data creators and data re-users are very different. The processes and procedures put in place by creators and owners in search of kudos or statistics may actively obstruct the use and reuse that they profess to want. Complex forms of attribution, aligned to heavy-handed enforcement of infringements, do nothing to encourage a far broader community of use to emerge.</p>
<p>By attempting to count — and manage — the small number of uses today, data creators are stifling growth that otherwise is ready to explode. A perfect example of the saying (which may not translate beyond Britain&#8217;s shores!) of &#8216;biting off your nose to spite your face.&#8217; Think about it&#8230; <img src='http://cloudofdata.zippykid.netdna-cdn.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p>Leigh ends his post with</p>
<blockquote><p>Attribution should be a social norm that we encourage, strongly, in order to acknowledge the sources of our Open Data.</p></blockquote>
<p>Other than broadening it from &#8216;Open Data&#8217; to just &#8216;data,&#8217; I couldn&#8217;t agree more. But let&#8217;s keep it lightweight, simple, and pragmatic.</p>
<p><strong>Note:</strong> or perhaps this post should have been called &#8220;When you stand on a giant&#8217;s shoulders, it&#8217;s a good idea to say thank you.&#8221;</p>
<p><em><a href="http://www.flickr.com/photos/colinsite/3997333611/">Image</a> of <a class="zem_slink" title="Eduardo Paolozzi" href="http://en.wikipedia.org/wiki/Eduardo_Paolozzi" rel="wikipedia" target="_blank">Eduardo Paolozzi</a>&#8216;s sculpture of <a class="zem_slink" title="Isaac Newton" href="http://en.wikipedia.org/wiki/Isaac_Newton" rel="wikipedia" target="_blank">Sir Isaac Newton</a> by <a class="zem_slink" title="Flickr" href="http://flickr.com" rel="homepage" target="_blank">Flickr</a> user &#8216;<a href="http://www.flickr.com/people/colinsite/">monkeywing</a>&#8216;</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://blog.ldodds.com/2013/04/30/how-do-we-attribute-data/" target="_blank">How Do We Attribute Data?</a> (ldodds.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=d5dfb0ba-c1ae-440d-8e33-43ba274d4587" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/05/getting-it-right-with-data-attribution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Seeking Simplicity&#8217;s Sweet Spot</title>
		<link>http://cloudofdata.com/2013/05/seeking-simplicitys-sweet-spot/</link>
		<comments>http://cloudofdata.com/2013/05/seeking-simplicitys-sweet-spot/#comments</comments>
		<pubDate>Thu, 09 May 2013 14:10:54 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Albert Einstein]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[Datahero]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[User interface]]></category>
		<category><![CDATA[visualisation]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3247</guid>
		<description><![CDATA[
Albert Einstein, you may have heard, was a clever man. He scribbled equations on blackboards, thought big thoughts, and all of that. But, allegedly, he also said
Everything should be made ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/File:Albert_Einstein_Head.jpg" target="_blank"><img class="size-medium wp-image-3271   alignleft" style="margin: 6px; border: 0px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/Albert_Einstein_Head-230x300.jpg" alt="Albert Einstein. Image © 1947." width="230" height="300" /></a></p>
<p>Albert Einstein, you may have heard, was a clever man. He scribbled equations on blackboards, thought big thoughts, and all of that. But, <a href="http://quoteinvestigator.com/2011/05/13/einstein-simple/">allegedly</a>, he also said</p>
<blockquote><p>Everything should be made as simple as possible, but not simpler.</p></blockquote>
<p>These words have resonated with me recently, as I&#8217;ve heard pitches from one company after another, all of which are trying to cut through the complexity of data to make it accessible. Their goals appear laudable, but all too often I find myself wondering how simple this stuff can be? If we make it too simple, do we run the risk of unleashing a flood of half-baked &#8216;analysis,&#8217; undertaken by people who really shouldn&#8217;t be allowed near a calculator, let alone a Hadoop cluster? On the other hand there&#8217;s a persuasive argument to be made for democratising access to data and tools, freeing organisations from over-reliance upon their new High Priests of Data.</p>
<p>Every data question should not require a <a class="zem_slink" title="Data science" href="http://en.wikipedia.org/wiki/Data_science" rel="wikipedia" target="_blank">data scientist</a>, but maybe we really shouldn&#8217;t be making it too easy for people to tackle the hard questions <em>without</em> support from someone who knows what they&#8217;re doing.</p>
<p>How simple, then, is too simple? And can we use data in a different way; in order to offer enough simplicity, smartly?</p>
<p>One company which has tried to do that in an interesting way is <a href="http://datahero.com/">Datahero</a>, and I spoke with co-founder Chris Neumann this week to learn some more.</p>
<p>There&#8217;s been some good coverage of the company over the past few weeks, and if you&#8217;ve not come across them before then it&#8217;s worth taking a skim through the Related articles at the end of this piece. I won&#8217;t bother repeating those pieces here.</p>
<p>The thing that interested me — other than Chris&#8217; obvious passion and enthusiasm for his subject — was the way in which Datahero plans to use a mix of data analysis, user experience design and machine learning in order to <em>guide</em> the user toward analyses and visualisations that are likely to be of use to them. Chris is quick to stress that the company isn&#8217;t looking too closely at the data <em>values</em> its customers upload. Instead, the system studies the structure of the data (this column contains dates, this column contains place names, etc) and any associated metadata in order to make recommendations for logical ways to visualise the dataset.</p>
<p>As the number of users grows, the roadmap also includes Amazon-style recommendations. If a lot of other people uploading their quarterly sales forecast graph it in a certain way, then it makes sense to recommend that type of graph to a new user who uploads data with a similar structure. The recommendation won&#8217;t always be right, but it should go a long way toward minimising the fear of staring at columns and columns of data without a good idea about where to turn first.</p>
<p>Smart recommendations, clever algorithms, and an engaging UI will not — on their own — turn everyone into a data scientist. But nor, really, should they. What they can do, though, is what Neumann described as &#8220;enabling the 99%;&#8221; those people who have a dump of data from Mailchimp or Google Analytics or Salesforce or Excel, and who don&#8217;t really know how to begin sensibly visualising the multitude of columns and rows of numbers.</p>
<p>It&#8217;s an intriguing idea, and it will be interesting to see how successfully the system can deliver value to a potentially huge pool of individuals who find themselves data-rich, but skills-poor.</p>
<p>Even if technically successful, of course, the challenge will be persuading those same users to continue paying for the service. If they think they&#8217;ll use it often enough to pay for, won&#8217;t they end up acquiring enough of the skills to work with their own data anyway? Unless, of course, Datahero manages to grow and add complexity along with its users&#8230;</p>
<p><em><a href="http://en.wikipedia.org/wiki/File:Albert_Einstein_Head.jpg">Image</a> of Albert Einstein, ©1947. Image sourced from the Library of Congress&#8217; Prints and Photographs division with identifier cph.3b46036. Shared on Wikipedia, and deemed to be in the Public Domain.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2013/05/01/datahero-turns-data-into-rich-visuals-without-the-need-for-a-data-analyst/" target="_blank">Datahero Turns Data Into Rich Visuals Without The Need For A Data Analyst</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2013/04/23/data-vis-for-the-99-percent-datahero-launches-its-free-service/" target="_blank">&#8216;Data vis for the 99 percent&#8217;: DataHero launches its free service</a> (venturebeat.com)</li>
<li class="zemanta-article-ul-li"><a href="http://thenextweb.com/apps/2013/04/23/datahero-launch/" target="_blank">Datahero launches to the public, making genius-level data science simple, automated and free</a> (thenextweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://gigaom.com/2013/04/23/visualization-startup-datahero-opens-its-doors-and-delivers-data-analysis-for-the-masses/" target="_blank">Visualization startup Datahero opens its doors and delivers data analysis for the masses [GigaOM]</a> (gigaom.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=fe698f7d-d371-485d-944f-01f163ed5d93" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/05/seeking-simplicitys-sweet-spot/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Find the data, aggregate the data, make the data useful</title>
		<link>http://cloudofdata.com/2013/05/find-the-data-aggregate-the-data-make-the-data-useful/</link>
		<comments>http://cloudofdata.com/2013/05/find-the-data-aggregate-the-data-make-the-data-useful/#comments</comments>
		<pubDate>Fri, 03 May 2013 19:12:55 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data markets]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[data aggregation]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[dataconf]]></category>
		<category><![CDATA[enigma]]></category>
		<category><![CDATA[enigma.io]]></category>
		<category><![CDATA[GigaOM]]></category>
		<category><![CDATA[New York City]]></category>
		<category><![CDATA[startup]]></category>
		<category><![CDATA[structureconf]]></category>
		<category><![CDATA[TechCrunch]]></category>
		<category><![CDATA[TechCrunch Disrupt]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3209</guid>
		<description><![CDATA[I was in New York in March, taking part in GigaOM&#8217;s Structure:Data event. As usual on these trips, I spent the day before the event walking around the city, soaking ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/zoonabar/4340886441/" target="_blank"><img class="alignleft size-medium wp-image-3221" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/4340886441_b340faeef1_b-200x300.jpg" alt="4340886441_b340faeef1_b" width="200" height="300" /></a>I was in New York in March, taking part in GigaOM&#8217;s <a href="http://event.gigaom.com/structuredata/">Structure:Data</a> event. As usual on these trips, I spent the day before the event walking around the city, soaking up some air, getting rained on, using coffee to stay awake, and meeting with a number of local companies. Of the companies I met that day, one stood out. And this week, that same company was recognised by others <a href="http://techcrunch.com/2013/05/01/and-the-winner-of-techcrunch-disrupt-ny-2013-is-enigma/">when it won TechCrunch Disrupt NY</a>. That company was <a href="http://enigma.io/">Enigma</a>.</p>
<p>Enigma pulls data from tens of thousands of public data sets, and then offers up an interface that makes it pretty straightforward to trawl through the whole lot in search of the data points that you actually need. As the company&#8217;s Marc DaCosta introduced it, a &#8220;search and discovery platform for public data.&#8221;</p>
<p>At present, everything in Enigma is publicly available data. It&#8217;s mostly from the USA right now, and is acquired by a combination of screen scraping/ crawling of .gov sites&#8230; and calling government agencies to request that they ship CDs of offline material. DaCosta stresses that the data is all — theoretically — available to any US citizen, but that it&#8217;s not really that easy for them to access. Even with the advent of central sites like data.gov, primary data remains spread across a multitude of portals and web pages, stored in a dizzying array of machine readable formats&#8230; and (all too often) as PDFs full of tabular data. DaCosta finishes his preamble by stressing the need for</p>
<blockquote><p>infrastructure to acquire, index and search public data.</p></blockquote>
<p>The team&#8217;s Disrupt pitch and the subsequent back and forth with the judges provides a pretty good overview. It is worth 12 minutes of your time to <a href="http://techcrunch.com/2013/05/01/and-the-winner-of-techcrunch-disrupt-ny-2013-is-enigma/">watch over on TechCrunch</a>.</p>
<p>During the Q&amp;A, Freestyle Capital&#8217;s Dave Samuel raised the same concern I did during my meeting with the Enigma team. The set of possibilities with public data is extremely large, but still finite. Far more so than Google, there are real limitations to the sorts of questions that it makes sense to ask. Presented with a search box, users struggle to understand what&#8217;s <em>possible</em> and what&#8217;s <em>sensible</em>. Simple tricks would go a long way toward helping here, such as highlighting today&#8217;s most popular queries, or providing sets of sample queries as Wolfram Alpha does. Users need to learn how to work with aggregations of data such as this, and the onus is on sites like Enigma to help grow their potential market through lightweight and accessible education.</p>
<p>In many ways, though, I think the biggest opportunity for Enigma doesn&#8217;t lie in their website at all. The real opportunity lies in their api, and in licensing it to third parties who will construct entirely new Enigma-powered vertical applications or integrate Enigma data into existing software such as an investor&#8217;s due diligence systems.</p>
<p>In and of itself, the data Enigma has gathered is interesting, but neither indispensable nor unique. The real value comes from offering easy, reliable, comprehensive and cost-effective means to get this data integrated into existing workflows, and that&#8217;s where the api needs to shine. There&#8217;s also an opportunity to encourage/enable companies to upload their own data into the platform, letting them combine it with the public data already there. This is, apparently, on the road map. I would imagine that this corporate data will initially only be available to the company that owns it, but there are a whole set of other opportunities around sharing data with supply chain partners, and even making it available to anyone.</p>
<p>Nice (and fast!) as the website is once you understand how it works, I tend to see it far more as a shop window than as a revenue-generating service in its own right. The Enigma team disagrees, seeing subscription-based access to the website as one of their two main products. We shall see who proves more right in due course!</p>
<p>Finally, for now, the company faces a real challenge in scaling. It needs to pull in more data from inside the US, and it needs to broaden its coverage <em>outside</em> the US. Its data acquisition processes, although highly automated, remain pretty labour intensive. And its data teams need to <em>understand</em> the data they&#8217;re working with. Some government data is rigorously documented, whereas other data sets are almost unintelligible. Growth, as the Enigma team recognises all too well, requires far more than simply pointing their crawlers at some new web domains.</p>
<p>March and April were busy with travel. May&#8217;s pretty quiet on the travel front, with plenty of opportunity to get some proper work done before heading to Brussels at the end of the month. The next trip States-side looks like being to San Francisco in June. I wonder who I&#8217;ll meet there&#8230; and what <em>they&#8217;ll</em> go on to win?</p>
<p><em><a href="http://www.flickr.com/photos/zoonabar/4340886441/">Picture</a> of the Enigma cryptographic machine by Flickr user <a href="http://www.flickr.com/people/zoonabar/">Chris Brown</a>.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2013/05/01/and-the-winner-of-techcrunch-disrupt-ny-2013-is-enigma/" target="_blank">And The Winner Of TechCrunch Disrupt NY 2013 Is&#8230; Enigma!</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.businessinsider.com/techcrunch-disrupt-winner-enigma-2013-5" target="_blank">This TechCrunch Disrupt Winner Could Be The Future Of Search</a> (businessinsider.com)</li>
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2013/05/01/enigma-brings-the-deep-dark-world-of-public-data-to-light/" target="_blank">Enigma brings the deep, dark world of public data to light</a> (venturebeat.com)</li>
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2013/04/30/enigma-makes-unearthing-and-sifting-through-public-data-a-breeze/" target="_blank">Enigma Makes Unearthing And Sifting Through Public Data A Breeze</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2013/05/01/beating-out-tough-competition-google-for-public-data-enigma-wins-disrupt-ny/" target="_blank">Beating out tough competition, &#8216;Google for public data&#8217; Enigma wins Disrupt NY</a> (venturebeat.com)</li>
<li class="zemanta-article-ul-li"><a href="http://nibletz.com/2013/05/02/enigma-reinvents-public-data-wins-techcrunch-disrupt-battlefield/" target="_blank">Enigma Reinvents Public Data And Wins TechCrunch Disrupt Battlefield</a> (nibletz.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=e287247a-001f-4ec4-ac85-537ef5a36ea4" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/05/find-the-data-aggregate-the-data-make-the-data-useful/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Xeround, and a tale of evolving business models</title>
		<link>http://cloudofdata.com/2013/05/xeround-and-a-tale-of-evolving-business-models/</link>
		<comments>http://cloudofdata.com/2013/05/xeround-and-a-tale-of-evolving-business-models/#comments</comments>
		<pubDate>Thu, 02 May 2013 13:49:17 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[SaaS]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Business model]]></category>
		<category><![CDATA[cloud database]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Freemium]]></category>
		<category><![CDATA[greenqloud]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[Software as a service]]></category>
		<category><![CDATA[Xeround]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3183</guid>
		<description><![CDATA[Last night, cloud database company Xeround announced that they&#8217;re shutting down the version of their service hosted in public clouds such as Amazon, Rackspace, GreenQloud, and others. Users of the ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/iStock_000008993887Small.jpg" target="_blank"><img class="alignleft size-medium wp-image-3187" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/05/iStock_000008993887Small-200x300.jpg" alt="iStock_000008993887Small" width="200" height="300" /></a>Last night, cloud database company <a href="http://xeround.com/">Xeround</a> <a href="http://xeround.com/blog/2013/05/discontinuing-of-xeround-cloud-database-public-service">announced</a> that they&#8217;re shutting down the version of their service hosted in public clouds such as Amazon, Rackspace, GreenQloud, and others. Users of the free service have until 8 May to move elsewhere, whilst paying customers have until 15 May. The company describes this as an attempt to &#8220;re-focus,&#8221; with the implication that other parts of the business remain viable. It&#8217;s never easy to admit mistakes and kill products, but the ability to do so is an essential part of running a business that&#8217;s viable for the long haul. Xeround&#8217;s announcement needn&#8217;t be interpreted as the end of the company, or the end of databases running in the public cloud. The challenge now is one of persuading staff, investors and customers to move past the short-term pain and uncertainty, and to get behind the new direction with conviction.</p>
<p>Xeround was founded back in 2005, initially delivering scalable data management solutions <a href="http://gigaom.com/2009/04/16/xeround-enables-telco-services-by-virtualizing-data-silos/">into the data centres of telcos such as T-Mobile</a>. The company&#8217;s work on <a href="http://news.cnet.com/8301-13846_3-20016428-62.html">scaling MySQL</a> and <a href="http://www.theregister.co.uk/2011/06/14/xeround_mysql_database_service/">offering it in the cloud</a> brought it to wider attention, and they seemed to maintain momentum by adding <a href="http://gigaom.com/2011/03/16/for-xeround-mysql-in-the-cloud-knows-no-bounds/">additional</a> <a href="http://diversity.net.nz/xeround-rolls-out-database-as-a-service-further/2012/05/14/">cloud</a> <a href="http://www.cloudave.com/23136/greenqloud-and-xeround-launch-cloud-database-on-100-renewable-energy/">partners</a>. Last year, the company <a href="http://www.theregister.co.uk/2012/02/07/xeround_cloud_database_rejig/">rolled out a freemium offering</a> to entice a far wider set of potential customers. It would appear, though, that any influx of paying customers was insufficient to meet the cost of this rapid expansion across disparate cloud providers.</p>
<p>In last night&#8217;s blog post, Xeround&#8217;s Avigail Ofer <a href="http://xeround.com/blog/2013/05/discontinuing-of-xeround-cloud-database-public-service">writes</a>;</p>
<blockquote><p>Xeround’s leadership forum has recently decided to re-focus the company’s effort. This means we will no longer be able to support our service over public clouds, across all of our currently active data centers.</p></blockquote>
<p>This presumably means a return towards the company&#8217;s roots, offering scalable data management solutions inside enterprise data centres. As a strategic decision it may make sense, although that market is far more crowded than it was in 2005. For now, though, the company website continues to talk almost exclusively about their Database as a Service (DBaaS) offering, hosted in public clouds. Anyone looking to see what else Xeround does <em>today</em>, or where it will be active after 15 May, will get few clues from the site. Current and prospective customers must, inevitably, feel anxious. What else might be consigned to the pyre?</p>
<p>Strategic refocusing, cutting underperforming products, and even pivoting are facts of business life. Well-known consumer services like Flickr and Twitter were the result of pivots that must have seemed painful at the time. In those cases, and others, pundits and commentators bemoaned the death of whatever went before&#8230; but few would dispute the success of the new ventures. Customers worry about what they <em>have</em>. I still miss <a href="http://cloudofdata.com/2009/04/tungle-goes-a-long-way-toward-reducing-the-pain-of-scheduling-meetings/">Tungle</a>, and have no idea how I will cope when Google cruelly sacrifices <a href="http://gigaom.com/2013/03/13/google-kills-google-reader-will-go-offline-on-july-1-2013/">the invaluable Reader</a>. But boards and managers need to worry about where they&#8217;re <em>going</em>. Sometimes, maintaining the status quo simply does not make sense. You&#8217;re breaking even or — worse — burning cash. You can see a new competitor that&#8217;s faster, better, or cheaper than you. Your revenues or usage figures are headed the wrong way. All are signs that something needs to change. And if you&#8217;ve got a <em>better</em> idea about how to focus resources, why not go for it?</p>
<p>Successful companies aren&#8217;t necessarily the ones with the most brilliant people, or the strongest teams, or the best idea, or the biggest pile of cash. Successful companies are those with people who are <em>good enough</em>, who can work together <em>well enough</em>, on an idea that&#8217;s <em>intriguing enough</em>, attracting <em>sufficient cash</em> to get noticed. If all of those pieces fall into place, successful companies then need a good dose of luck, and a great big dollop of smart timing. The market needs to be ready, customers need to be in the mood to buy, and the competitive landscape needs to be tilted in your favour. Sometimes, those pieces don&#8217;t all align, and the company needs to shift focus a little and try again.</p>
<p>Trying again is fine. There&#8217;s no shame in it. But when you try again people remember things about how you behaved the last time. They won&#8217;t hold failure or pivoting or downsizing or refocusing against you. But if they are considering becoming an investor, partner or customer, they&#8217;ll remember how you treated investors, partners, and customers last time around.</p>
<p>And that&#8217;s why Xeround&#8217;s timing for the shutdown bothers me. A week (or two, if you&#8217;ve given them cash) is not a lot of time to move off a database that might be baked deep into your business. Customers must be waking up this morning, and scrambling to adapt. Their roadmap is out the window. Their vacation is on hold. Their work to attract the next client abandoned, as they scrabble to keep the database running for the clients they already have. Xeround did that. And it didn&#8217;t need to.</p>
<p>Pivot if you want, Xeround, but you&#8217;ll have a much better hope of maintaining current and future custom if you treat your users with respect. And a week or two&#8217;s notice for the removal of something that may very well be mission critical is not respectful.</p>
<p>I did try to reach Xeround&#8217;s PR firm for comment, without success. I shall update this post if I receive anything relevant from them or the company.</p>
<p><em>Hat tip to <a href="https://twitter.com/gigabarb">Barb Darrow</a> at GigaOM, who was <a href="http://gigaom.com/2013/05/01/xeround-pulls-the-plug-on-free-cloud-database-option/">quick to cover the story</a>&#8230; and to update her post as more facts emerged.</em></p>
<p><em><a href="http://www.istockphoto.com/stock-photo-8993887-up-helly-aa-burning-galley-ship.php">Image</a> of a burning Viking ship from istockphoto.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.rackspace.com/blog/xeround-provides-auto-scaling-high-availability-for-your-mysql-database-in-the-cloud/?replytocom=2485" target="_blank">Xeround Provides Auto-Scaling &amp; High-Availability for your MySQL Database in the Cloud &#8211; The Official Rackspace Blog</a> (rackspace.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.i-programmer.info/news/84-database/5690-biggest-sql-disasters.html" target="_blank">Biggest SQL Disasters</a> (i-programmer.info)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=65ca1414-41dc-4366-8d02-de74d8386084" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/05/xeround-and-a-tale-of-evolving-business-models/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Survey lifts covers on Cloud Promiscuity: good thing, bad thing, or who cares?</title>
		<link>http://cloudofdata.com/2013/04/survey-lifts-covers-on-cloud-promiscuity-good-thing-bad-thing-or-who-cares/</link>
		<comments>http://cloudofdata.com/2013/04/survey-lifts-covers-on-cloud-promiscuity-good-thing-bad-thing-or-who-cares/#comments</comments>
		<pubDate>Mon, 29 Apr 2013 11:14:24 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[Data center]]></category>
		<category><![CDATA[data centre]]></category>
		<category><![CDATA[enstratius]]></category>
		<category><![CDATA[hybrid cloud]]></category>
		<category><![CDATA[multi-cloud]]></category>
		<category><![CDATA[private cloud]]></category>
		<category><![CDATA[public cloud]]></category>
		<category><![CDATA[Rightscale]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3151</guid>
		<description><![CDATA[Figures from RightScale&#8216;s latest State of the Cloud Report (free registration required) suggest &#8220;a strong interest in multi-cloud strategies&#8221; amongst respondents. The rationale for hybrid cloud (mixing a public cloud service like Amazon&#8217;s ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/nanymata/272683954/" target="_blank"><img class="alignleft size-medium wp-image-3159" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/272683954_3e2b654df4_z-300x225.jpg" alt="272683954_3e2b654df4_z" width="300" height="225" /></a>Figures from <a href="http://www.rightscale.com/">RightScale</a>&#8216;s latest <em><a href="http://www.rightscale.com/lp/state-of-the-cloud-report.php">State of the Cloud Report</a></em> (free registration required) suggest &#8220;a strong interest in multi-cloud strategies&#8221; amongst respondents. The rationale for <em>hybrid</em> cloud (mixing a public cloud service like Amazon&#8217;s with something running in your own data centre, colocation site or hosting facility) is reasonably well understood, but why might companies choose to use more than one public cloud and/or more than one private cloud?</p>
<p>Surveys. Along with the loathsome infographic, they have become the default tool of the lazy PR person. I get loads of them, pretty much every day. And, almost without fail, they are a shocking waste of everyone&#8217;s time. Three out of the four people we gave some money to said our product was best. I don&#8217;t care. And I&#8217;m definitely not going to write about it. Every now and then, though, one that&#8217;s worth a little attention comes along. RightScale&#8217;s latest <em>State of the Cloud</em>, released last week, is one of those.</p>
<p>The sample is relatively small (just 625) and, as <em>Network World</em>&#8216;s Brandon Butler <a href="http://www.networkworld.com/news/2013/042613-cloud-impact-269166.html">emphasises</a>, they</p>
<blockquote><p>came from a community to [sic] people that had somehow, some way interacted with RightSacle [sic] in the past — they had registered on the company&#8217;s website, dropped a business card in a bowl at a trade show, or are a customer (although RightScale says only 30% of respondents were customers). The firm&#8217;s VP of Marketing Kim Weins says respondents to RightScale&#8217;s survey admittedly have some &#8216;affinity&#8217; to cloud computing.</p></blockquote>
<p>RightScale&#8217;s analysis of respondent demographics bears this out, with over half (51%) describing themselves as being from &#8216;Tech Software, Services, Hardware&#8217; businesses. The broader community of cloud-consumers in manufacturing, financial services, media and the rest is far less well represented.</p>
<div id="attachment_3161" class="wp-caption aligncenter" style="width: 310px"><a href="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/Screenshot_29_04_2013_10_49.png"><img class="size-medium wp-image-3161 " title="Survey Demographics" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/Screenshot_29_04_2013_10_49-300x103.png" alt="Survey Demographics" width="300" height="103" /></a><p class="wp-caption-text">Survey Demographics. Image © RightScale 2013</p></div>
<p>The survey digs into a number of areas (see the Related articles, below), but it was the discussion of &#8216;multi-cloud&#8217; that particularly caught my attention.</p>
<div id="attachment_3163" class="wp-caption aligncenter" style="width: 310px"><a href="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/Screenshot_29_04_2013_11_17.png"><img class="size-medium wp-image-3163  " style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/Screenshot_29_04_2013_11_17-300x167.png" alt="Enterprise Cloud Strategy pie chart" width="300" height="167" /></a><p class="wp-caption-text">Image © RightScale 2013</p></div>
<p>According to the survey, 77% of respondents from enterprises with 1,000+ employees (see <a href="#disclaimer">Disclaimer</a> for an important proviso) had some sort of multi-cloud policy. The majority of those (47%) are pursuing some form of on- and off-premise hybrid strategy, whilst 15% are employing multiple private clouds and 15% are employing multiple public clouds. Amongst enterprises with 1,000+ employees that are <em>already</em> using a public cloud solution, the numbers are even higher; 88% are pursuing a multi-cloud policy, with 53% opting for a hybrid approach, 22% for multiple public clouds, and 13% for multiple private clouds. In both cases, it is unclear whether respondents could describe themselves as using multiple public <em>and</em> multiple private clouds.</p>
<p>A hybrid approach, in which some computing is done on-premise (or in a long-term and highly managed co-location or hosting arrangement) and some is done in the public cloud makes an awful lot of sense. Some data may be deemed too expensive or too sensitive to routinely move in and out of the cloud. Some workloads are steady, routine, and highly optimised to run most cost-effectively in a supported, SLA-wrapped and tightly managed dedicated facility. Other workloads demonstrate dramatic peaks and troughs in usage, or require irregular access to resources which are too expensive to maintain — frequently idle — in-house. Even in the most regulated, paranoid or old-fashioned organisations, there are plenty of workloads for which a hybrid solution is ideal.</p>
<p>More interesting than the simple use of a hybrid solution is the suggestion that companies are using more than one public cloud, more than one private cloud, and perhaps more than one of both. There is, perhaps, a clear opportunity here for cloud management solutions (such as RightScale&#8217;s, of course), but to really understand the nature of the opportunity we need to understand far more about <em>why</em> the trend we&#8217;re seeing is occurring. There&#8217;s insufficient data in RightScale&#8217;s published results to be sure, but possible explanations include, one, some, or all of the following.</p>
<h3>Eggs and Baskets</h3>
<p>It&#8217;s a conscious decision. Public clouds suffer outages. Private cloud providers get acquired, go bust, or change direction. To continue delivering core services to key customers, the modern enterprise is forced to spread its services across different solutions providers. The gradually increasing credibility of enterprise cloud solutions from HP, IBM, Microsoft and others may diminish this perceived requirement.</p>
<h3>The Search for Features</h3>
<p>It may be deliberate, but is more likely to be an accidental byproduct of something else. Different public and private cloud providers excel in different areas. They provide support for different virtual machine instances, their apis are better at some actions than others, they offer services in different jurisdictions, and with varying levels of support. The modern enterprise, in meeting a wide range of business requirements, will inevitably find itself selecting different clouds for different reasons. This is likely to continue for some time to come, despite the growing feature completeness of competing offerings.</p>
<h3>The Departmental Power Grab</h3>
<p>It may be deliberate, but is more likely to be an accidental byproduct of something else. The organisation has no centralised policy regarding the cloud. Different departments and divisions are pursuing their own formal strategies. Their differing requirements lead, almost inevitably, to the selection of different solutions. As the CIO attempts to reassert control, there may be scope for solutions providing a sensible view across these competing services. Alternatively, there may be an explicit move to consolidate, creating opportunities for those able to assist in migrating services from one provider to another.</p>
<h3>Shadow IT</h3>
<p>It may be deliberate, but is more likely to be an accidental byproduct of something else. The different solutions are being selected by individuals and small teams. They may be selecting those they&#8217;ve previously used elsewhere, or they may be employing a wide range of criteria from the frivolous to the robust. Differing requirements and perspectives lead to a wide range of overlapping services being selected. If the CIO attempts to reassert control, there may be scope for solutions providing a sensible view across these competing services. Alternatively, there may be an explicit move to consolidate, creating opportunities for those able to assist in migrating services from one provider to another.</p>
<h3> Play Time</h3>
<p>It may be deliberate, but is more likely to be an accidental byproduct of something else. Developers are casting the net wide, experimenting and exploring. The organisation may be littered with accounts for different cloud services that someone tried, explored for a while, and then moved away from. There may well be a large number of intranet-type applications that were built during the exploration phase, deployed to a cohort of internal users, and then simply left running.</p>
<h3>Oops</h3>
<p>It&#8217;s unlikely to have been a conscious business decision. No one knew someone else had signed up for that other cloud. The person who did it left last year, and we&#8217;re really not sure if there are some nasty dependencies in our upcoming product&#8230;</p>
<p>What do you think? Why are we seeing people using more than one public cloud, or more than one private cloud? Is it a trend that&#8217;s likely to continue? Is it an opportunity, or a cause for concern?</p>
<p>I suspect that the vast majority of &#8216;multi-public&#8217; and &#8216;multi-private&#8217; responses are not the result of a conscious and explicit decision to use more than one cloud. Instead, they&#8217;re simply a byproduct of the industry&#8217;s current state of evolution. One cloud, today, is better for some things, and its competitor is better for others. Mainstream enterprise adopters need both those capabilities, and therefore end up having to use two clouds.</p>
<p>Potential adopters of multi-cloud, though, would benefit from a more considered exploration of the issues. There are good reasons to learn and use more than one cloud, but there is significant scope for increased complexity in following such a path. Cloud management providers like Enstratius and RightScale may diminish some of that complexity, but it doesn&#8217;t go away altogether.</p>
<p>Multi-cloud should be a conscious decision.</p>
<p><em><a name="disclaimer"></a><strong>Disclaimer</strong>: It is important to note that the RightScale survey does not identify the number of responses upon which analysis is being performed. The total number of survey respondents was 625, but subsequent discussion in their report appears to be based upon subsets of the whole. Figures on enterprise cloud usage, for example, are drawn from &#8220;organisations with a hybrid cloud strategy&#8221; (p.6). There is no information on the proportion of respondents classed as &#8216;enterprise,&#8217; nor the proportion of those that may have a cloud strategy. The information that 29% currently run apps in a hybrid cloud may point to 29% of 625, or 29% of 3. Similarly, discussion (p.7) of multi-cloud implementation is based upon enterprise respondents with more than 1,000 employees! Unhelpfully, the descriptive text uses &#8220;all respondents,&#8221; &#8220;all enterprise respondents,&#8221; &#8220;respondents&#8221; and &#8220;enterprises&#8221; interchangeably. It would be useful if RightScale could provide explicit sample sizes next to each of their graphs.</em></p>
<p><em><a href="http://www.flickr.com/photos/nanymata/272683954/">Image</a> of feet in a bed by Flickr user <a href="http://www.flickr.com/people/nanymata/">Nany Mata</a>.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.zdnet.com/rightscale-survey-points-toward-preference-for-multi-cloud-deployments-7000014355/" target="_blank">RightScale survey points toward preference for &#8216;multi-cloud&#8217; deployments</a> (zdnet.com)</li>
<li class="zemanta-article-ul-li"><a href="http://gigaom.com/2013/04/26/rightscale-sees-uptick-in-cloud-adoption-and-multi-cloud-use/" target="_blank">RightScale sees uptick in cloud adoption and multi-cloud use [GigaOM]</a> (gigaom.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.networkworld.com/news/2013/042613-cloud-impact-269166.html" target="_blank">Cloud adoption growing, but how big depends on who you ask</a> (networkworld.com)</li>
<li class="zemanta-article-ul-li"><a href="http://blog.rightscale.com/2013/04/25/rightscale-state-of-the-cloud-2013-a-new-industry-survey/" target="_blank">RightScale State of the Cloud 2013: A New Industry Survey</a> (rightscale.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.forbes.com/sites/joemckendrick/2013/04/25/with-experience-comes-wisdom-in-cloud-computing-survey-shows/" target="_blank">With Experience Comes Wisdom in Cloud Computing, Survey Shows</a> (forbes.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.datacenterknowledge.com/archives/2013/04/25/survey-says-enterprise-cloud-maturing-devops-rising-multi-cloud-usage-growing/" target="_blank">Survey Says: Multi-Cloud Usage Growing, DevOps on the Rise</a> (datacenterknowledge.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=63b7a92f-dd0a-42b3-a97d-08a134549d14" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/04/survey-lifts-covers-on-cloud-promiscuity-good-thing-bad-thing-or-who-cares/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Visualisation &#8211; the key that unlocks data&#8217;s value?</title>
		<link>http://cloudofdata.com/2013/04/visualisation-the-key-that-unlocks-datas-value/</link>
		<comments>http://cloudofdata.com/2013/04/visualisation-the-key-that-unlocks-datas-value/#comments</comments>
		<pubDate>Fri, 26 Apr 2013 11:44:09 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[Data mining]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[data visualisation]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[infer]]></category>
		<category><![CDATA[rosslyn Analytics]]></category>
		<category><![CDATA[Vik Singh]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3107</guid>
		<description><![CDATA[
As the Big Data hype machine continues its relentless attempt to gobble everything in its path, new business units and entire new domains buying into the promise find themselves faced ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://shop.mapumental.com/static/i/gallery/EdinburghICC75mArr9am.png" target="_blank"><img class="alignleft size-medium wp-image-3111" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/EdinburghICC75mArr9am-300x300.png" alt="Edinburgh journey times, from Mapumental" width="300" height="300" /></a></p>
<p>As the Big Data hype machine continues its relentless attempt to gobble everything in its path, new business units and entire new domains buying into the promise find themselves faced with unanticipated data volume and complexity. They see the potential for data-based decision making, but still face (short-term?) challenges in actually managing, analysing or interpreting the data they now collect.</p>
<p>Early iterations of core tools such as Hadoop were raw and unpolished, driving the emergence of a niche group of developers and data analysts with the specialist skills to cope. Those tools become easier to use with each new release, which goes some way toward countering panicked claims of <a href="http://online.wsj.com/article/SB10001424052702304723304577365700368073674.html">a massively yawning skills gap</a>.</p>
<p>There is, of course, still a need for people with specialist knowledge, hard-won skills, and painfully gained experience. But you no longer (if you ever really did) need to install a Hadoop cluster with your bare hands and juggle complex statistical formulae in your head to benefit from the growing prevalence of data in all aspects of business.</p>
<p>At the &#8216;softer&#8217; end of the market, specifically, there has been an explosion of new startups rushing to offer tools that make it easier to create visualisations and dashboards to deliver some value from the data whilst hiding its complexity. Some established players in the visualisation and business analytics market, such as <a href="http://gigaom.com/2013/04/03/a-tableau-ipo-could-validate-the-big-data-visualization-push-or-not/">IPO-prepping</a> <a href="http://www.tableausoftware.com/">Tableau</a> and <a href="http://www.ft.com/cms/s/0/f26f660e-97d0-11e2-97e0-00144feabdc0.html#axzz2RZ0Uud6C">IPO-considering</a> <a href="http://www.rosslynanalytics.com/">Rosslyn Analytics</a>, are hoping to press home their lead. Elsewhere, visualisation and business intelligence pros eye the influx of new users with horror, reprising the wailing of typesetters and designers when desktop publishing tools like Ventura and PageMaker first appeared; these amateurs, they protest, cannot do it <em>properly</em>. They do not <em>understand</em>. They — shock, horror — might <em>mislead</em> people.</p>
<p>We&#8217;ve been here before. A <a href="http://gigaom.com/2012/07/11/idc-analytics-a-51b-business-by-2016-thanks-to-big-data/">$50bn market opportunity</a> will tend to see smart young things — and charlatans — emerge from the woodwork in droves. The market will settle. Some of the incumbents will survive, some of the new entrants will succeed, and most of the customers will (eventually) work out what they really need. Visualisation&#8217;s equivalent of using a different font for every paragraph (just because you can) and the &lt;blink&gt; tag will fade, just as it did in desktop publishing and web &#8216;design&#8217;.</p>
<p>The new data visualisation tools from companies like <a href="http://chartio.com/">chart.io</a> demonstrate an interesting trend toward atomisation of business tasks. We&#8217;ve seen this in the consumer space for a long time, particularly since the advent of channels like Apple&#8217;s App Store. But in business, the trend has been towards consolidation, with fewer and fewer software tools trying to do more and more. These bloated behemoths become ever-harder to use, ever-harder to support, and developers find themselves increasingly constrained by dependencies and legacy as they try to innovate or encourage the next upgrade cycle.</p>
<p>Now, though, we&#8217;re seeing renewed enthusiasm for the small, focused, application. It needn&#8217;t do much, but it must do what it does compellingly and well. Increased provision of APIs helps, of course, with users more easily able to extract data from a source, work with it, and then pass the result on to the next application in the chain. In principle, a recipe for freedom of choice, best of breed, and healthy innovation. Whether it continues to work in practice remains to be seen.</p>
<p>Many of these topics surfaced in a conversation earlier this week, as I spoke with <a class="zem_slink" title="Vik Singh" href="http://zooie.wordpress.com/" rel="homepage" target="_blank">Vik Singh</a>, CEO and co-founder of <a href="https://www.infer.com/">Infer</a>. <a href="http://gigaom.com/2013/04/23/infer-takes-10m-to-find-the-sales-leads-most-likely-to-pay-off/">The company has been in the news this week</a> because it secured $10m in new funding. But Singh talks passionately of the need to &#8220;do one thing well.&#8221; Infer, he suggests, does that. They&#8217;ve identified a problem — lead scoring — in which the focused application of some effort can quickly lead to a &#8220;measurable lift on the top line&#8221; of their customers. Infer can be deployed (as SaaS) in a matter of days, integrated with <em>existing</em> tools such as Salesforce and Marketo, and quickly start to deliver demonstrable returns.</p>
<blockquote><p>We focus on a particular problem&#8230; Our team understands what matters. We don&#8217;t focus on counting features. <em>We focus on finding the least amount of features to deliver the most value</em>.</p></blockquote>
<p>It&#8217;s a refreshing attitude, but does it create a product — or a company — that can grow? Singh certainly seems convinced that it does, and talks of ways that the basic notion of lead scoring can be extended as customers move away from intuition and best-guesses towards a data-based decision making process.</p>
<p>Tools are increasingly emerging to help business decision makers be better informed by the data that matters to their organisation. Some of those tools are best used by people with the technical and business skills to understand data and its implications. Some are, perhaps, <em>too</em> easy to use, and will encourage half-baked analyses based upon inappropriate data. In purely Darwinian terms, those tools — and the companies that invested too heavily in them — will most likely fall by the wayside. We&#8217;re at an early stage in a journey that needs to see all of us become more data literate. Good tools, smart companies, and some great lighthouse examples can all help that. Just as use of desktop publishing and web design tools settled down, we&#8217;ll see the same with data manipulation tools. I&#8217;m not particularly worried about this industry&#8217;s ability to deliver good infrastructure tools, good data ingest, storage, and manipulation tools, or good data visualisation tools.</p>
<p>I&#8217;m more concerned to ensure that we nurture the softer skills. Almost anyone can make a chart (they teach it to 7 year olds!). Many people can even make the <em>right</em> chart. Far fewer people can craft the narrative that makes the data in that chart sing, that uses it as a tool to persuade, to shape, and to take decisions. We&#8217;re cracking the technical side of the problem. Now we need to find — and to celebrate — the story tellers.</p>
<p>And just to finish off, it was interesting to see <a href="http://en.wikipedia.org/wiki/Stephen_Wolfram">Stephen Wolfram</a>&#8216;s blog post this week, <a href="http://blog.stephenwolfram.com/2013/04/data-science-of-the-facebook-world/">Data Science of the Facebook World</a>. Lots of data, lots of graphs, and the beginnings of a story to make the data sing&#8230;</p>
<p><em><a href="http://shop.mapumental.com/static/i/gallery/EdinburghICC75mArr9am.png">Image</a>, produced by <a href="http://mapumental.com/">Mapumental</a>, shows average public transport journey times into central Edinburgh.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.guardian.co.uk/news/datablog/2013/apr/25/forget-big-data-small-data-revolution&amp;a=163065586&amp;rid=92cb9c35-4704-4ed4-bb6b-76b11c1350e1&amp;e=9eac00d8aaef991fa79bfa0e7ce3aeeb" target="_blank">Forget big data, small data is the real revolution</a> (guardian.co.uk)</li>
<li class="zemanta-article-ul-li"><a href="http://www.ritholtz.com/blog/2012/07/10-reasons-why-we-visualise-data/" target="_blank">10 Reasons Why We Visualise Data</a> (ritholtz.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.visualisingdata.com/index.php/2013/04/discussion-storytelling-and-success-stories/" target="_blank">Discussion: Storytelling and success stories</a> (visualisingdata.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=92cb9c35-4704-4ed4-bb6b-76b11c1350e1" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/04/visualisation-the-key-that-unlocks-datas-value/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OpenStack Summit &#8211; thoughts from Portland</title>
		<link>http://cloudofdata.com/2013/04/openstack-summit-thoughts-from-portland/</link>
		<comments>http://cloudofdata.com/2013/04/openstack-summit-thoughts-from-portland/#comments</comments>
		<pubDate>Thu, 18 Apr 2013 22:28:09 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenStack]]></category>
		<category><![CDATA[Portland Oregon]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=3025</guid>
		<description><![CDATA[OpenStack has come a long way since the project was first unveiled at OSCon back in 2010. This week, almost 3,000 people gathered in Portland, Oregon, to continue the job ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/portland-e1366251485477.jpg" target="_blank"><img class="alignleft size-medium wp-image-3059" style="border: 0px; margin: 6px;" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/portland-e1366251485477-300x225.jpg" alt="portland" width="300" height="225" /></a><a href="http://www.openstack.org/">OpenStack</a> has come a long way since the project was first unveiled at OSCon back in 2010. This week, almost 3,000 people gathered in Portland, Oregon, to continue the job of defining, debating, developing, and delivering the code upon which the OpenStack community depends. Alongside the developers, though, there were some early signs of tangible adoption.</p>
<p>During Monday&#8217;s Analyst Day, and in sessions throughout the conference, we began to see evidence of fledgling steps being taken beyond the early adopters on the bleeding edge. As Hewlett Packard&#8217;s Florian Otel remarked in <a href="http://openstacksummitapril2013.sched.org/event/7053934d83b403845bff1de25a8c72be#.UW9dZGRARk8">his session</a>, &#8220;it&#8217;s called the <em>bleeding</em> edge for a reason.&#8221; Some people want to be there. Some people <em>need</em> to be there. The majority of adopters, however, simply want robust, credible, dependable, and wide-ranging solutions that work. After a period in which OpenStack&#8217;s hype, perhaps, outpaced its reality, might we finally be seeing some credible proof points beginning to emerge?</p>
<p>The fledgling <a href="http://www.openstack.org/foundation/">OpenStack Foundation</a> clearly recognises its own need to move beyond an introspective community of tech enthusiasts playing with an interesting new toy. The video (embedded below) which loudly greeted the audience for Tuesday&#8217;s opening keynote was not subtle. If OpenStack is to succeed, the e.n.t.e.r.p.r.i.s.e is where it must go.</p>
<p>Announcements and briefings during the event definitely lacked the wow factor, but that is not unexpected at this stage in the project&#8217;s evolution&#8230; and may even have been intentional. The message was one of steady progress (a 26% increase in patches to the code since Q4 of last year), broadening community (a 27% growth in participating companies), and growing interest (a doubling in website traffic).  Beyond some lighthouse deployments at Best Buy, Bloomberg, Comcast, the National Security Agency and the like, it was harder to assess the extent to which OpenStack code is actually being <em>used</em>. Josh McKenty of <a href="http://www.pistoncloud.com/">Piston</a> suggested that there were around 400 production deployments of OpenStack, plus 20 times that number of pilots. Amazon, again and again, was the competitor to beat, which may be a mistake. As <a href="http://www.cloudability.com/">Cloudability</a>&#8216;s Mat Ellis reminded me, this really isn&#8217;t a zero sum game. OpenStack doesn&#8217;t have to take users away from Amazon to win. There is a massive opportunity in greenfield deployments, and in helping customers to migrate from more traditional on-premise IT facilities. OpenStack may be better served by looking ahead, rather than always benchmarking itself against the Seattle behemoth. Yes, they compete to a degree. But there&#8217;s still plenty of room for both.</p>
<p>More worryingly, there was clear hubris with respect to OpenStack&#8217;s more obvious competitors; <a href="http://www.vmware.com">VMware</a>, <a href="http://www.eucalyptus.com/">Eucalyptus</a> and Apache&#8217;s <a href="http://cloudstack.apache.org/">CloudStack</a>. VMware, I was told far too often, &#8220;cannot deliver a viable solution.&#8221; Eucalyptus was &#8220;dead in the water,&#8221; and CloudStack &#8220;irrelevant.&#8221; More positive — or realistic! — assessments appeared disconcertingly thin on the ground.</p>
<p>Amongst the case studies, perhaps the strongest was the one we&#8217;re not allowed to talk about. I hope that the OpenStack Foundation realises what a powerful story it is, and that they&#8217;re doing all they can to get the company concerned to permit its story to be told.</p>
<p>So, if the sessions and attendees in Portland are any measure, OpenStack is doing fine. It&#8217;s at a tricky stage in its evolution, though, as it moves from rapid development and developer evangelism toward dependable delivery and enterprise adoption. The first evidence of that shift is there to be seen. By the next OpenStack Summit (in Hong Kong), I&#8217;d hope — and expect — to see quite a bit more.</p>
<p><span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='640' height='390' src='http://www.youtube.com/embed/KHqzTBPQYl8?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<p><em>Disclosure: the OpenStack Foundation paid my travel and expenses to attend this event in Portland, Oregon.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2013/04/17/at-nsa-the-cloud-is-about-big-data-and-moving-beyond-it/" target="_blank">At NSA, The Cloud Is About Big Data And Moving Beyond IT</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://servicesangle.com/blog/2013/04/16/openstack-summit-highlights-hp-support-third-party-solutions-gain-traction/" target="_blank">OpenStack Summit Highlights: HP Support, Third Party Solutions Gain Traction</a> (servicesangle.com)</li>
<li class="zemanta-article-ul-li"><a href="http://readwrite.com/2013/04/15/aws-vs-vmware-vs-openstack-and-the-cloud-winner-is" target="_blank">AWS vs. VMware vs. OpenStack: And The Cloud Winner Is&#8230;</a> (readwrite.com)</li>
<li class="zemanta-article-ul-li"><a href="http://gigaom.com/2013/04/16/top-5-lessons-learned-at-openstack-summit/" target="_blank">Top 5 lessons learned at OpenStack Summit [GigaOM]</a> (gigaom.com)</li>
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2013/04/14/rackspace-looks-to-the-telcos-in-global-expansion-of-its-openstack-cloud/" target="_blank">Rackspace Looks To The Telcos In Global Expansion Of Its OpenStack Cloud</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://go.theregister.com/feed/www.theregister.co.uk/2013/04/18/canonical_ubuntu_openstack/" target="_blank">Shuttleworth: Canonical is fast and easy with OpenStack clouds</a> (go.theregister.com)</li>
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.infoworld.com/d/cloud-computing/openstack-gains-momentum-vendors-give-grizzly-bear-hug-216458&amp;a=160210517&amp;rid=04d95d7c-10b9-4ada-82c5-469cd5e3f29e&amp;e=4d550e7c4375d3df4baed4babda04729" target="_blank">OpenStack gains momentum as vendors give Grizzly a bear hug</a> (infoworld.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=04d95d7c-10b9-4ada-82c5-469cd5e3f29e" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/04/openstack-summit-thoughts-from-portland/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Not quite ready to live in the cloud</title>
		<link>http://cloudofdata.com/2013/04/not-quite-ready-to-live-in-the-cloud/</link>
		<comments>http://cloudofdata.com/2013/04/not-quite-ready-to-live-in-the-cloud/#comments</comments>
		<pubDate>Thu, 18 Apr 2013 00:27:49 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Chromebook]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[offline access]]></category>
		<category><![CDATA[web browser]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=2895</guid>
		<description><![CDATA[Google&#8217;s impressive Chromebook Pixel is just the latest in a series of devices which are trying to entice users to compute in a different way. With (almost) ubiquitous connectivity, and ...]]></description>
				<content:encoded><![CDATA[<p><a href="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/chromebook-pixel.jpg"><img class="alignleft size-medium wp-image-3033" src="http://cloudofdata.zippykid.netdna-cdn.com/wp-content/uploads/2013/04/chromebook-pixel-300x205.jpg" alt="chromebook-pixel" width="300" height="205" /></a>Google&#8217;s impressive <a href="http://www.google.com/intl/en/chrome/devices/chromebooks.html#pixel">Chromebook Pixel</a> is just the latest in a series of devices which are trying to entice users to compute in a different way. With (almost) ubiquitous connectivity, and an increasing reliance upon web-based services for mail, calendars, document creation and more, might we be reaching a point at which the browser really can be our means of accessing <em>everything</em>? Philosophically, the idea resonates. And yet, although I am not a power user who needs to regularly process video or edit high resolution images (the usual excuses for not embracing the Chromebook vision), I still remain uncomfortable with giving up my non-browser tools. Despite living and working in the cloud, I find that locally installed client software continues to deliver real value. Maybe, the next time I upgrade a computer, I need to try installing nothing more than a browser for a week or two, and see if it&#8217;s as painful as I feel it could be&#8230;</p>
<p>The cloud powers my business. The cloud is what I talk to clients about, it&#8217;s what I write about, it&#8217;s what people pay me to know about. The cloud (and, more generally, the web) make it possible for me to work with clients around the world, often without leaving a small market town in the East Riding of Yorkshire. Every piece of software I buy, install and use (except <a class="zem_slink" title="Microsoft Office" href="http://office.microsoft.com/en-us/default.aspx" rel="homepage" target="_blank">Microsoft Office</a>, which I do still have to endure from time to time) is little more than a window onto the cloud. Most of those software tools have a web interface that I could use. Even tools which don&#8217;t (like <a class="zem_slink" title="OmniFocus" href="http://www.omnigroup.com/applications/omnifocus/" rel="homepage" target="_blank">OmniFocus</a>) have competitors that do. So in principle I could do pretty well all of my mainstream tasks in a web browser. But I don&#8217;t. And I&#8217;m not yet sure that I want to.</p>
<p>On one level, mainstays of my working day like <a href="http://www.evernote.com/">Evernote</a>, <a href="http://reederapp.com/">Reeder</a> and <a href="http://www.dropbox.com/">Dropbox</a> exist to ensure that the content I want is available on whichever device I&#8217;m using, whenever I need it. Desktop, laptop, tablet and mobile phone all access the same shared pool of content, banishing the old world in which either everything travelled on a USB stick or (more likely) the document you wanted was on your <em>other</em> device. Both Evernote and Dropbox (and other tools I use) have decent web interfaces. I could just use those, and (provided there&#8217;s a network connection) realise the same basic benefit of access to documents and data. But the desktop clients (and mobile apps) bring a degree of polish to the user experience that &#8211; to me &#8211; still seems to add real value. If nothing else, local caching makes them <em>quicker</em>. Indeed, Reeder is really <em>just</em> a local window onto a web-based service (the soon-to-die Google Reader). That dependency creates an interesting challenge for Reeder&#8217;s creator, following Google&#8217;s decision to shut down their service.</p>
<p>For email, calendaring and basic document creation, however, I&#8217;ve fully embraced the browser-based experience. Google Apps, running inside Chrome, with a few core extensions such as <a href="http://rapportive.com/">Rapportive</a>. Here, locally installed software no longer adds any value for me. Blogging, too, mostly happens inside a WordPress editor these days. Offline editors like <a class="zem_slink" title="MarsEdit" href="http://www.red-sweater.com/marsedit/" rel="homepage" target="_blank">MarsEdit</a>, although good, now languish unused on my hard drive. The boring financial side of my business is also entirely on the web, with everything handled on my bank&#8217;s web site or within <a class="zem_slink" title="FreeAgent Central" href="http://www.freeagentcentral.com" rel="homepage" target="_blank">FreeAgent</a>&#8216;s web-based tool.</p>
<p>And then there&#8217;s <a class="zem_slink" title="Skype" href="http://skype.com" rel="homepage" target="_blank">Skype</a>. With no sign of Google Voice in Europe, Skype remains the mainstay for voice and video communication. And it only works if you&#8217;ve downloaded and installed the software. It would be difficult to rely upon a device which wouldn&#8217;t let me use Skype.</p>
<p>It&#8217;s certainly <em>possible</em> to do almost everything without locally installing any software, and that&#8217;s a remarkable step forward. <em>But it&#8217;s really not clear that the web-only experience is good enough (yet) for people to embrace it by choice. </em>Thin clients (like the Chromebook) may be cheaper than their Windows and OS X-powered equivalents, especially if sales volumes grow. They may be easier to manage, which must appeal to enterprise IT managers. But would an individual choose to buy one, except to save themselves a bit of money? Not yet, I suspect.</p>
<p>I&#8217;ve moved entirely from local email, calendar and blog clients to the web. I&#8217;ve moved mostly from local word processing to the web. How long will it be until the web interface for other services becomes our first choice, rather than a useful backup in those situations where you&#8217;re borrowing someone else&#8217;s computer? It will be interesting to see&#8230; The Chromebook (both the eye-wateringly expensive Pixel and cheaper variants) offers an interesting illustration of future potential. With the current state of web tools, though, today&#8217;s Chromebooks cannot be more than a niche play. It will not be many years before that changes, and light, fast, cheap, well-connected devices with great batteries become a valid choice for the majority of users in need of a device that isn&#8217;t a tablet or a smartphone.</p>
<p><em>Image © Google.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2013/02/21/googles-chromebook-pixel-1299-for-a-freaking-touchscreen-chromebook/" target="_blank">Google&#8217;s Chromebook Pixel: $1,299 for a freaking touchscreen Chromebook</a> (venturebeat.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.bbc.co.uk/news/technology-21534382" target="_blank">Google unveils touchscreen laptop</a> (bbc.co.uk)</li>
<li class="zemanta-article-ul-li"><a href="http://www.zdnet.com/first-real-world-usage-figures-suggest-chromebooks-are-struggling-7000014102/" target="_blank">First real-world usage figures suggest Chromebooks are struggling</a> (zdnet.com)</li>
<li class="zemanta-article-ul-li"><a href="http://r.zemanta.com/?u=http%3A//www.cnn.com/2013/04/10/tech/gaming-gadgets/chromebook-pixel-review/index.html&amp;a=159031685&amp;rid=18788fde-d7bc-4314-b119-d79ec85c325e&amp;e=a7457f0fb1a1757c953375b7fc05ab4f" target="_blank">My month with the Chromebook Pixel: A review</a> (cnn.com)</li>
<li class="zemanta-article-ul-li"><a href="http://gigaom.com/2013/03/28/naysayers-be-damned-why-i-bought-a-chromebook-pixel/" target="_blank">Naysayers be damned: Why I bought a Chromebook Pixel</a> (gigaom.com)</li>
</ul>
<p>&nbsp;</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=18788fde-d7bc-4314-b119-d79ec85c325e" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2013/04/not-quite-ready-to-live-in-the-cloud/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>