<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
		xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Paul Miller - The Cloud of Data &#187; Semantic Web</title>
	<atom:link href="http://cloudofdata.com/category/semantic-web/feed/" rel="self" type="application/rss+xml" />
	<link>http://cloudofdata.com</link>
	<description>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</description>
	<lastBuildDate>Fri, 10 Feb 2012 10:46:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<copyright>Licensed under the Creative Commons Attribution License, version 3.0 http://creativecommons.org/licenses/by/3.0/</copyright>
	<managingEditor>paul.miller@cloudofdata.com (Paul Miller)</managingEditor>
	<webMaster>paul.miller@cloudofdata.com (Paul Miller)</webMaster>
	<ttl>1440</ttl>
	<image>
		<url>http://cloudofdata.com/logo144x144.jpg</url>
		<title>Paul Miller - The Cloud of Data</title>
		<link>http://cloudofdata.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle>conversations with the executives shaping Cloud Computing and the Semantic Web.</itunes:subtitle>
	<itunes:summary>Linked Data, Cloud Computing, Semantic Web, SaaS, PaaS, more</itunes:summary>
	<itunes:keywords>Cloud Computing, Semantic Web, Linked Data, Open Data, SaaS, PaaS</itunes:keywords>
	<itunes:category text="Technology" />
	<itunes:category text="Business" />
	<itunes:author>Paul Miller</itunes:author>
	<itunes:owner>
		<itunes:name>Paul Miller</itunes:name>
		<itunes:email>paul.miller@cloudofdata.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://cloudofdata.com/logo300x300.jpg" />
		<item>
		<title>A conversation with Richard Wallis, an experiment, and a survey</title>
		<link>http://cloudofdata.com/2012/02/a-conversation-with-richard-wallis-an-experiment-and-a-survey/</link>
		<comments>http://cloudofdata.com/2012/02/a-conversation-with-richard-wallis-an-experiment-and-a-survey/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 14:49:39 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Administrivia]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[IMovie]]></category>
		<category><![CDATA[Richard Wallis]]></category>
		<category><![CDATA[Skype]]></category>
		<category><![CDATA[SurveyMonkey]]></category>
		<category><![CDATA[Talis Group]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1786</guid>
		<description><![CDATA[Richard Wallis left Talis (my former employer) last month, and has set up as a consultant at DataLiberate. In this short podcast, Richard shares some of his thoughts on data, semantics, and &#8216;the power of the link.&#8217; Our conversation is also an excuse for an experiment. I have been producing audio-only podcasts here and elsewhere [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://uk.linkedin.com/in/richardwallis"><img class="alignright  wp-image-1789" style="border-style: initial; border-color: initial; border-image: initial; border-width: 0px; margin: 5px;" title="Richard Wallis" src="http://cloudofdata.com/wp-content/uploads/2011/07/3543-293x3005.jpg" alt="" width="176" height="180" />Richard Wallis</a> left <a class="zem_slink" title="Talis Group" href="http://www.talis.com" rel="homepage">Talis</a> (<a href="http://cloudofdata.com/2008/12/paul-miller-is-bound-for-pastures-new/">my former employer</a>) last month, and has set up as a consultant at <a href="http://dataliberate.com/">DataLiberate</a>. In this short podcast, Richard shares some of his thoughts on data, semantics, and &#8216;the power of the link.&#8217;</p>
<p>Our conversation is also an excuse for an experiment. I have been producing audio-only podcasts here and elsewhere for a number of years, but have always tended to avoid producing video. It&#8217;s more effort, it requires more bandwidth at both ends of the conversation, and I&#8217;ve never really been convinced that it adds very much to a conversation between two people. Anecdotal evidence would also suggest that my current podcasts are consumed in environments where video would not work; washing dishes, walking dogs, and sitting on buses.</p>
<p>However, rather than just continue to presume that my biases are correct, I&#8217;ve decided to give video a try. Richard kindly agreed to participate, and the result is <a href="http://youtu.be/d4_tbNeoBTo">available on YouTube</a> and embedded here.</p>
<p style="text-align: center;"><iframe title="YouTube video player" class="youtube-player" type="text/html" width="425" height="344" src="http://www.youtube.com/embed/d4_tbNeoBTo" frameborder="0" allowFullScreen="true"> </iframe></p>
<p>An audio-only version is also available for download if you prefer. The introductory remarks in this version are slightly different to those on the video, as they come straight from the original conversation.</p>
<p></p>
<p>It&#8217;s perhaps unfair to draw too many conclusions from this first attempt, but a few things are immediately apparent. The whole process takes an awful lot longer. The files are larger, so processing and uploading times increase 2-3 fold. Uploading a separate audio file also takes a bit of time. Simply dumping the <a href="http://www.ecamm.com/mac/callrecorder/">Skype recording</a> into <a class="zem_slink" title="IMovie" href="http://www.apple.com/ilife/imovie/" rel="homepage">iMovie</a> worked just fine&#8230; but I&#8217;ve (so far) not managed to find any way to balance the audio levels. <a href="http://www.apple.com/ilife/garageband/">Garageband</a> lets me do this with my audio-only podcasts, but iMovie doesn&#8217;t seem to, so Richard&#8217;s side of the conversation comes across as quite a bit louder than mine.</p>
<p><strong>Having done one, I&#8217;m still not convinced that the video adds anything to the conversation. But what do you think? <a href="http://www.surveymonkey.com/s/PS87ZMX">If you&#8217;ve listened to any of my podcasts, please take a moment to complete the short survey over at SurveyMonkey.</a> Your responses will help me to decide where to go next.</strong></p>
<p>Many thanks.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=c105eaa4-63ec-406a-a677-ec6bcb2513f9" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2012/02/a-conversation-with-richard-wallis-an-experiment-and-a-survey/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
			<enclosure url="http://cloudofdata.com/podpress_trac/feed/1786/0/20120203-RichardWallis.mp3" length="30953279" type="audio/mpeg" />
		<itunes:duration>0:32:14</itunes:duration>
		<itunes:subtitle>Richard Wallis left Talis (my former employer) last month, and has set up as a consultant at DataLiberate. In this short podcast, Richard shares some of his thoughts on data, semantics, and &#8216;the power of the link.&#8217;
Our conversation is al[...]</itunes:subtitle>
		<itunes:summary>Richard Wallis left Talis (my former employer) last month, and has set up as a consultant at DataLiberate. In this short podcast, Richard shares some of his thoughts on data, semantics, and &#8216;the power of the link.&#8217;
Our conversation is also an excuse for an experiment. I have been producing audio-only podcasts here and elsewhere for a number of years, but have always tended to avoid producing video. It&#8217;s more effort, it requires more bandwidth at both ends of the conversation, and I&#8217;ve never really been convinced that it adds very much to a conversation between two people. Anecdotal evidence would also suggest that my current podcasts are consumed in environments where video would not work; washing dishes, walking dogs, and sitting on buses.
However, rather than just continue to presume that my biases are correct, I&#8217;ve decided to give video a try. Richard kindly agreed to participate, and the result is available on YouTube and embedded here.
 
An audio-only version is also available for download if you prefer. The introductory remarks in this version are slightly different to those on the video, as they come straight from the original conversation.

It&#8217;s perhaps unfair to draw too many conclusions from this first attempt, but a few things are immediately apparent. The whole process takes an awful lot longer. The files are larger, so processing and uploading times increase 2-3 fold. Uploading a separate audio file also takes a bit of time. Simply dumping the Skype recording into iMovie worked just fine&#8230; but I&#8217;ve (so far) not managed to find any way to balance the audio levels. Garageband lets me do this with my audio-only podcasts, but iMovie doesn&#8217;t seem to, so Richard&#8217;s side of the conversation comes across as quite a bit louder than mine.
Having done one, I&#8217;m still not convinced that the video adds anything to the conversation. But what do you think? If you&#8217;ve listened to any of my podcasts, please take a moment to complete the short survey over at SurveyMonkey. Your responses will help me to decide where to go next.
Many thanks.
</itunes:summary>
		<itunes:keywords>Administrivia, Podcast</itunes:keywords>
		<itunes:author>Paul Miller</itunes:author>
		<itunes:explicit>no</itunes:explicit>
		<itunes:block>no</itunes:block>
	</item>
		<item>
		<title>Data Market Chat: Chris Hathaway discusses AggData</title>
		<link>http://cloudofdata.com/2012/01/data-market-chat-chris-hathaway-discusses-aggdata/</link>
		<comments>http://cloudofdata.com/2012/01/data-market-chat-chris-hathaway-discusses-aggdata/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 15:00:39 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[data market chat]]></category>
		<category><![CDATA[data markets]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[AggData]]></category>
		<category><![CDATA[bestbuy]]></category>
		<category><![CDATA[chris hathaway]]></category>
		<category><![CDATA[GoodRelations]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[schema.org]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1728</guid>
		<description><![CDATA[Chris Hathaway sees basic location information scattered across the websites of hundreds — or thousands — of coffee shop chains, hotel groups, and fast food joints, but argues that it&#8217;s almost impossible to do anything more sophisticated with the data than find your closest Starbucks. His company, AggData, is attempting to fill what he sees as a [...]]]></description>
			<content:encoded><![CDATA[<div class="wp-caption alignright" style="width: 260px"><a href="http://www.crunchbase.com/company/aggdata"><img class="zemanta-img-inserted zemanta-img-configured" title="Image representing AggData as depicted in Crun..." src="http://cloudofdata.com/wp-content/uploads/2011/07/25946v2-max-250x25014.jpg" alt="Image representing AggData as depicted in Crun..." width="250" height="60" /></a><p class="wp-caption-text">Image via CrunchBase</p></div>
<p><a href="http://www.linkedin.com/in/chrishathaway">Chris Hathaway</a> sees basic location information scattered across the websites of hundreds — or thousands — of coffee shop chains, hotel groups, and fast food joints, but argues that it&#8217;s almost impossible to do anything more sophisticated with the data than find your closest Starbucks. His company, <a href="http://www.aggdata.com/">AggData</a>, is attempting to fill what he sees as a gap in the market; scraping addresses and other facts off company websites to create simple files of store locations that can then be enriched with coordinate data and sold.</p>
<p>Customers for this data include competitors, market researchers, consultants, and even the companies themselves; as is so often the case, it can be easier to buy data on store locations from a third party than to find the authoritative sources within your own organisation. AggData is strongest in the US today, but also offers a growing body of data for other countries. Although the data files are structurally simple, Chris sees plenty of opportunity to continue collecting and selling data to a growing community of customers.</p>
<p>Unlike Factual, <a href="http://cloudofdata.com/2012/01/data-market-chat-tyler-bell-discusses-factual/">which was the focus of last week&#8217;s podcast</a>, AggData is not currently interested in combining data from different sources. Customers download separate files on the locations of Starbucks, Peets and Tim Hortons, and not a single aggregated set of coffee shop locations. The AggData model is also predicated upon using their own scripts to extract data from third party sites; asked if he would accept a file of WalMart store locations supplied by WalMart, Hathaway explained why he would — and does — decline.</p>
<p>Have a listen to learn more about AggData, and to hear Chris&#8217; perspectives on the potential role of semantic technologies in making his job easier. And <a href="http://cloudofdata.com/category/podcast/data-market-chat/">check back on Thursday for the next podcast</a> in the series; <a href="http://is.linkedin.com/in/hjalli">Hjálmar Gíslason</a> of <a href="http://datamarket.com">DataMarket.com</a>.</p>
<p></p>
<p><em>Following up on <a href="http://cloudofdata.com/2012/01/nurturing-the-market-for-data-markets/">a blog post that I wrote at the start of 2012</a>, this is the second in a series of podcasts with key stakeholders in the emerging category of Data Markets. Other conversations, <a href="http://cloudofdata.com/category/podcast/data-market-chat/">all of which will be published here</a>, have been scheduled with BuzzData, DataMarket.com, Factual, Infochimps, Kasabi, and Microsoft. I am still adding conversations to the series, and intend to talk with more companies and with analysts and investors with insight to share. </em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2012/01/nurturing-the-market-for-data-markets/">Nurturing the market for Data Markets</a> (cloudofdata.com)</li>
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2012/01/data-market-chat-the-podcasts-are-a-coming/">Data Market Chat: the podcasts are a-coming&#8230;</a> (cloudofdata.com)</li>
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2012/01/data-market-chat-tyler-bell-discusses-factual/">Data Market Chat: Tyler Bell discusses Factual</a> (cloudofdata.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=877a78ef-6af5-4509-8430-791904d80ba8" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2012/01/data-market-chat-chris-hathaway-discusses-aggdata/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
			<enclosure url="http://cloudofdata.com/podpress_trac/feed/1728/0/20120118-ChrisHathaway.mp3" length="25187838" type="audio/mpeg" />
		<itunes:duration>0:52:23</itunes:duration>
		<itunes:subtitle>Image via CrunchBase
Chris Hathaway sees basic location information scattered across the websites of hundreds — or thousands — of coffee shop chains, hotel groups, and fast food joints, but argues that it&#8217;s almost impossible to do anything mor[...]</itunes:subtitle>
		<itunes:summary>Image via CrunchBase
Chris Hathaway sees basic location information scattered across the websites of hundreds — or thousands — of coffee shop chains, hotel groups, and fast food joints, but argues that it&#8217;s almost impossible to do anything more sophisticated with the data than find your closest Starbucks. His company, AggData, is attempting to fill what he sees as a gap in the market; scraping addresses and other facts off company websites to create simple files of store locations that can then be enriched with coordinate data and sold.
Customers for this data include competitors, market researchers, consultants, and even the companies themselves; as is so often the case, it can be easier to buy data on store locations from a third party than to find the authoritative sources within your own organisation. AggData is strongest in the US today, but also offers a growing body of data for other countries. Although the data files are structurally simple, Chris sees plenty of opportunity to continue collecting and selling data to a growing community of customers.
Unlike Factual, which was the focus of last week&#8217;s podcast, AggData is not currently interested in combining data from different sources. Customers download separate files on the locations of Starbucks, Peets and Tim Hortons, and not a single aggregated set of coffee shop locations. The AggData model is also predicated upon using their own scripts to extract data from third party sites; asked if he would accept a file of WalMart store locations supplied by WalMart, Hathaway explained why he would — and does — decline.
Have a listen to learn more about AggData, and to hear Chris&#8217; perspectives on the potential role of semantic technologies in making his job easier. And check back on Thursday for the next podcast in the series; Hjálmar Gíslason of DataMarket.com.

Following up on a blog post that I wrote at the start of 2012, this is the second in a series of podcasts with key stakeholders in the emerging category of Data Markets. Other conversations, all of which will be published here, have been scheduled with BuzzData, DataMarket.com, Factual, Infochimps, Kasabi, and Microsoft. I am still adding conversations to the series, and intend to talk with more companies and with analysts and investors with insight to share. 
Related articles

Nurturing the market for Data Markets (cloudofdata.com)
Data Market Chat: the podcasts are a-coming&#8230; (cloudofdata.com)
Data Market Chat: Tyler Bell discusses Factual (cloudofdata.com)

</itunes:summary>
		<itunes:keywords>Podcast</itunes:keywords>
		<itunes:author>Paul Miller</itunes:author>
		<itunes:explicit>no</itunes:explicit>
		<itunes:block>no</itunes:block>
	</item>
		<item>
		<title>Top Level Domain for data answers the wrong question</title>
		<link>http://cloudofdata.com/2012/01/top-level-domain-for-data-answers-the-wrong-question/</link>
		<comments>http://cloudofdata.com/2012/01/top-level-domain-for-data-answers-the-wrong-question/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 14:41:35 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[content negotiation]]></category>
		<category><![CDATA[Cybersquatting]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[data publishing]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[Data sharing]]></category>
		<category><![CDATA[Data Web]]></category>
		<category><![CDATA[Domain name]]></category>
		<category><![CDATA[Domain Name System]]></category>
		<category><![CDATA[ICANN]]></category>
		<category><![CDATA[Open University]]></category>
		<category><![CDATA[Southampton University]]></category>
		<category><![CDATA[Stephen Wolfram]]></category>
		<category><![CDATA[TLD]]></category>
		<category><![CDATA[Top-level domain]]></category>
		<category><![CDATA[Wolfram Research]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1640</guid>
		<description><![CDATA[British-born computer scientist Stephen Wolfram sees ongoing efforts to extend the Internet&#8217;s top-level domains (TLDs) beyond the familiar .com, .org, .uk etc as an opportunity to raise the profile of machine-readable data. In a blog post published yesterday, he argues that a new .data domain would increase &#8220;exposure of data on the internet—and [provide] added impetus for [...]]]></description>
			<content:encoded><![CDATA[<div class="wp-caption alignright" style="width: 310px"><a href="http://commons.wikipedia.org/wiki/File:Stephen_Wolfram_PR.jpg"><img class="zemanta-img-inserted zemanta-img-configured" title="English: Publicity photo of en:Stephen Wolfram." src="http://cloudofdata.com/wp-content/uploads/2011/07/300px-Stephen_Wolfram_PR2.jpg" alt="English: Publicity photo of en:Stephen Wolfram." width="300" height="428" /></a><p class="wp-caption-text">Image of Stephen Wolfram via Wikipedia</p></div>
<p>British-born computer scientist <a class="zem_slink" title="Stephen Wolfram" href="http://en.wikipedia.org/wiki/Stephen_Wolfram" rel="wikipedia">Stephen Wolfram</a> sees ongoing efforts to extend the Internet&#8217;s top-level domains (<a class="zem_slink" title="Top-level domain" href="http://en.wikipedia.org/wiki/Top-level_domain" rel="wikipedia">TLDs</a>) beyond the familiar .com, .org, .uk etc as an opportunity to raise the profile of machine-readable data. <a href="http://blog.stephenwolfram.com/2012/01/a-data-top-level-internet-domain/">In a blog post published yesterday</a>, he argues that a new .data domain would increase &#8220;exposure of data on the internet—and [provide] added impetus for organizations to expose data in a way that can efficiently be found and accessed.&#8221; Whilst wholly in favour of Wolfram&#8217;s stated aim, I can&#8217;t help feeling that his suggested solution is at best unnecessary and at worst a worrying segregration of data from the &#8216;proper&#8217; web that everyone else will continue to exploit.</p>
<p>Back in June of last year, the body responsible for coordinating the global domain name system <a href="http://arstechnica.com/business/news/2011/06/icann-approves-plan-to-vastly-expand-top-level-domains.ars">approved a plan to permit new top-level domains</a> (the letters after the final dot in an internet address — the .com in cloudofdata.<strong>com</strong>, the .uk in bbc.co.<strong>uk</strong>, the .edu in harvard.<strong>edu</strong>). Until recently, these top-level domains have been tightly controlled, with a small set of generic domains (<a class="zem_slink" title=".edu" href="http://en.wikipedia.org/wiki/.edu" rel="wikipedia">.edu</a>, <a class="zem_slink" title=".gov" href="http://en.wikipedia.org/wiki/.gov" rel="wikipedia">.gov</a>, <a class="zem_slink" title=".mil" href="http://en.wikipedia.org/wiki/.mil" rel="wikipedia">.mil</a>, <a href="http://en.wikipedia.org/wiki/.org">.org</a>, etc), a larger set of country domains (<a href="http://en.wikipedia.org/wiki/.uk">.uk</a>, <a href="http://en.wikipedia.org/wiki/.fi">.fi</a>, <a href="http://en.wikipedia.org/wiki/.nz">.nz</a>, etc) and one or two others such as <a href="http://en.wikipedia.org/wiki/.eu">.eu</a>. <a href="http://www.wired.com/epicenter/2012/01/icann-pushes-ahead-with-january-12-launch-for-new-top-level-domains/">From tomorrow</a>, anyone with $185,000 will be able to submit a proposal to create and manage a new top level domain, and it&#8217;s possible that there could eventually be <em>thousands</em> of them. Wolfram is keen to ensure that data doesn&#8217;t miss out on the &#8216;opportunity.&#8217;</p>
<p>As Wolfram himself recognises, there is already an awful lot of machine-readable data on the web. Some of it sits embedded within the web pages that humans read, with specially formatted code waiting to be triggered by the calendars, the address books, or the browser plugins of site visitors. Some of it is packaged up in data files, offered for download. And some of it waits inside a database, ready to be delivered in response to an API call or a query typed into a web form.</p>
<p>There is a growing enthusiasm for exposing this data for reuse. Government transparency agendas have driven public sector data sites like <a href="http://data.gov.uk">data.gov.uk</a> and <a href="http://data.gov/">data.gov</a>. Similarly, efforts such as <a href="http://data.open.ac.uk/">data.open.ac.uk</a> and <a href="http://data.southampton.ac.uk">data.southampton.ac.uk</a> see universities beginning to consciously collect data sets together and offer them up for reuse. Similar efforts in the commercial world are less easy to point to, but that reticence has nothing whatsoever to do with the lack of a ford.data, boeing.data, ge.data or astrazeneca.data domain!</p>
<p>In some ways, the convention for gathering significant chunks of data on a data.xxx.yyy site echoes Wolfram&#8217;s intention, but with a number of advantages. Data without context is far less valuable than data with context. Much of that context may be inferred from the domain in which the data lives, with data delivered from a .gov or .edu (or .gov.uk or .ac.uk) site perhaps interpreted differently to data hosted on .com, .biz, or .xxx. Southampton University, the Open University, and the US Federal Government are able to gather data up and make it available for download via their existing data. sites if they choose. This offers human visitors to their sites a degree of convenience, whilst retaining the power and brand attributes of their existing domain. Gov.data, gov.uk.data, open.ac.uk.data, southampton.ac.uk.data, though? All are messy, in ways that Wolfram&#8217;s own wolfram.data would admittedly not be, and all are simply additional registrations that the institutions would have to pay for in order to stop someone else grabbing the domain.</p>
<p>At the end of the day, the machines don&#8217;t actually care. The existing data.open.ac.uk-type sites are human conveniences, not machine enablers. The computers, and the software they run, are quite capable of crawling the public web and finding accessible data wherever it lies on a site. There are plenty of reasons to continue embedding little snippets of data inside human readable web pages, regardless of whether you have a data.wolfram.com or a wolfram.data site. <a href="http://en.wikipedia.org/wiki/Content_negotiation">Content negotiation</a> is becoming increasingly capable, such that there really is no need for what Wolfram calls a &#8216;parallel construct to the ordinary web&#8217; at all. A human being arriving at a web site sees human readable content, whilst various software tools would <a href="http://www.w3.org/TR/cooluris/#implementation">automatically</a> be presented with very different data or functions, optimised to their capabilities and requirements.</p>
<p>By all means, let us show the curious some of the existing techniques that work in making data more easily accessible. By all means, let us identify the gaps, the issues, the problems (<em>none</em> of which a new TLD even begins to address). Yes, let us definitely and unambiguously set about &#8220;highlighting the exposure of data on the internet—and providing added impetus for organizations to expose data in a way that can efficiently be found and accessed.&#8221;</p>
<p>But please, let us not be distracted by the false hope that adding yet another TLD to the babel that ICANN is about to unleash can do anything more than consign data to some online ghetto, wallowing unwanted, unloved and unused as companies and their customers lavish love, attention, and clicks upon the .com domain over on the &#8216;proper&#8217; web.</p>
<p><em>Thanks to <a href="http://www.eurecom.fr/~troncy/">Raphaël Troncy</a>, whose <a href="https://twitter.com/rtroncy/status/156850031670988800">tweet</a> first drew the story to my attention.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2012/01/10/computers-data-domains/">Is It Time For Computers To Have Their Own .Data Domains?</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.wired.com/epicenter/2012/01/icann-pushes-ahead-with-january-12-launch-for-new-top-level-domains/">ICANN Pushes Ahead With January 12 Launch For New Top-Level Domains</a> (wired.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.wired.com/epicenter/2012/01/icaan-president-beckstrom/all/1">The biggest change in DNS since Dot-Com</a> (wired.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=7fe922ae-4cad-445c-9f64-1df043b7dd90" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2012/01/top-level-domain-for-data-answers-the-wrong-question/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>June is San Francisco month</title>
		<link>http://cloudofdata.com/2011/05/june-is-san-francisco-month/</link>
		<comments>http://cloudofdata.com/2011/05/june-is-san-francisco-month/#comments</comments>
		<pubDate>Sun, 08 May 2011 17:48:46 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[GigaOM]]></category>
		<category><![CDATA[Om Malik]]></category>
		<category><![CDATA[San Francisco]]></category>
		<category><![CDATA[Semantic Technology Conference]]></category>
		<category><![CDATA[semanticconf]]></category>
		<category><![CDATA[SemTech]]></category>
		<category><![CDATA[Structure 2011]]></category>
		<category><![CDATA[structureconf]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1551</guid>
		<description><![CDATA[For real-world applications of Linked Data and the Semantic Web, the long-running Semantic Technology Conference is hard to beat. For getting a real handle on the Cloud Computing landscape, GigaOM&#8216;s Structure Conference is also a leading light. Working across both areas as I do, these events tend to figure prominently in my calendar for the [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div class="wp-caption alignright" style="width: 310px"><a href="http://commons.wikipedia.org/wiki/File:BayareaUSGS.jpg"><img title="USGS Satellite photo of the San Francisco Bay ..." src="http://upload.wikimedia.org/wikipedia/commons/thumb/5/5c/BayareaUSGS.jpg/300px-BayareaUSGS.jpg" alt="USGS Satellite photo of the San Francisco Bay ..." width="300" height="311" /></a><p class="wp-caption-text">Image via Wikipedia</p></div>
</div>
<p>For real-world applications of Linked Data and the Semantic Web, <a href="http://semtech2011.semanticweb.com/">the long-running Semantic Technology Conference</a> is hard to beat. For getting a real handle on the Cloud Computing landscape, <a href="http://event.gigaom.com/">GigaOM</a>&#8216;s <a href="http://event.gigaom.com/structure/">Structure Conference</a> is also a leading light.</p>
<p>Working across both areas as I do, these events tend to figure prominently in my calendar for the year<a href="#disclosure">*</a>. Last year, both took place in San Francisco during the same week. I tried to attend both, and therefore succeeded in spending most of my week in cabs, shuttling between meetings at the two venues. I saw very few sessions that I wasn&#8217;t personally involved in, and <a href="http://cloudofdata.com/2010/07/a-tale-of-two-conferences/">the experience wasn&#8217;t a huge success</a>.</p>
<p>This year the conference organisers have taken pity on me, and moved their events to opposite ends of June. <a href="http://semtech2011.semanticweb.com/">The Semantic Technology Conference</a> is up first, back at the Hilton Union Square from 5-9 June. <a href="http://event.gigaom.com/structure/">Structure</a> follows, returning to the Mission Bay Conference Centre on 22 and 23 June.</p>
<p>I&#8217;ll be attending both, and probably doing various official things during each event. At the moment, the only thing we&#8217;ve definitely nailed down is a special live appearance by <a href="http://semanticweb.com/category/the-semantic-link">The Semantic Link crew</a> on <a href="http://semtech2011.semanticweb.com/sessionPop.cfm?confid=62&amp;proposalid=4338">the evening of 5 June</a>; we&#8217;ll be taking a look at the highlights expected for the conference, offering some tips for those new to the event and its multitude of parallel sessions, and generally bringing our usual podcast chatter to the stage.</p>
<p>So if you&#8217;re in town around the time of either event, <a href="http://cloudofdata.com/contact/">do get in touch</a>; it promises to be an interesting month.</p>
<p><a name="disclosure">*</a> <em>Disclosure: <a href="http://www.webmediabrands.com/">WebMediaBrands</a> pay me to host <a href="http://semanticweb.com/category/the-semantic-link">the monthly Semantic Link podcast</a>, and to <a href="http://semanticweb.com/category/paulmiller">write a monthly column</a> on <a href="http://semanticweb.com">SemanticWeb.com</a>. <a href="http://gigaom.com/about/">GigaOM</a> pay me to curate the <a href="http://pro.gigaom.com/topic/infrastructure/">Infrastructure/ Cloud Computing channel</a> on their <a href="http://pro.gigaom.com/">Pro site</a>. I attended and participated in both of these events before that was the case, and still would today without the contractual relationship.</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.datacenterknowledge.com/archives/2011/04/06/gigaom-structure-2011/">GigaOM Structure 2011</a> (datacenterknowledge.com)</li>
<li class="zemanta-article-ul-li"><a href="http://fakeiitian.com/event-2/semantic-technology-conference/">Semantic Technology Conference</a> (fakeiitian.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=043dfe9c-e827-48fe-8819-d54993b7502c" alt="" /></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2011/05/june-is-san-francisco-month/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>February&#8217;s Semantic Link Podcast Discusses Marketing the Semantic Web</title>
		<link>http://cloudofdata.com/2011/02/februarys-semantic-link-podcast-discusses-marketing-the-semantic-web/</link>
		<comments>http://cloudofdata.com/2011/02/februarys-semantic-link-podcast-discusses-marketing-the-semantic-web/#comments</comments>
		<pubDate>Wed, 16 Feb 2011 11:19:15 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Andraz Tori]]></category>
		<category><![CDATA[Bernadette Hyland]]></category>
		<category><![CDATA[Christine Connors]]></category>
		<category><![CDATA[Eric Franzon]]></category>
		<category><![CDATA[Eric Hoffer]]></category>
		<category><![CDATA[Ivan Herman]]></category>
		<category><![CDATA[Krista Thomas]]></category>
		<category><![CDATA[Paul Miller]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[Scott Brinker]]></category>
		<category><![CDATA[Semantic Link]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1509</guid>
		<description><![CDATA[February&#8217;s episode of the Semantic Link podcast is now online at SemanticWeb.com. During the show, regulars Christine Connors, Eric Franzon, Ivan Herman, Eric Hoffer, Bernadette Hyland and Andraz Tori are joined by two special guests with some experience in both marketing and semantic technologies. Krista Thomas was responsible for marketing at Thomson Reuters&#8216; semantic technology [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-1390" title="The Semantic Link podcast" src="http://cloudofdata.com/wp-content/uploads/2010/12/semanticlink.gif" alt="" width="200" height="200" /><a href="http://semanticweb.com/the-semantic-link-episode-3-february-11-2011_b17728">February&#8217;s episode</a> of the <a href="http://semanticweb.com/category/the-semantic-link">Semantic Link podcast</a> is now online at SemanticWeb.com. During the show, regulars Christine Connors, Eric Franzon, Ivan Herman, Eric Hoffer, Bernadette Hyland and Andraz Tori are joined by two special guests with some experience in both marketing and semantic technologies.</p>
<p>Krista Thomas was responsible for marketing at <a class="zem_slink" title="Reuters" rel="homepage" href="http://reuters.com">Thomson Reuters</a>&#8216; semantic technology success story, <a class="zem_slink" title="OpenCalais" rel="homepage" href="http://www.opencalais.com">Open Calais</a>, and is now VP Marketing at Los Angeles startup <a href="http://ad.ly/">Ad.ly</a>.</p>
<p>Scott Brinker is President &amp; CTO at <a href="http://www.ioninteractive.com/">ion interactive</a>, and writes the <a href="http://www.chiefmartec.com/">ChiefMarTec</a> blog.</p>
<p>During the conversation, regulars and guests discuss the way in which &#8216;semantic technology&#8217; solutions are being successfully marketed to potential beneficiaries.</p>
<p><a href="http://semanticweb.com/the-semantic-link-episode-3-february-11-2011_b17728">Have a listen</a>, and learn that pitches to non-semantic technology enthusiasts beginning &#8220;We&#8217;ll convert all your data to RDF, leverage the Semantic Web, and construct an OWL-based ontology&#8221; are probably doomed to failure.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=71a75476-8080-4d40-acdd-a11f68e3e84b" alt="" /><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2011/02/februarys-semantic-link-podcast-discusses-marketing-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Episode 2 of The Semantic Link podcast discusses Drupal and more</title>
		<link>http://cloudofdata.com/2011/01/episode-2-of-the-semantic-link-podcast-discusses-drupal-and-more/</link>
		<comments>http://cloudofdata.com/2011/01/episode-2-of-the-semantic-link-podcast-discusses-drupal-and-more/#comments</comments>
		<pubDate>Tue, 18 Jan 2011 21:41:58 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Andraz Tori]]></category>
		<category><![CDATA[Bernadette Hyland]]></category>
		<category><![CDATA[Christine Connors]]></category>
		<category><![CDATA[Content management system]]></category>
		<category><![CDATA[Drupal]]></category>
		<category><![CDATA[Eric Franzon]]></category>
		<category><![CDATA[Eric Hoffer]]></category>
		<category><![CDATA[Ivan Herman]]></category>
		<category><![CDATA[Peter Brown]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[RDFa]]></category>
		<category><![CDATA[Semantic Technology]]></category>
		<category><![CDATA[SemanticLink]]></category>
		<category><![CDATA[thesemanticlink]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1416</guid>
		<description><![CDATA[Episode 2 of our new Semantic Link podcast went up on SemanticWeb.com this evening, and it&#8217;s another good one. Not that I&#8217;m biased or anything. The whole team is present once more, and we start the show discussing the implications of Drupal 7 and its newly formalised RDFa-publishing capabilities. Unlike regular semantic technology solutions, which [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-1390" title="The Semantic Link podcast" src="http://cloudofdata.com/wp-content/uploads/2010/12/semanticlink.gif" alt="" width="140" height="140" /><a href="http://semanticweb.com/the-semantic-link-episode-2-january-2011_b17338">Episode 2</a> of our new Semantic Link podcast went up on <a href="http://semanticweb.com/">SemanticWeb.com</a> this evening, and it&#8217;s another good one. Not that I&#8217;m biased or anything.</p>
<p>The whole team is present once more, and we start the show discussing the implications of <a class="zem_slink" title="Drupal" rel="homepage" href="http://www.drupal.org">Drupal</a> 7 and its newly formalised <a class="zem_slink" title="RDFa" rel="wikipedia" href="http://en.wikipedia.org/wiki/RDFa">RDFa</a>-publishing capabilities. Unlike regular semantic technology solutions, which someone has to consciously procure <em>as a Semantic Technology solution</em>, Drupal is first and foremost a (popular, free) <a class="zem_slink" title="Content management system" rel="wikipedia" href="http://en.wikipedia.org/wiki/Content_management_system">Content Management System</a>; the semantic smarts come for free, and therefore reach a massive new audience.</p>
<p>From there, we move into a broader discussion of the ways in which semantic projects take root within organisations.</p>
<p><a href="http://semanticweb.com/the-semantic-link-episode-2-january-2011_b17338">Have a listen</a>, and let us know what you think. As ever, suggestions for future topics or guests are always welcome.</p>
<p><a href="http://www.twitter.com/thesemanticlink">@theSemanticLink</a> is the show&#8217;s Twitter id, which we will use to invite questions ahead of shows on particular topics. If you&#8217;d like to ask questions, comment on shows, hear about new episodes as they&#8217;re released, etc, please do sign up to follow.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2010/12/the-semantic-link-is-open/">The Semantic Link is open</a> (cloudofdata.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.nytimes.com/external/readwriteweb/2011/01/05/05readwriteweb-drupal-7-released-with-improved-ui-and-sema-32028.html">Drupal 7 Released, With Improved UI and Semantic Technology</a> (nytimes.com)</li>
<li class="zemanta-article-ul-li"><a href="http://radar.oreilly.com/2011/01/house-drupal.html">House.gov moves to Drupal</a> (radar.oreilly.com)</li>
<li class="zemanta-article-ul-li"><a href="http://semanticweb.com/dries-buytaert-explains-how-drupal-gardens-saas-can-contribute-to-semantic-web-momentum_b674">Dries Buytaert Explains How Drupal Gardens SaaS Can Contribute to Semantic Web Momentum &#8211; semanticweb.com</a> (semanticweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://go.theregister.com/feed/www.theregister.co.uk/2011/01/07/drupal_7_released/">Drupal 7 dives into machine-readable web</a> (go.theregister.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=e9cf09bf-44e6-4b94-9e69-348f46b7d774" alt="" /><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2011/01/episode-2-of-the-semantic-link-podcast-discusses-drupal-and-more/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Semantic Link is open</title>
		<link>http://cloudofdata.com/2010/12/the-semantic-link-is-open/</link>
		<comments>http://cloudofdata.com/2010/12/the-semantic-link-is-open/#comments</comments>
		<pubDate>Thu, 09 Dec 2010 18:48:12 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Andraz Tori]]></category>
		<category><![CDATA[Bernadette Hyland]]></category>
		<category><![CDATA[Christine Connors]]></category>
		<category><![CDATA[Eric Franzon]]></category>
		<category><![CDATA[Eric Hoffer]]></category>
		<category><![CDATA[Ivan Herman]]></category>
		<category><![CDATA[Peter Brown]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[SemanticLink]]></category>
		<category><![CDATA[thesemanticlink]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1389</guid>
		<description><![CDATA[At the end of last month, I wrote about the new Semantic Link podcast that I&#8217;m involved with for SemanticWeb.com. We recorded the first episode earlier this week and it&#8217;s now online, along with an introduction to the series from myself. Please do have a listen, and let me know what you think whilst I [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://semanticweb.com/the-semantic-link-episode-1-december-2010_b16991" target="_blank"><img class="alignright size-full wp-image-1390" style="margin: 5px;" title="The Semantic Link podcast" src="http://cloudofdata.com/wp-content/uploads/2010/12/semanticlink.gif" alt="" width="200" height="200" /></a>At the end of last month, <a href="http://cloudofdata.com/2010/11/looking-forward-to-the-semantic-link-podcast-on-semanticweb-com/">I wrote about the new Semantic Link podcast</a> that I&#8217;m involved with for SemanticWeb.com.</p>
<p>We recorded the first episode earlier this week and <a href="http://semanticweb.com/the-semantic-link-episode-1-december-2010_b16991">it&#8217;s now online</a>, along with <a href="http://semanticweb.com/welcome-to-the-podpanel_b16974">an introduction to the series</a> from myself.</p>
<p>Please do <a href="https://semanticweb.com/the-semantic-link-episode-1-december-2010_b16991">have a listen</a>, and let me know what you think whilst I work out how to automate adding these to my sidebar on this site&#8230;</p>
<p>I look forward to lining up some great conversations &#8211; and guests &#8211; in the year ahead. We will record and publish regularly each month. Where we can agree a topic well ahead of the recording date, I&#8217;ll let you know via the <a href="http://twitter.com/thesemanticlink">@TheSemanticLink</a> twitter account, and invite questions for me to throw at both regulars and guests.</p>
<p>If there are particular topics you&#8217;d like to see covered, particular guests you&#8217;d like us to secure, or other comments you&#8217;d like to make, <a href="http://cloudofdata.com/contact">please do let me know</a>&#8230;</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/archives/top_10_semantic_web_products_of_2010.php">Top 10 Semantic Web Products of 2010</a> (readwriteweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://cloudofdata.com/2010/11/looking-forward-to-the-semantic-link-podcast-on-semanticweb-com/">Looking forward to The Semantic Link podcast on SemanticWeb.com</a> (cloudofdata.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=f1877782-f3e2-4f5f-9ad3-343c10a6e3bc" alt="" /><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/12/the-semantic-link-is-open/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Is there a disconnect between Big Data and the Web of Data ?</title>
		<link>http://cloudofdata.com/2010/11/is-there-a-disconnect-between-big-data-and-the-web-of-data/</link>
		<comments>http://cloudofdata.com/2010/11/is-there-a-disconnect-between-big-data-and-the-web-of-data/#comments</comments>
		<pubDate>Tue, 30 Nov 2010 16:36:08 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Enterprise Computing]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[BigData]]></category>
		<category><![CDATA[Defrag]]></category>
		<category><![CDATA[Glue]]></category>
		<category><![CDATA[LinkedData]]></category>
		<category><![CDATA[OpenData]]></category>
		<category><![CDATA[strataconf]]></category>
		<category><![CDATA[structureconf]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1308</guid>
		<description><![CDATA[Image via Wikipedia &#8216;Big Data&#8216; is currently capturing the imagination, attracting hype, investment and ambitious startups in almost equal measure. Kim and Eric Norlin&#8217;s excellent Defrag and Glue events have gained big-name company, with O&#8217;Reilly&#8216;s Strata and GigaOM&#8216;s Structure both set to arrive in the first quarter of 2011. Venture firms like IA Ventures have emerged, specifically [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 310px;">
<dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/File:WorldWideWebAroundWikipedia.png"><img title="A data visualization of Wikipedia as part of t..." src="http://upload.wikimedia.org/wikipedia/commons/thumb/b/b9/WorldWideWebAroundWikipedia.png/300px-WorldWideWebAroundWikipedia.png" alt="A data visualization of Wikipedia as part of t..." width="300" height="216" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://commons.wikipedia.org/wiki/File:WorldWideWebAroundWikipedia.png">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>&#8216;<a class="zem_slink" title="Big data" rel="wikipedia" href="http://en.wikipedia.org/wiki/Big_data">Big Data</a>&#8216; is currently capturing the imagination, attracting hype, investment and ambitious startups in almost equal measure. Kim and Eric Norlin&#8217;s excellent <a href="http://www.defragcon.com/">Defrag</a> and <a href="http://www.gluecon.com/">Glue</a> events have gained big-name company, with <a href="http://conferences.oreillynet.com/">O&#8217;Reilly</a>&#8216;s <a href="http://strataconf.com/strata2011">Strata</a> and <a href="http://gigaom.com/events/">GigaOM</a>&#8216;s <a href="http://gigaom.com/bigdata/">Structure</a> both set to arrive in the first quarter of 2011. Venture firms like <a href="http://www.iaventurepartners.com/">IA Ventures</a> have emerged, specifically targeted at finding, funding, and profiting from the <em>big</em> Big Data idea. Giants of the web from <a class="zem_slink" title="Yahoo!" rel="homepage" href="http://www.yahoo.com">Yahoo!</a> and <a href="http://www.amazon.com/">Amazon</a> to <a class="zem_slink" title="Twitter" rel="homepage" href="http://twitter.com">Twitter</a> and <a class="zem_slink" title="Facebook" rel="homepage" href="http://facebook.com">Facebook</a> solve their own Big Data problems in very different ways, contributing valuable code and experience to the community whilst simultaneously diluting focus and adding to the cacophony.</p>
<p>Flippantly reckoned by many to be &#8216;anything that requires more than a single machine to run,&#8217; the Big Data reality remains somewhat harder to pin down. To those seeking routine business insight, that mammoth Excel spreadsheet they laboriously query overnight at the end of each month might quite justifiably be thought of as &#8216;Big.&#8217; At the other end of the scale, data wizards scorn anything that doesn&#8217;t require a room full of servers, a mountain of empty pizza boxes, and the careful construction of a bespoke data ingest, management and querying system atop the most bare-bones version of the Linux kernel they can find. Somewhere between the two, a growing mass of cheaply gathered data holds out the promise of invaluable insight. Remote sensors, web clickstreams, social graph interactions, purchaser (and non-purchaser) behaviours. All these, and more, have much to tell planners, builders, makers, sellers, and buyers. If only we could formulate the right questions. If only we could devise the right sampling strategies. If only we had big enough machines to ask lots of questions using lots of sampling strategies. If only we had big enough machines to not bother sampling at all.</p>
<p>On the hardware side of things, even humble domestic laptops typically ship with at least two cores these days; two separate little computers ready to do the data processor&#8217;s bidding. Four, eight, sixteen and more cores are not far behind, but mainstream software products typically fail to exploit anything more than a single core. Push Excel as hard as you like, and it won&#8217;t do more than take <em>one</em> of your computer&#8217;s multiple cores to the max. On that 12-core Mac Pro you persuaded the boss to buy, only one core will be hard at work on your data. Twitter, Mail, YouTube, and ripping DVDs  will each be giving other cores a little light exercise whilst others sit idly by, waiting for the arrival of operating systems and applications capable of exploiting multi-core power. The same is true as jobs grow and move to run across multiple machines, whether under your desk, in your data centre, or out in the Cloud. Those big datasets need to be carved up and shared amongst the available computers before any analysis takes place. You&#8217;re typically not accessing a &#8216;big computer in the Cloud&#8217; at all&#8230; but lots of relatively small (commodity) computers, and it takes careful planning and smart software to manage the division and recombination of those jobs in a cost-effective manner. Projects such as <a href="http://db.cs.berkeley.edu/jmh/">Joseph Hellerstein</a>&#8216;s Berkeley Orders of Magnitude (<a href="http://boom.cs.berkeley.edu/">BOOM</a>) begin to demonstrate some of the potential for working natively with multiple processors, but there&#8217;s a long way to go before those advances reach the mainstream.</p>
<p><a href="http://en.wikipedia.org/wiki/Hadoop">Hadoop</a>, <a href="http://en.wikipedia.org/wiki/Apache_Cassandra">Cassandra</a>, <a class="zem_slink" title="MapReduce" rel="wikipedia" href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a>, <a href="http://en.wikipedia.org/wiki/Dynamo_(storage_system)">Dynamo</a>, <a href="http://en.wikipedia.org/wiki/Project_Voldemort#SNA_LinkedIn">Voldemort</a>. These, and more, are solutions developed by the likes of Yahoo!, Facebook, Google, Amazon and <a class="zem_slink" title="LinkedIn" rel="homepage" href="http://www.linkedin.com">LinkedIn</a> to tackle the influx of data that each faced &#8211; and for which each had failed to find an existing solution. Hadoop, with the addition of <a href="http://www.cloudera.com/">Cloudera</a>&#8216;s commercial polish, is rapidly emerging as the front runner for an off the shelf Big Data solution, but all of these tools remain rather narrow in their abilities. Find the type of data or the nature of query for which each of these was built and its performance will be unbeatable, but we are a very long way from Big Data&#8217;s equivalent of the jack-of-all-trades SQL-powered relational database of old.</p>
<p>And there, for many enterprises, lies the problem. Useful Google searches require the crawler, index and UI to do a relatively small number of essentially similar tasks, very quickly, very cost-effectively, and at massive scale. Focus on that finite set of problems, and you build a solution that delivers the experience we&#8217;ve all come to know. Each type of data manipulation or analysis requires a different tool, differently optimised, with the inevitable result that a typically diverse organisation may require a plethora of Big Data tools to get their work done. Or they might just continue to muddle along with Oracle or <a class="zem_slink" title="MySQL" rel="homepage" href="http://www.mysql.com">mySQL</a>, churning inefficiently through their data analysis jobs for interminably long periods of time. These relational database tools are understood, they are mature, and they get the job done. Except in the most data-intensive industries, they have a market presence that will be difficult to disrupt.</p>
<p>The Big Data space is seeing remarkable innovation, but there is a long way to go in order to lift it out of the domain of the technically proficient specialist and place it on desktops across the organisation. As IA Ventures&#8217; Brad Gillespie notes, &#8220;Excel is where the world&#8217;s data lives&#8230; [and] Big Data has to get to that place&#8230; so that a CMO can leverage it directly.&#8221;</p>
<p>And in all of this fervent of innovation, to return to the title of the post, it strikes me that Big Data is becoming disconnected from the fabric of the web itself. Oh, much of the data certainly <em>comes</em> from the Web, and a lot of it might even be queried on the Web after processing. But, somewhere along the line, the <em>linkedness</em> of the Web has either been forgotten or ignored. That rich set of connections, interconnections and associations has been reduced to a table, an index, or a (large) set of key-value pairs. And in the process, something fundamental has gone away.</p>
<p>This is enough for now, though. Looking more closely at different Big Data approaches, and exploring the potential for re-introducing the Web must wait for future posts.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://venturebeat.com/2010/10/26/cloudera-raises-25m-to-help-deal-with-the-enterprise-data-deluge/">Cloudera raises $25M to help deal with the enterprise data deluge</a> (venturebeat.com)</li>
<li class="zemanta-article-ul-li"><a href="http://radar.oreilly.com/2010/10/strata-week-building-data-star.html">Strata Week: Building data startups</a> (radar.oreilly.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/cloud/2010/09/hadoop-and-a-critique-on-geek.php">Big Data and a Critique of Geek Culture</a> (readwriteweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.nytimes.com/external/gigaom/2010/10/30/30gigaom-big-data-and-nosql-march-to-the-enterprise-73963.html">Big Data and NoSQL March to the Enterprise</a> (nytimes.com)</li>
<li class="zemanta-article-ul-li"><a href="http://news.cnet.com/8301-21546_3-20023969-10253464.html?part=rss&amp;subj=news">Does &#8216;big data&#8217; equal big opportunity for storage vendors?</a> (news.cnet.com)</li>
<li class="zemanta-article-ul-li"><a href="http://blog.programmableweb.com/2010/11/29/new-york-times-event-shows-the-promise-of-big-data/">New York Times Event Shows the Promise of Big Data</a> (programmableweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.readwriteweb.com/enterprise/2010/11/executives-are-addicted-to-big.php">Overwhelmed Executives Still Crave Big Data, Says Survey</a> (readwriteweb.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=5578c8b2-c0db-4f2b-b846-1aac2b8adc42" alt="" /><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/11/is-there-a-disconnect-between-big-data-and-the-web-of-data/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Looking forward to The Semantic Link podcast on SemanticWeb.com</title>
		<link>http://cloudofdata.com/2010/11/looking-forward-to-the-semantic-link-podcast-on-semanticweb-com/</link>
		<comments>http://cloudofdata.com/2010/11/looking-forward-to-the-semantic-link-podcast-on-semanticweb-com/#comments</comments>
		<pubDate>Thu, 25 Nov 2010 12:21:25 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Andraz Tori]]></category>
		<category><![CDATA[Bernadette Hyland]]></category>
		<category><![CDATA[Christine Connors]]></category>
		<category><![CDATA[Eric Franzon]]></category>
		<category><![CDATA[Eric Hoffer]]></category>
		<category><![CDATA[Ivan Herman]]></category>
		<category><![CDATA[Peter Brown]]></category>
		<category><![CDATA[Podcast]]></category>
		<category><![CDATA[Semantic Link]]></category>
		<category><![CDATA[Semantic Web Gang]]></category>
		<category><![CDATA[SemanticLink]]></category>
		<category><![CDATA[SemanticWeb.com]]></category>
		<category><![CDATA[Talis]]></category>
		<category><![CDATA[The Semantic Link]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1367</guid>
		<description><![CDATA[Image via Wikipedia Almost exactly two years ago, I announced my intention to leave Talis and strike out on my own. A year ago, the final piece of that transition concluded when we published the last episode of the Semantic Web Gang; a round table podcast that I began inside Talis and subsequently produced with [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 310px;">
<dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/File:King_Arthur_and_the_Knights_of_the_Round_Table.jpg"><img title="King Arthur and the Knights of the Round Table..." src="http://upload.wikimedia.org/wikipedia/commons/thumb/f/fc/King_Arthur_and_the_Knights_of_the_Round_Table.jpg/300px-King_Arthur_and_the_Knights_of_the_Round_Table.jpg" alt="King Arthur and the Knights of the Round Table..." width="300" height="261" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://commons.wikipedia.org/wiki/File:King_Arthur_and_the_Knights_of_the_Round_Table.jpg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Almost exactly two years ago, <a href="http://cloudofdata.com/2008/12/paul-miller-is-bound-for-pastures-new/">I announced my intention to leave</a> <a class="zem_slink" title="Talis Group" rel="homepage" href="http://www.talis.com/">Talis</a> and strike out on my own. A year ago, the final piece of that transition concluded when we published the last episode of the <a class="zem_slink broken_link" title="Semantic Web Gang" rel="homepage" href="http://semanticgang.talis.com/">Semantic Web Gang</a>; a round table podcast that I began inside Talis and subsequently produced with their sponsorship.</p>
<p>Fast forward to the end of 2010, and it&#8217;s time to revisit the value that a monthly round table conversation offered to the community. Talking with friends, customers and contacts, many of them pointed to the useful role that something like the round table discussion format plays in cutting through marketing hype, community groupthink and competitor obfuscation. The old Semantic Web Gang did that, and it did it because it comprised a great group of contributors who knew their stuff.</p>
<p>Working with Eric Franzon at <a href="http://semanticweb.com/">semanticweb.com</a> we are about to recreate some of that value by launching <em>The Semantic Link</em>, and I am really pleased with the panel of regulars that Eric and I have signed up; a completely new group of contributors who also know their stuff.</p>
<div id="_mcePaste">
<ul>
<li><a href="http://www.linkedin.com/in/pensivepeter">Peter Brown</a></li>
<li><a href="http://www.linkedin.com/in/cjmconnors">Christine Connors</a></li>
<li><a href="http://nl.linkedin.com/in/iherman">Ivan Herman</a>, W3C</li>
<li><a href="http://www.linkedin.com/in/erichoffer">Eric Hoffer</a>, Second Integral</li>
<li><a href="http://www.linkedin.com/in/bhyland">Bernadette Hyland</a>, Talis Inc.</li>
<li><a href="http://www.linkedin.com/in/ericfranzon">Eric Franzon</a>, SemanticWeb.com</li>
<li><a href="http://www.linkedin.com/in/andraz">Andraz Tori</a>, Zemanta</li>
</ul>
</div>
<p>Our first episode will be recorded in early December, and should be pushed out via semanticweb.com (and iTunes, no doubt) before Christmas. We&#8217;ll then settle down to a regular cycle, recording and publishing each month, dissecting whichever trends, stories, initiatives and companies happen to be topical. I&#8217;m also keen to get guests on the show from time to time, especially where they&#8217;re involved in something truly interesting and newsworthy.</p>
<p>If you have ideas for topics you would like to hear us dissect, please do <a href="http://cloudofdata.com/contact/">let me know</a>.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/pixy.gif?x-id=c9a226da-6e9c-4a09-ada2-9f8822bb8c8c" alt="" /><span class="zem-script pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/11/looking-forward-to-the-semantic-link-podcast-on-semanticweb-com/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>&#8216;Towards a Web of Data?&#8217; presentation</title>
		<link>http://cloudofdata.com/2010/07/towards-a-web-of-data-presentation/</link>
		<comments>http://cloudofdata.com/2010/07/towards-a-web-of-data-presentation/#comments</comments>
		<pubDate>Thu, 22 Jul 2010 12:45:13 +0000</pubDate>
		<dc:creator>Paul Miller</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Web 3.0]]></category>
		<category><![CDATA[#tx6]]></category>
		<category><![CDATA[Bill Roberts]]></category>
		<category><![CDATA[Data Web]]></category>
		<category><![CDATA[Iconomical]]></category>
		<category><![CDATA[Liz Turner]]></category>
		<category><![CDATA[Manchester]]></category>
		<category><![CDATA[Swirrl]]></category>
		<category><![CDATA[Vision+Media]]></category>
		<category><![CDATA[visualisation]]></category>
		<category><![CDATA[visualization]]></category>
		<category><![CDATA[Web of Data]]></category>

		<guid isPermaLink="false">http://cloudofdata.com/?p=1075</guid>
		<description><![CDATA[I was in Manchester yesterday, having been invited over by Paul Collins to speak at Vision+Media&#8216;s final Transmissions workshop. The topic was &#8216;Towards a Web of Data,&#8217; and the other speakers were Bill Roberts of Swirrl and Liz Turner of Iconomical. Bill&#8217;s slides are here, and mine are embedded below. Thanks to Paul for the [...]]]></description>
			<content:encoded><![CDATA[<p>I was in <a href="http://en.wikipedia.org/wiki/Manchester">Manchester</a> yesterday, having been invited over by <a href="http://uk.linkedin.com/in/pau1co11ins">Paul Collins</a> to speak at <a href="http://www.visionandmedia.co.uk/">Vision+Media</a>&#8216;s final Transmissions workshop. The topic was &#8216;<a href="http://transmission6.eventbrite.com/">Towards a Web of Data</a>,&#8217; and the other speakers were <a href="http://uk.linkedin.com/pub/bill-roberts/5/a51/456">Bill Roberts</a> of <a class="zem_slink" title="Swirrl" rel="homepage" href="http://www.swirrl.com">Swirrl</a> and <a href="http://nl.linkedin.com/in/elizaturn">Liz Turner</a> of <a href="http://www.iconomical.com/">Iconomical</a>.</p>
<p>Bill&#8217;s slides are <a href="http://www.slideshare.net/billroberts/transmission6-publishing-linked-data">here</a>, and mine are embedded below.</p>
<p style="text-align: center;"><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=id=4795811&amp;doc=20100721-towardsawebofdata-100720072603-phpapp02" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><param name="wmode" value="transparent" /><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=id=4795811&amp;doc=20100721-towardsawebofdata-100720072603-phpapp02" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355" wmode="transparent"></embed></object></p>
<p>Thanks to Paul for the invite, and to the audience for braving an intermittently (very) wet Manchester evening to spend a few hours discussing Linked Data, the Semantic Web, and related topics; it was fun.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://techcrunch.com/2010/07/16/google-acquires-metaweb-to-make-search-smarter/">Google Acquires Metaweb To Make Search Smarter</a> (techcrunch.com)</li>
<li class="zemanta-article-ul-li"><a href="http://ironick.typepad.com/ironick/2010/07/tim-berners-lee-doesnt-seem-to-think-linked-data-requires-rdf.html">Tim Berners-Lee Doesnt Seem to Think Linked Data Requires RDF</a> (ironick.typepad.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.oscon.com/oscon2010/public/schedule/detail/13867">CubicWeb &#8211; The Semantic Web is a Construction Game!: OSCON 2010, The O&#8217;Reilly Open Source Convention &#8211; O&#8217;Reilly Conferences, July 19 &#8211; 23, 2010, Portland, OR</a> (oscon.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.chiefmartec.com/2010/04/15-sessions-on-marketing-and-the-semantic-web.html">15 sessions on marketing and the semantic web</a> (chiefmartec.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Enhanced by Zemanta" href="http://www.zemanta.com/"><img class="zemanta-pixie-img" style="border: none; float: right;" src="http://img.zemanta.com/zemified_e.png?x-id=b75960c6-d583-4fd3-8d15-fbfc9a7a5dd3" alt="Enhanced by Zemanta" /></a><span class="zem-script more-info pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://cloudofdata.com/2010/07/towards-a-web-of-data-presentation/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

