
Microsoft Corporate Vice President Quentin Clark discusses data, data platforms, and more
The Data Platform Group at Microsoft does a lot, from SQL Server and their Hadoopey HDInsight offering through to Business Intelligence and analytics capabilities which sit in or on top of the humble Excel spreadsheet. I’ve touched upon pieces of this whole before, in a 2009 podcast on Azure with Amitabh Srivastava (then Corporate VP...

This is the news… today
I recently set up a new Tumblr site, in addition to my main blog here at cloudofdata.com. It’s been running for a few days now, and I’m deliberately posting just one entry each day to explore a topical news item, issue, or trend. We are all inundated by a constant stream of press releases, tweets,...

Getting it right with data attribution
There have always, it seems, been people for whom attribution and citation really matter. Some of them passionately engage in arguments that last months or years, debating the merits of comma placement in written citations for the work of others. Bizarre, right? But, as we all become increasingly dependent upon data sourced from third parties,...

Find the data, aggregate the data, make the data useful
I was in New York in March, taking part in GigaOM’s Structure:Data event. As usual on these trips, I spent the day before the event walking around the city, soaking up some air, getting rained on, using coffee to stay awake, and meeting with a number of local companies. Of the companies I met that...

Doing the DataBeat
For the past two years, Ben Kepes and I have helped the team at VentureBeat assemble the programme for their annual Cloud Computing event, CloudBeat. It looks as though we may end up doing something similar with them this year, as CloudBeat moves from Redwood City to downtown San Francisco, and from November to September....

Thinking about Open Data, with a little help from the Data Hub
Continuing to explore the adoption of explicit Open Data licenses, I’ve been having a trawl through some of the data in the Open Knowledge Foundation‘s Data Hub. I’m disappointed – but not surprised – by the extent to which widely applicable Open Data licenses are (not!) being applied. For those who are impatient or already aware of the background,...

Survey: How open is your data?
Back in 2006 as we rolled out the first public draft of the Talis Community Licence, the world of data licensing seemed a simple place. Today, the Open Knowledge Foundation‘s Data Hub contains 3,888 data sets, many of which are explicitly licensed with respect to the Open Definition. But many are still not explicitly licensed. Over at...
Data Market Chat: Rufus Pollock and Irina Bolychevsky discuss the Open Knowledge Foundation and CKAN
The Open Knowledge Foundation (OKFN) promotes the creation, dissemination and use of ‘open knowledge.’ As part of this activity OKFN developed a data repository called CKAN, and has seen this become increasingly important to a range of data dissemination activities such as data.gov.uk and publicdata.eu. In this podcast I talk with OKFN Director Rufus Pollock...
Open is good – but encouragement better than mandate
Openness is undeniably cool right now, at least if you move in the slightly odd circles that I do. Openly available scientific papers are disrupting the world of scholarly publishing (which may not be all good, but that’s a post for another day). Openly available university courses are finally beginning to work out how to...
CloudCamp London: the Big Data Special
The CloudCamp unconference returned to London for the 14th time this evening, regaling a capacity crowd in the Crypt below Clerkenwell’s St James Church with several hours of discussion and debate on the somewhat elusive topic of ‘Big Data’. Rather rough notes of the proceedings follow, after the break.
Top Level Domain for data answers the wrong question
British-born computer scientist Stephen Wolfram sees ongoing efforts to extend the Internet’s top-level domains (TLDs) beyond the familiar .com, .org, .uk etc as an opportunity to raise the profile of machine-readable data. In a blog post published yesterday, he argues that a new .data domain would increase “exposure of data on the internet—and [provide] added impetus for...
Trust, Big Data, Semantics, Data Marketplaces, and More Trust
I’ve had a few posts published over the weekend, picking up some things I have written about before. These are; My latest monthly column on SemanticWeb.com; Big Data Presents a Big Opportunity? My latest weekly wrap-up on GigaOMPro; Rosslyn Analytics, Microsoft Finding Value in Data Aggregation The teaser piece on GigaOM’s public Cloud site; In...