Big data, but with a familiar face
To prepare for O’Reilly’s upcoming Strata Conference, we’re continuing our series of conversations with some of the leading innovators working with big data and analytics. Today, we have a brief chat...
View ArticleStrata Gems: Who needs disks anyway?
We’re publishing a new Strata Gem each day all the way through to December 24. Yesterday’s Gem: Kinect democratizes augmented reality. Today’s databases are designed for the spinning platter of the...
View ArticleStrata Gems: Turn MySQL into blazing fast NoSQL
We’re publishing a new Strata Gem each day all the way through to December 24. Yesterday’s Gem: What your inbox knows. The trend for NoSQL stores such as memcache for fast key-value storage should give...
View ArticleThe growing importance of data journalism
One of the themes from News Foo that continues to resonate with me is the importance of data journalism. That skillset has received renewed attention this winter after Tim Berners-Lee called analyzing...
View ArticleBig data faster: A conversation with Bradford Stephens
To prepare for O’Reilly’s upcoming Strata Conference, we’re continuing our series of conversations with some of the leading innovators working with big data and analytics. Today, we hear from Bradford...
View ArticleHadoop: What it is, how it works, and what it can do
Hadoop gets a lot of buzz these days in database and content management circles, but many people in the industry still don’t really know what it is and or how it can be best applied. Cloudera CEO and...
View ArticleOracle’s NoSQL
Oracle’s turn-about announcement of a NoSQL product wasn’t really surprising. When Oracle spends time and effort putting down a technology, you can bet that its secretly impressed, and trying to...
View ArticleHeavy data and architectural convergence
Recently I spent a day at the Hadoop Summit in San Jose. One session in particular caught my attention because it hints at a continued merging of the RDBMS and Hadoop worlds. EMC’s Lei Chang gave a...
View ArticleGoogle’s Spanner is all about time
In case you missed it, Google Research published another one of “those” significant research papers — a paper like the BigTable paper from 2006 that had ramifications for the entire industry (that...
View ArticleFour short links: 19 April 2013
Bruce Sterling on Disruption — If more computation, and more networking, was going to make the world prosperous, we’d be living in a prosperous world. And we’re not. Obviously we’re living in a...
View ArticleFour short links: 5 July 2013
Quantitative Analysis of the Full Bitcoin Transaction Graph (PDF) — We analyzed all these large transactions by following in detail the way these sums were accumulated and the way they were dispersed,...
View ArticleAugmenting Unstructured Data
Our world is filled with unstructured data. By some estimates, it’s as high as 80% of all data. Unstructured data is data that isn’t in a specific format. It isn’t separated by a delimiter that you...
View ArticleNoSQL Choices: To Misfit or Cargo Cult?
Retreading old topics can be a powerful source of epiphany, sometimes more so than simple extra-box thinking. I was a computer science student, of course I knew statistics. But my recent years as a...
View ArticleDealing with Data in the Hadoop Ecosystem
Kathleen Ting (@kate_ting), Technical Account Manager at Cloudera, and our own Andy Oram (@praxagora) sat down to discuss how to work with structured and unstructured data as well as how to keep a...
View ArticleRestructuring the Web with Git
Web designers? Git? Github? Aren’t those for programmers? At Artifact, Christopher Schmitt showed designers how much their peers are already doing with Github, and what more they can do. Github (and...
View ArticleMore than enough Arel
In Just Enough Arel, we explored a bit into how the Arel library transforms our Ruby code into SQL to be executed by the database. To do so, we discovered that Arel abstracts database tables and the...
View ArticleFour short links: 10 June 2014
Trusting Browser Code (Tim Bray) — on the fundamental weakness of the ‘net as manifest in the browser. Deep Learning in the Raspberry Pi (Pete Warden) — $30 now gets you a computer you can run deep...
View ArticleFour short links: 24 July 2014
Neglected Machine Learning Ideas — Perhaps my list is a “send me review articles and book suggestions” cry for help, but perhaps it is useful to others as an overview of neat things. First Crowdfunded...
View ArticleFour short links: 28 July 2014
streisand — sets up a new server running L2TP/IPsec, OpenSSH, OpenVPN, Shadowsocks, Stunnel, and a Tor bridge. It also generates custom configuration instructions for all of these services. At the end...
View ArticleDynamic columns in MariaDB
Buy Learning MySQL and MariaDB. MariaDB and similar SQL database systems allow for a variety of data types that may be used for storing data in columns within tables. When creating or altering a...
View Article
More Pages to Explore .....