It’s no secret that rocket .. err … data scientists are in short supply.
The explosion of data and the corresponding explosion of tools, and the
knock-on impacts of Moore’s and Metcalfe’s laws, is that there is more
data, more connections, and more technology to process it than ever. At last
year’s Hadoop World, there was a feeding frenzy for data scientists, which
only barely dwarfed demand for the more technically oriented data architects.
In English, that means:
Potential MacArthur Grant recipients who have a passion and insight for data,
the mathematical and statistical prowess for ginning up the algorithms, and
the artistry for painting the picture that all that data leads to. That’s
what we mean by data scientists. People who understand the platform side of
Big Data, a.k.a., data architect or data engineer.
The data architect side will be the more straightforw... (more)
At this point, probably at least 90 percent or more of analytic systems/data
warehouses are easily contained within the SQL-based technologies that are
commercially available today. We’ll take that argument a step further: Most
enterprise data warehouses are less than 5 terabytes. So why then all the
excitement about big data, and why are acquisitions in this field becoming
almost a biweekly thing?
To refresh the memory, barely a couple weeks back, HP announced its intention
to buy Vertica. And this morning came the news that Teradata is buying the
other 89 percent of Aster Data ... (more)
Is it more than coincidence that IT acquisitions tend to come in waves? Just
weeks after IBM's announcement to snap up Lombardi, Progress Software today
responds with an agreement to put Savvion out of its misery? In such a small
space that is undergoing active consolidation, it is hard not to know who's
Nonetheless, Progress's acquisition confirms that Business Process Management
(BPM)'s pure play days are numbered, if you expect executable BPM.
The traditional appeal of BPM was that it was a business stakeholder-friendly
approach to developing solutions that didn't rely... (more)
When we last left Oracle’s big data plans, there was definitely a missing
piece. Oracle’s Big Data Appliance as initially disclosed at last fall’s
OpenWorld was a vague plan that appeared to be positioned primarily as an
appliance that would accompany and feed data to Exadata. Oracle did specify
some utilities, such as an enterprise version of the open source R
statistical processing program that was designed for multithreaded execution,
plus a distribution of a NoSQL database based on Oracle’s BerkeleyDB as an
alternative to Apache Hive. But the emphasis appeared to be extractio... (more)
With Strata, IBM IOD, and Teradata Partners conferences all occurring this
week, it’s not surprising that this is a big week for Hadoop-related
announcements. The common thread of announcements is essentially, “We know
that Hadoop is not known for performance, but we’re getting better at it,
and we’re going to make it look more like SQL.” In essence, Hadoop and
SQL worlds are converging, and you’re going to be able to perform
interactive BI analytics on it.
The opportunity and challenge of Big Data from new platforms such as Hadoop
is that it opens a new range of analytics. On o... (more)