Posts Tagged ‘Cloudera’

Harness your data with a data strategy

Data Strategy at Strata Data Conf New York

It’s no secret that data is a massive asset when it comes to making better business decisions. But gaining the valuable insights required to make those decisions requires quality data that you can trust. And to accomplish this you need a data strategy. Without understanding your business objectives, identifying use cases, knowing how your users […]

Hadoop, Spark, Cassandra, Oh My!

Previously, I reviewed why Spark will not replace Hadoop by itself, but Spark combined with other data storage and resource management technologies creates other options for managing Big Data. Today we will investigate how an enterprise should proceed in this new, “Hadoop is not the only option” world. Hadoop, Spark, Cassandra, Oh My! Open source Hadoop and […]

IBM’s Spark Investment is Evidence Big Data is Dead

Right after I posted my blog on Spark and Hadoop, I came across this article. IBM made a big announcement that they are putting their weight behind Spark. They are committing more than 3,500 developers and programmers to help move Spark forward. This combined with significant support from the Big 3 Hadoop distributors (Hortonworks, Cloudera, […]

Hadoop’s Ever-Increasing Role

With the advent of Splice Machine and the release of Hive 0.14, we are seeing Hadoop’s role in the data center continue to grow. Both of these technologies support limited transactions against data stored in HDFS. Now, I would not suggest moving your mission-critical ERP systems to Hive or Splice Machine, but the support of […]

A little stuffed animal called Hadoop

Doug Cutting – Hadoop creator – is reported to have explained how the name for his Big Data technology came about: “The name my kid gave a stuffed yellow elephant. Short, relatively easy to spell and pronounce, meaningless, and not used elsewhere: those are my naming criteria.” The term, of course, evolved over time and […]