Skip to main content

Posts Tagged ‘Hadoop’

Data Warehouse Role in Big Data

Last year the Data Warehouse was on the endangered species list.   A number of Hive solutions were being marketed as the Data Warehouse killers. However, this message has been muted this year and is evidenced by some developing trends. First, all of the mega-vendors have announced technologies to access data in Hadoop.   Oracle and IBM […]

Yarn – The Big Data Accelerator

Yarn….. Yes, Hadoop may be changing everything, but when Yarn was released, the change pedal has been pushed aggressively to the floor. Putting the technical details aside, the bottom-line is that now multiple concurrent workloads can be executed and managed on Hadoop clusters. This “pluggable” service layer has separated the data processing and cluster resource […]

Making Big Data Real

Update from the Hadoop Summit: Its only part way through day one and there is an un-mistakable theme: Interactive SQL on top of Hadoop is here, and in a big way.   Stinger, Impala, and a number of other niche providers are not promising, but delivering on interactive SQL.   Benchmarks, case studies of production clients, hands […]

Three Big Data Best Practices

One of the benefits of the Hadoop is its ability to be configured to address a number of diverse business challenges and integrated into a variety of different enterprise information ecosystems.  With proper planning these analytical big data systems have shown to be valuable assets for companies.  However, without significant attention to data architecture best […]

Strengthen Company Culture with Yammer enhanced by HDInsight

In a world of broadband internet connections, online collaboration tools and the ability to work from almost anywhere – office culture can be difficult to sustain.  This especially holds true for people who live in large cities (where the commute can be problematic) or in harsh climates (like the never ending winter in Chicago this […]

10 Considerations for a Successful Sitecore Azure Implementation

You’ve likely heard of Sitecore, an enterprise-class .NET web content management system (CMS) with extensive tools designed for marketers, as well as Microsoft’s cloud computing platform and infrastructure, Windows Azure – but how much do you know about Sitecore Azure? If you are already using Sitecore, or are considering it, and are also interested in […]

Setting up a Recommendation Engine (Mahout) on Windows Azure

A Brief Background In my previous posts I have walked through setting up Hadoop on Windows Azure using HDInsight.  Hadoop is an extremely powerful distributed computing platform with the ability to process terabytes of data.  Many of the situations when you hear the term “Big Data”, Hadoop is the enabler.  One of the complications with […]

How to: Setting up an HDInsight Hadoop cluster in Windows Azure

Edit: Part 3 using Mahout here In my previous post I described the basics of HDInsight on Windows Azure and an example of what a Hadoop cluster can do for you. Without further delay, lets build a cluster!  If you don’t already have a Windows Azure account go here and sign up (it’s free!!) Setup […]

Windows Azure and the future of the personalized web : Intro

Edit: Part 2 (setup) : Part 3 (Mahout) The internet is becoming increasingly personalized.  It has transitioned from indexing massive wells of information to delivering personalized information, or recommendations based on complex searches.  Evidence of this is seen in Google’s Knowledge graph, Amazon, the Bing engine, Facebook friends and twitter recommending people you may be interesting in […]

Microsoft PDW Blog Series – Part 1

Introduction If you follow the SQL Server community at all, you’ve probably heard a lot of buzz around PDW (Parallel Data Warehouse).  This is the first in a series of blogs I am going to be writing about PDW.  In this series, I am going to cover everything from PDW nuts and bolts to how […]

Technology Confusion

While returning from a client presentation and reflecting on the meeting conversations I was struck by a similarity that seems to be creeping into the minds of our clients. While discussing our approach to performing a strategy assessment for this new client we were reviewing an example architectural diagram and a question was raised. One […]

Will EDW Vanish Because of Hadoop?

Nowadays when people talk about IT technology and trend, there was overwhelming world of big data as we entering the new technology era. I know that big data has been promoted to national strategy in many countries. A few days ago I found an interesting debate whether the traditional data warehouse will be replaced by […]

Load More