Skip to main content

Posts Tagged ‘Bigdata’

Kafka And Mongodb Integration In Big Data

Introduction to Kafka and MongoDB Integration

Introduction: Apache Kafka is a distributed streaming platform that enables businesses to build real-time streaming applications. First developed at LinkedIn in 2010, it has become one of the most widely used messaging systems for big data and real-time analytics. Kafka can process and transmit massive amounts of data in real-time, and its design ensures fault […]

2 Choices for Big Data Analysis on AWS: Amazon EMR or Hadoop on EC2

What are the key differentiators to determine Hadoop distribution for Big Data analysis on AWS? We have two choices: Amazon EMR or a third-party provided Hadoop (ex: Core Apache Hadoop, Cloudera, MapR etc). Yes, cost is important. But, aside from cost, other things to look for include ease of operation, controlling, managing, performance, features etc. 1. Cost […]

Big Data Trends

Introduction My first blog in nearly 2 years with Perficient. I have been watching this space for a while and have been wondering for some time about potentials topics to blog. I have decided to initially focus my blogs on some interesting Business Analytics technologies which are discussed infrequently. ( Data mining, Predictive Analytics, Text […]

Big Data – Organizational Challenges

My previous blog post was about Enterprise Search.  Within my research, I continuously came across the term Big Data.  Obviously, I had heard of this before, but I was not well versed on what it actually meant.  While I was waiting at the airport, I saw a Harvard Business Review (HBR) magazine staring at me, […]

Why Establish a Data Warehouse? (Part I in a Series)

A Data Warehouse (DW), among other things, can be an important step in establishing a holistic Business Intelligence program.  But, even by itself, there are some very good reasons to implement a DW, as cited below. Due to unacceptable transaction processing times in server/disk bound tasks, many firms find that processing their reports and queries […]

Driving trends in information management and business intelligence

Scott Laningham (@ScottLaningham) and Todd Watson (@TurboTodd) of IBM’s DeveloperWorks podcasts series interviewed our IBM business intelligence lead, Andy Ho, while at IBM’s Information On Demand 2011 conference this week in Las Vegas. As usual, Scott and Todd identified key insights from Andy’s experience working with clients in information management every day, but not without […]