As we head towards the end of the first quarter of 2012, there can be no doubt that the concept of Big Data has arrived. But what is Big Data? In 1985 a PC with a 10-megabyte disk drive was state of the art; by 2010, 1-terabyte drives were commonplace. Is this Big Data? Ten years ago 10 to 20 terabytes was the high end of commercial databases; today petabyte databases are not unheard of. Is this Big Data? Each of these examples addresses one aspect of Big Data while missing the other commonly accepted dimensions.
What are the dimensions of Big Data? In 2001, The Meta Group published a report that described the challenges that traditional data management faced. The report described the three terms that have become widely accepted as defining Big Data – Volume, Velocity, and Variety. The major software vendors have all accepted this definition and use it when describing their products, whether it be SAP’s HANA, IBM’s Big Insights, or Oracle’s Big Data Appliance.
Data is being generated in larger and larger quantities every day. It comes not only from human sources but, more and more, from machines. Advances in healthcare generate masses of patient data, smart meters flood energy companies with usage information, and process manufacturers monitor every phase of their production.
Big Data comes in one size: large.
Not only is Big Data generated in large volumes, it is coming at us quickly. In many cases, it must be acted upon just as quickly to have value. Twitter’s tiny tweets generate many terabytes of data every day. Large organizations must mine and react to the information quickly to avoid unwanted publicity.
Big Data arrives at one speed: fast.
Big Data includes data that cannot be easily described by the classic row-column, record-field paradigms. It includes unstructured data in all its many forms: text, audio, video, streams, log files, and more. As new products and services are envisioned, new data types will be created and more Big Data will be generated.
Big Data manifests itself in one format: mixed.
Beyond the standard three descriptive dimensions, some proponents are suggesting other dimensions for Big Data – Value and Validity.
Value is surely a meaningful way to measure Big Data. There is value to be released from the masses of data we own; the challenge is creating our own environmentally acceptable “fracking” processes to release it.
The Validity of data is always of interest. Invalid “Small Data” can cause huge issues for business users; imagine the effect of invalid data at Big Data scale. With Big Data, however, the issue is not necessarily whether the data itself is right or wrong, but whether we are arriving at the right or wrong conclusions as we analyze and consume it.
Big Data is the challenge-of-the-moment facing us all, and it is something we can address. Tools and technologies are coming to bear that will help us manage and leverage Big Data. Hadoop and MapReduce are new players in the field; they address the Volume and Variety dimensions. Traditional data warehousing manages the Volume and Velocity dimensions very well. As time progresses, the marriage of these and other technologies will make Big Data old hat.
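For readers unfamiliar with the MapReduce model mentioned above, its essence can be sketched in miniature. The toy word-count below is an illustrative sketch of the map/reduce pattern only, not an actual Hadoop program; the function names are my own.

```python
from collections import defaultdict

def map_phase(documents):
    """Map step: emit a (word, 1) pair for every word in every document."""
    for doc in documents:
        for word in doc.split():
            yield (word.lower(), 1)

def reduce_phase(pairs):
    """Reduce step: sum the emitted counts for each distinct word."""
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

docs = ["Big Data is big", "big volume big velocity"]
print(reduce_phase(map_phase(docs)))
```

In a real Hadoop cluster, the map and reduce phases run in parallel across many machines, with the framework shuffling each word's pairs to the same reducer; that distribution is what lets the pattern scale to the Volume and Variety dimensions.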