What is Data Lake: Data Lake is a centralized repository with many data in native and raw formats. The data lake has Distributed File Systems that provide cheap storage of large volumes of data. It supports different file formats like Parquet, CSV, JSON, etc. It supports massive parallel execution engines that do not require special […]
Posts Tagged ‘data lake’
Becoming Data-Driven: Give Employees and Customers Access to the Data
This is the first blog in a series that dives into how organizations become data-driven, with insights and strategy from Perficient’s Senior Data Strategist and Solutions Architect, Dr. Chuck Brooks. A data-driven organization is one that effectively and consistently utilizes data in its decision-making process across all levels of the organization. It means driving change, […]
Data Lake and Information Governance – The Key Takeaways
A Data Lake can be a highly valuable asset to any enterprise, and there is a myriad of technology solutions available for leveraging the processes to feed, maintain and retrieve information from the Lake. But all this technology is, if not worthless, significantly less valuable, if the environment is not well governed and managed. This […]
The Path to Artificial Intelligence in Healthcare
There is a lot of excitement about Artificial Intelligence. The excitement is certainly warranted based on the potential that these solutions can offer. In an industry that has massive amounts of data and is very dependent on the data to both run efficiently and, more importantly, delivery high quality patient care any technology which can […]
Information Governance – Essential Ingredient for Business Value
In my last blog, you may recall that we were discussing the value and the need for Standards and Templates for ensuring a consistent and efficient use of the Data Lake, both in its population (supplying) and in its retrieval (consuming) of information. To achieve this level of consistency and efficiency, as well as reliability, […]
Working with the Data Lake Aggregator – Standards and Templates
In my previous blog, I described the concept of an “Information Catalog” and how it plays a vital role in ensuring communication between the Data Lake Aggregator and Suppliers and Consumers is efficient and effective due to the common language that it provides. I also included the following diagram as an example of how the […]
Data Lake as Aggregator – The Critical Role of the Catalog
My previous blog talked about a Data Lake using a Supplier-Aggregator-Consumer analogy and talking about the roles each of these parties play. One factor that is critical to the success of this approach is the use of a common vocabulary that ensures efficiency and effectiveness in the interactions and collaborations between the parties. The implication […]
Data Lake Participants – Roles and Responsibilities
As you may recall, in my last blog I introduced the analogy of the Aggregator to describe utilizing a Data Lake as a Consolidator of information, and I mentioned the three key roles in this model: the Supplier, the Aggregator and the Consumer. In this post I will provide a little more detail on the […]
Common Ingestion Framework
Big Data is the way to move forward for all enterprises today. May it be healthcare, retail, finance or manufacturing, everyone is at different stages in their journey to create their industry-grade, enterprise-ready Data Lake repository. Data Lake forms the core foundation for companies in their journey to build state-of-the-art intelligent applications and provide cutting-edge […]
Data Lake Consolidation – the Aggregator Analogy
In my last blog, I introduced the concept of the Data Lake as a Consolidator and the critical success factor of applying robust Information Governance to this environment. In this post, I want to introduce an analogy to help visualize this environment and the parties involved. So, a Data Lake as Consolidator. What does that […]
Data Lakes and the Information Governance Critical Success Factor
Since my last post I’ve been working for a client that is actively engaged in establishing a Data Lake for the purpose of supporting their analytics efforts, but also looking to “re-architect” the way their systems collaborate by using this Data Lake environment to control and consolidate all information-sharing interactions within their environment. I was […]
5 Oracle Analytics Trends to Watch Out for Starting Now
“Catching up” is the term that came to mind when I used to check out what’s new with Oracle Analytics in previous years. This year, however, I frankly say I was impressed with what I saw at Oracle Open World last week. The rules of the analytics platform game have changed, tremendously. This is after […]