In just two weeks, Perficient leaders are headed to San Francisco to attend Data + AI Summit! This key conference for the data, analytics, and AI community takes place June 26th – 29th at Moscone Center and will attract an estimated 10,000 data professionals from every industry all over the globe. The conference has something […]
Databricks
Real-time Data Processing: Databricks vs Flink
Real-time data processing is a critical need for modern-day businesses. It involves processing data as soon as it is generated to derive insights and take immediate actions. Databricks Streaming and Apache Flink are two popular stream processing frameworks that enable developers to build real-time data pipelines, applications and services at scale. In this article, we […]
Harden Databricks with Immuta’s Policy-As-Code Framework
Databricks provides a powerful, Spark-centric, cloud-based analytics platform that enables users to rapidly process, transform and explore data. However, because of the flexibility it offers, its preconfigured security can be insufficient for regulating or monitoring confidential information. This can be of particular concern to highly regulated enterprises, such as financial and healthcare companies. Policy-as-code […]
Delta Sharing for Modern Secure Data Sharing
Delta Lake is an open-source framework under the Linux Foundation used to build Lakehouse architectures. A new project is Delta Sharing, which is an open protocol for secure real-time exchange of large datasets. Databricks provides production-grade implementations of the projects under delta-io, including Databricks Delta Sharing. Understanding the open-source foundation of the enterprise offering can […]
Top 5 Takeaways from Databricks Data + AI Summit 2022
The Data and AI Summit 2022 brought major announcements for the Databricks Lakehouse Platform. Among these were several notable enhancements to Databricks Workflows, the fully managed orchestration service that is deeply integrated with the Databricks Lakehouse Platform, and to Delta Live Tables as well. With these new capabilities, Workflows enables data engineers, data scientists and analysts […]
Databricks Integration with Snowflake
What is Databricks? Databricks is a unified, cloud-based data platform powered by Apache Spark. It specializes in collaboration and analytics for big data. Databricks is a data science workspace with Collaborative Notebooks, Machine Learning Runtime, and Managed MLflow. Collaborative Notebooks support multiple data analytics languages, such as SQL, Scala, R, Python, and […]