As times have evolved, so have our data challenges, yet we have been trying to solve them for ages. Even with dedicated resources, robust hardware, and so on, we still end up missing SLAs in our traditional on-premise and cloud-hosted worlds. Add to that the constant dependency on personnel to manage the data loads, data […]
Data + Intelligence
Cognos Analytics: Nested Category-X axis in Combination Charts
In Cognos Analytics, the visualizations available through charts and graphs are quite intuitive, and they offer various insights into your data. This blog highlights a select category of bar chart and the construction behind it. Typically, bar charts, as the name suggests, plot rectangular bars on the Y-axis of varying height for each […]
Top-down approach for Migration of SQL code from Microsoft SQL Server to Presto
In typical Migration Projects, data housed in Database Server A (provided by Vendor A) is extracted, transformed, and loaded into Database Server B (produced by Vendor B). Each of these servers offers a wide array of functions to query data. Though the function names and syntaxes may vary, all the DB servers in the market […]
Handshake Added to Google Cloud Search Connector Directory
The framework continues to grow and new features are being added every day. We’re working on getting a public image of the Framework up and running, so you, dear readers, will be able to test the speed, flexibility, and extensibility of Handshake at your convenience! Until then, the Handshake footprint grows. We’re demoing a lot. […]
Don’t Miss Out! Join Us at Google Cloud Next ’20: OnAir
Experience cloud at its best during Google Cloud Next ’20: OnAir, July 14 – September 8, 2020. During this 9-week event, a new topic will be covered each week, complete with virtual breakout sessions and solution keynotes. Sign in every Tuesday at 9AM PT to discover new on-demand content, demos, success stories, consult with experts, […]
Using Cloud to Solve Healthcare Pain Points
This is the second post in a series on Google healthcare capabilities. To read the first post, click here. With the healthcare industry having to quickly jump on the cloud bandwagon with telehealth recently, it is becoming apparent that a new norm is (in my millennial opinion, finally) emerging. This begs the question – how […]
DataOps with IBM
DataOps seeks to deliver high-quality data fast in the same way that DevOps delivers high-quality code fast. The names are similar; the goals are similar; the implementation is very different. Code quality can be measured using similar tools across multiple projects. Data quality is a mission-critical, enterprise-wide effort. The effort has consistently proven […]
Trust models in distributed ledgers
Consensus, getting distributed processes to agree on a single value, is a fundamental problem in computer science. Distributed processing is difficult. In fact, there are logical proofs that show pretty conclusively that there won’t be a single perfect algorithm for handling consensus in an asynchronous system made of imperfect nodes. As long as there is […]
Detect Ransomware Using Splunk
Ransomware has become one of the most common and lucrative forms of malware, recently eclipsing even credit card theft incidents. With the potential for huge profits, you can be sure hackers will be coming up with even more effective and dangerous attacks. Recent ransomware attacks include exfiltrating data that is then held for ransom. Limitations […]
When to Shake: Handshake Use Cases (Part 1)
We continue to emphasize that connectors need to be as variable as search use cases, and that the biggest benefit of Handshake is its flexibility. But what are those use cases? And how does Handshake achieve those goals more quickly and completely? In this series of posts, we’ll describe the most common search use cases we’ve […]
Understanding Performance in Blockchain Systems
Blockchain is an example of a distributed ledger system and, as such, shares the same performance concerns as any other distributed system. In order to measure the performance of a distributed system with an acceptable degree of accuracy, it’s best to simplify as many of the variables under our control as possible. The size of the […]
Take advantage of windows in your Spark data science pipeline
Windows can perform calculations across a certain time frame around the current record in your Spark data science pipeline. Windows are SQL functions that allow you to access data before and after the current record to perform calculations. They can be broken down into ranking and analytic functions, and aggregate functions can be used over windows as well. Spark provides the […]
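
To make the idea of "data before and after the current record" concrete, here is a minimal plain-Python sketch of what a window computes. It is not Spark code and not taken from the post: the sample rows, the column names, and the helper `with_window_columns` are all hypothetical. It partitions rows by sensor, orders each partition by timestamp, and derives the previous value (a lag), the next value (a lead), and an average over a frame running from one row before to one row after the current record.

```python
from collections import defaultdict

# Hypothetical sample rows: (sensor, timestamp, reading)
ROWS = [
    ("a", 1, 10.0),
    ("a", 2, 14.0),
    ("a", 3, 12.0),
    ("b", 1, 5.0),
    ("b", 2, 7.0),
]


def with_window_columns(rows):
    """Add lag, lead, and a moving average per sensor partition."""
    # Partition step: group the rows by sensor.
    partitions = defaultdict(list)
    for sensor, ts, reading in rows:
        partitions[sensor].append((ts, reading))

    result = []
    for sensor in sorted(partitions):
        # Ordering step: sort each partition by timestamp.
        ordered = sorted(partitions[sensor])
        readings = [reading for _, reading in ordered]
        for i, (ts, reading) in enumerate(ordered):
            # Analytic-style values: look one row back and one row ahead.
            lag = readings[i - 1] if i > 0 else None
            lead = readings[i + 1] if i + 1 < len(readings) else None
            # Frame: one row before through one row after the current row,
            # clipped at the partition boundaries.
            frame = readings[max(i - 1, 0): i + 2]
            result.append({
                "sensor": sensor,
                "ts": ts,
                "reading": reading,
                "lag": lag,
                "lead": lead,
                "moving_avg": sum(frame) / len(frame),
            })
    return result


if __name__ == "__main__":
    for row in with_window_columns(ROWS):
        print(row)
```

In PySpark, the same shape would typically be expressed with a window specification built from `Window.partitionBy(...).orderBy(...)`, with functions such as `lag`, `lead`, and `avg` applied over it. The sketch above only mirrors those semantics by hand so the partition, ordering, and frame steps are visible.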