
Kent Jiang

I currently work at Perficient China GDC in Hangzhou as a Lead Technical Consultant. I have 8 years of experience in the IT industry across Java, CRM, and BI technologies. My technical interests include business analytics, project planning, MDM, and quality assurance.

Blogs from this Author

Heron: Is it Apache Storm 2.0?

When it comes to real-time messaging or event analytics on distributed nodes, we think of Storm and Kafka, both well-known Apache projects. Storm has run in production at Twitter for many years, which has proven it to be powerful. On June 4, Twitter announced that it had developed a new system called Heron to handle […]

How to Configure Eclipse for Spark Application in the Cluster

Spark provides several ways for developers and data scientists to load, aggregate, and compute data and return a result. Many Java or Scala developers prefer to write their own application code (aka the driver program) instead of entering commands into the built-in Spark shell or Python interface. Below are some steps for how to quickly configure […]

Understanding Extract Strategy in Oracle DAC

Almost every ETL tool has its own extract and load strategy and configuration. In general, the user should focus on the full load (aka initial load) and incremental load options in whatever tool is used. The Oracle BI Apps component provides straightforward options that let developers configure appropriate approaches to achieve data load in […]

How to Setup Local Standalone Spark Node

As my previous post noted, Spark is becoming a popular and powerful big data technology used by many organizations and individuals. The Spark project is written in Scala, which is a purely object-oriented and functional language. So, what can a Java developer do if he or she wants to learn about […]

Hangzhou Apache Spark Meetup

Like the Hadoop project, the Apache Spark project is a fast-evolving platform, in this case an in-memory engine for large-scale data processing. In recent years in particular, Spark has been widely adopted by many organizations, and its community has many active contributors. Perficient China GDC colleagues attended a recent Spark technology meetup in Hangzhou. During the meetup […]

Current Employee Status with SCD Type 2 in OBIEE

In star-schema-based data modeling, it is not very common to enable a slowly changing dimension as Type 2, although doing so helps meet requirements such as ‘show me the change history for the people in one business group’ or ‘show me the revenue and cost incurred by the employees at point of […]

Get R Running over YARN-based MapReduce

Among mathematical and statistical languages and tools such as SAS, SPSS, and Matlab, the R language is a pretty good tool that provides the environment and essential packages for statistical computing and graphics. It is free, and it offers an open environment and the means for users to develop custom packages. In addition to […]

First Experience with Informatica Cloud

I remember first hearing about the cloud concept about ten years ago, and now the real cloud age is here. Cloud deployment and practice in the CRM area may have been the pioneer compared with other areas. It is good to see more and more BI vendors boosting their cloud strategy and product development […]

Adopting Agile in BI Requirement Gathering

There have been numerous discussions, and even arguments, about the best implementation strategy for BI projects. But there is no doubt that more and more teams are adopting agile processes, or the agile spirit, because of their value. When we have conversations about agile methodology or agile teams we usually start […]

TM1 Project Agility with JIRA

JIRA has been a powerful and useful tracking tool for most agile projects, and it continues to evolve with more convenient features. Over the past half year, my team and I worked on a TM1 project where we managed and tracked all requirements, tasks, and effort in JIRA across multiple teams. TM1 […]

Column-Oriented Storage for Massive Data

We have been using a number of database management systems (DBMS) such as SQL Server, DB2, and Oracle, but we probably don’t think much about how the data is stored and organized on disk. In fact, most popular DBMSs are built on a row-oriented structure that stores content by row, while in today’s DB market, column-oriented […]

Will EDW Vanish Because of Hadoop?

Nowadays, when people talk about IT technology and trends, big data dominates the conversation as we enter a new technology era. I know that big data has been promoted to a national strategy in many countries. A few days ago I found an interesting debate about whether the traditional data warehouse will be replaced by […]
