Kent Jiang, Author at Perficient Blogs
Blog

Posts by this Author

  • Topics
  • Industries
  • Partners

Explore

Topics

Industries

Partners

Kent Jiang

Currently I was working in Perficient China GDC located in Hangzhou as a Lead Technical Consultant. I have been with 8 years experience in IT industry across Java, CRM and BI technologies. My interested tech area includes business analytic s, project planning, MDM, quality assurance etc

Posts by this author:

MongoDB 4.0 will Support Multiple Document Transactions

In the middle of February 2018 during the Chinese spring festival holiday, we received the following news from the MongoDB community: “MongoDB 4.0 will add support for multi-document transactions, making it the only database to combine the speed, flexibility, and power of the document model with ACID guarantees.” We know that ACID is a very […]

Read more

Securing Rest Service with Spring Security and oAuth2 Method

Micro-services are very popular in the application architecture and more and more enterprises have turned the single instance system to micro-service.  Ideally, the variety of services can be built by different teams with different programming languages such as Nodejs, Java, Go or others. Security is ALWAYS one of the top factors we should take into […]

Read more

Building a Near Real-Time Dashboard with Big Data Platforms

At the start of the year, we launched an internal effort to build an internal dashboard for employees, to provide a full view of employee personas, performance, the growth of employee skills, machine log monitoring etc. Especially for the machine log, the server produces 1 terabyte of information every week, on which we were able to […]

Read more

Hangzhou Spark and Flink Meetup 2017

Almost every year I will be attending the Hangzhou Spark meetup, where we can meet the Spark and Hadoop fans and expert from the local community. I love it because it is an open meetup and nothing to do with the commercial. In this year there are great knowledges and experiences sharing on Flink as […]

Read more

Design and Build Your Own Restful API with Swagger

Though APIs have been used for decades, they are turning into the Micro level to fit more frequent changes and decoupling requirements. In recent years, one thing that we have been seeing is a transition from monolithic applications to microservice‑oriented architecture. Out of several API designing and building tools, we would talk about one of […]

Read more

Java SE 9 Discovery

Eventually the Java Standard Edition 9 was released along with J2EE 8 after a long wait. This news is not overwhelming in the media report like other exciting technologies of artificial intelligence, cloud etc. However, per the recent programming language ranking report, Java is still one of the most popular languages. In http://statisticstimes.com, for both […]

Read more

BigDL – An Open Source Framework for Deep Learning

We have been familiar with machine learning for several years. It has been used in many aspects of business and life such as natural language recognition, fraud detection and autopiloting. While deep learning is a branch of machine learning, it is a big part of the machine learning family based on data representation other than […]

Read more

NoSQL – CouchDB vs Couchbase

It has been a trend to employ NoSQL database in the modern application system development for either local or cloud platforms. We know that NoSQL stands for “Not the SQL relational database”, but more accurately, it is “Not only the SQL relational database.” NoSQL database can handle structured datasets well in the traditional way, but […]

Read more

3 Steps to Configure SQL Developer for Hive Data

In my previous post, I introduced a HiveServer2 based approach to query data in Java code. If we don’t have to write any additional code, we could leverage a Database tool to do the manipulation. As we know, Oracle SQL Developer is a free tool that can be used for different kinds of databases by installing […]

Read more

How to Query Hive Data in HDP via Java Code

What is Hive? You may have heard about Data Warehousing products such as Oracle, Teradata Netezza which have been used in different industries for decades. Recently, Hive emerged as a new generation of the open-source data warehouse product which features clustered nodes in the big data domain. It facilitates reading, writing, and managing super large […]

Read more

Druid – A Data Store to Support History and Real-time Analytics

As a part of Perficient big data practice, I have been working on identifying some open source data stores and search frameworks that enable the user to quickly query what he needs, and to process massive events/message stream, in addition to several frameworks such as Spark, ELK, Hadoop, HBase, Cassandra, I get to know about […]

Read more

Story Points Estimation on a Data Warehouse Project

It has been decades since people started to develop data warehousing (DW) systems. In fact, most of the delivery strategy and approach on DW is to follow the traditional waterfall cycle – discovery, requirement, design, development, test, training, and transition, etc. With this pattern, only the previous step is completed; the team will start on […]

Read more

How to Load Log Data into HDFS using Flume

Data acquisition is a very important part of building a big data ecosystem. Data acquisition allows you to extract various types of data such as a file, DB, streaming, web page etc. If you are just setting up your local environment, not in the real business scenarios, you can resolve data acquisition by making use […]

Read more

Machine Learning in Local with Microsoft CNTK Package

In July 2016 there was an international joint conference on artificial intelligence held in New York City where many experts and professors meet together to share their recent research and the commercial use cases. Machine learning (ML), deep learning(DL) and natural language processing (NLP) were the hot topics on the agenda. There are some interesting […]

Read more

Fog Computing – Next Buzzwords?

There is no doubt that cloud computing has been the buzzword for several years and will continue to dominate the IT and business world for quite a long time. In the cloud computing world, the computation resources, storage, algorithm, application and big data analytics are centralized and their service is provided to the consumer just […]

Read more

Hangzhou Spark Meetup 2016

Last weekend there was a meetup in Hangzhou for the Spark community, and about 100 Spark users or committers attended. It was great to meet so many Spark developers, users and data scientists and to learn about recent Spark community update issues, road maps and real use cases. The event organizer delivered the first presentation […]

Read more

Continuous Integration in the Analytics Project

Many people may have known that Continuous Integration (CI), Continuous Delivery (CD) is great part of the agile activity. In the Java related project, there were lots of open source tool such as Hudson, Continuum, Jenkins etc to support this automation process. However, if you are going to look for some tools to support the […]

Read more

How to Load Oracle Data into SparkR Dataframe

In the Spark 1.4 and onward, it supplied various ways to enable user to load the external data source such as RDBMS, JSON, Parquet, and Hive file into SparkR. Ok, when we talk about SparkR, we would have to know something about R. Local data frame is a popular concept and data structure in R […]

Read more

SparkR for Data Scientists

Although the title Data Scientist is not mentioned as often as other IT job titles, it has been in the IT world for a while and is becoming more important with the popularity of the Internet and eCommerce. What kind of skills should a data scientist have? It could be a long list, but I […]

Read more

A Spark Example to MapReduce System Log File

In some aspects, the Spark engine is similar to Hadoop because both of them will do Map & Reduce over multiple nodes. The important concept in Spark is RDD (Resilient Distributed Datasets), by which we could operate over array, dataset and the text files. This example gives you some ideas on how to do map/reduce […]

Read more

Subscribe to the Weekly Blog Digest:

Sign Up