Spark Project on Cloudera Hadoop(CDH) and GCP for Beginners




Spark Project on Cloudera Hadoop(CDH) and GCP for Beginners

  • In retail business, retail stores and eCommerce websites generates large amount of data in real-time.

  • There is always a need to process these data in real-time and generate insights which will be used by the business people and they make business decision to increase the sales in the retail market and provide better customer experience.

  • Since the data is huge and coming in real-time, we need to choose the right architecture with scalable storage and computation frameworks/technologies.

  • Hence we want to build the Data Processing Pipeline Using Apache NiFi, Apache Kafka, Apache Spark, Apache Cassandra, MongoDB, Apache Hive and Apache Zeppelin to generate insights out of this data.

  • The Spark Project is built using Apache Spark with Scala and PySpark on Cloudera Hadoop(CDH 6.3) Cluster which is on top of Google Cloud Platform(GCP).

Building Data Processing Pipeline Using Apache NiFi, Apache Kafka, Apache Spark, Cassandra, MongoDB, Hive and Zeppelin

Url: View Details

What you will learn
  • Complete Spark Project Development on Cloudera Hadoop and Spark Cluster
  • Fundamentals of Google Cloud Platform(GCP)
  • Setting up Cloudera Hadoop and Spark Cluster(CDH 6.3) on GCP

Rating: 4.5

Level: All Levels

Duration: 11 hours

Instructor: PARI MARGU


Courses By:   0-9  A  B  C  D  E  F  G  H  I  J  K  L  M  N  O  P  Q  R  S  T  U  V  W  X  Y  Z 

About US

The display of third-party trademarks and trade names on this site does not necessarily indicate any affiliation or endorsement of hugecourses.com.


© 2021 hugecourses.com. All rights reserved.
View Sitemap