Databricks Fundamentals & Apache Spark Core
Databricks Fundamentals & Apache Spark Core
Welcome to this course on Databricks and Apache Spark 2.4 and 3.0.0
Apache Spark is a Big Data Processing Framework that runs at scale.
In this course, we will learn how to write Spark Applications using Scala and SQL.
Databricks is a company founded by the creator of Apache Spark.
Databricks offers a managed and optimized version of Apache Spark that runs in the cloud.
The main focus of this course is to teach you how to use the DataFrame API & SQL to accomplish tasks such as:
- Write and run Apache Spark code using Databricks 
- Read and Write Data from the Databricks File System - DBFS 
- Explain how Apache Spark runs on a cluster with multiple Nodes 
Use the DataFrame API and SQL to perform data manipulation tasks such as
- Selecting, renaming and manipulating columns 
- Filtering, dropping and aggregating rows 
- Joining DataFrames 
- Create UDFs and use them with DataFrame API or Spark SQL 
- Writing DataFrames to external storage systems 
List and explain the element of Apache Spark execution hierarchy such as
- Jobs 
- Stages 
- Tasks 
Learn how to process big-data using Databricks & Apache Spark 2.4 and 3.0.0 - DataFrame API and Spark SQL
Url: View Details
What you will learn
- Databricks
- Apache Spark Architecture
- Apache Spark DataFrame API
 
                        
            Rating: 4.44737
Level: Beginner Level
Duration: 12 hours
Instructor: Wadson Guimatsa
Courses By: 0-9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
About US
The display of third-party trademarks and trade names on this site does not necessarily indicate any affiliation or endorsement of hugecourses.com.
View Sitemap