Developer Training for Spark and Hadoop

 4 Days
Delivery Methods
 VILT    Private Group

This training course is the best preparation for the challenges faced by Hadoop developers. Participants will learn to identify which tool is the right one to use in a given situation, and will gain hands-on experience in developing using those tools.


Upcoming Class Dates and Times

This class is not currently scheduled.
Contact us and we will help you get the training you need!

Who Should Attend

Hadoop Developers

Course Objectives

    Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as: How data is distributed, stored, and processed in a Hadoop cluster How to use Sqoop and Flume to ingest data How to process distributed data with Apache Spark How to model structured data as tables in Impala and Hive How to choose the best data storage format for different data usage patterns Best practices for data storage


1 - Course Outline
  • Introduction
  • Introduction to Hadoop and the Hadoop Ecosystem
  • Hadoop Architecture and HDFS
  • Importing Relational Data with Apache Sqoop
  • Introduction to Impala and Hive
  • Modeling and Managing Data with Impala and Hive
  • Data Formats
  • Data Partitioning
  • Capturing Data with Apache Flume
  • Spark Basics
  • Working with RDDs in Spark
  • Writing and Deploying Spark Applications
  • Parallel Programming with Spark
  • Spark Caching and Persistence
  • Common Patterns in Spark Data Processing
  • Spark SQL and DataFrames
  • Conclusion

Do You Have Additional Questions? Please Contact Us Below.

contact us contact us 
Contact Us about Starting Your Business Training Strategy with United Training