Booking options
£93.99
On-Demand course
3 hours 35 minutes
All levels
Big data certification for non-programmers, business analysts, testers, and SQL developers
This course will help you understand Hive and prepare you for the CCA159 (Cloudera Big Data Analyst) certification. You will start by delving into Hadoop and its distributed file system. Next, you'll become well-versed with the most common commands you'll need to work with Hadoop file systems. Later, you'll explore Apache Hive, starting with an introduction before moving on to external and managed tables. The next few sections will take you through insert and multi-insert. As you progress, the course will provide insights into different kinds of functions, such as collection, conditional, string, date, and mathematical functions. In addition, you'll learn to work with different file formats and compressions. By the end of this course, you'll have comprehensive knowledge of Hive and Sqoop and will have gained the skills you need to pass the CCA Data Analyst Exam. All code and supporting files are available at https://github.com/PacktPublishing/CCA-159-Expert-in-Big-Data-Analytics---Advance-Hive-Sqoop
Delve into Hive analysis
Get to grips with the ALTER TABLE command
Explore joins, multi-joins and Map joins
Work with different file formats such as Parquet and Avro
Understand partitioning and bucketing
Focus on views
Get up to speed with lateral views/explode
Delve into window functions - Rank/Dense Rank/Lead/Lag/Min/Max
Explore the window specification
This course is for anyone who wants to achieve CCA159 Cloudera Big Data Analyst certification or simply learn Hive and Sqoop.
This course systematically takes you through Hadoop and the Hadoop Distributed File System (HDFS), while also preparing you for the CCA Data Analyst Exam. You'll also get access to code and supporting files that will help you learn effectively.
Get to grips with Hive and Sqoop for big data analytics and ingestion
Become well-versed with the essential topics and concepts and achieve CCA159 (Cloudera Big Data Analyst) certification
Get to grips with data types and complex data types
Navdeep Kaur - Technical Trainer
Navdeep Kaur is a big data professional with 11 years of industry experience in different technologies and domains. She has a keen interest in providing training in new technologies. She holds the CCA175 Hadoop and Spark developer certification and the AWS solution architect certification. She loves guiding people and helping them achieve new goals.
1. Hadoop Introduction
2. Hive
3. Hive Data Types
4. Hive Functions
5. Hive Join
6. Working with Different File Formats & Compressions
7. Advanced Hive
8. Hive Window Functions
9. Sqoop Import
10. Sqoop Import