About The Course
Hive is an Open-source, peta-byte scale Data Warehousing framework based on Hadoop. Comprehensive Hive certified personnel are in demand in the fields of Database management to manage Big Data. On completing this course at LearnChase participants will have in-depth understanding of all the concepts in Hive.
After completing the Comprehensive Hive course at LearnChase, you will be able to:
1. Understand the benefits of Hive and HiveQL
2. Perform Data Analysis using Hive
Who should go for this course?
The course is designed for those who want to learn Hive and implement it in Hadoop framework. The following professionals can learn Comprehensive Hive:
1. Analytics Professionals
2. BI /ETL/DW Professionals
3. Project Managers
4. Testing Professionals
5. Mainframe Professionals
6. Software Developers and Architects
7. Graduates aiming to build a career in Big Data and Hadoop
What are the pre-requisites for this Course?
Understanding of Linux commands and SQL queries will be beneficial to learn Hive. It gives an opportunity to non-java professionals to work on Hadoop. Basic knowledge of core java will be helpful to work on UDF.
Project #1: Health & Hospital Management
Problem Statement: How many hospitals centres have more than 60% patient satisfaction with respect to cleanliness? Which hospital centre has the overall rating of either 9 or 10
Project #2: Country Project
Problem Statement: Sort the number of countries based on landmass. Find out the top 5 countries with sum of bars and strips in a flag. Count the number of countries with an icon in the flag. Count the number of countries which have the same colour in the top left and top right corners of their flag.
Why learn Comprehensive Hive ?
Hive is an essential tool that will help professionals to manage and control Big Data with ease. Hive brings the property of structuredness to Big Data when implemented in Hadoop and that is the reason Comprehensive Hive is a highly sought after course by professionals looking for expertise in Hadoop.
Learning Objectives – This module will help you in understanding concepts like Loading, Querying and Importing data in Hive.
Topics – Hive Background, Hive Use Case, About Hive, Hive vs Pig, Hive Architecture and Components, Meta-store in Hive, Limitations of Hive, Comparison with Traditional Database, Hive Data Types and Data Models, Partitions and Buckets, Hive Tables(Managed Tables and External Tables),Importing Data, Querying Data, Managing Outputs.
2. Advanced Hive
Learning Objectives – In this module you will learn more advanced concepts of Hive like the Hive querying language and Hive UDFs.
Topics – Hive Script, Hive UDF and Hive Demo on Healthcare Data set. Hive QL: Joining Tables, Dynamic Partitioning, Custom MapReduce Scripts, Thrift Server, User Defined Functions