About The Course
On course completion the learners will have in-depth understanding of the various concepts in Pig. The learners will be able to write complex MapReduce transformations using a simple scripting language called Pig Latin. It helps learners to work on Hadoop framework without using Java.
After the completion of the Comprehensive Pig course at LearnChase, you will be able to:
1. Understand the benefits and features of Pig
2. Perform Data Analytics using Pig
3. Learn to write your own functions to do special-purpose processing
4. Easy programming of Complex tasks involving interrelated data transformations encoded as data flow sequences
5. Understand advanced Pig concepts like Relational Operators, File Loaders, Group & Co-group operator, Union, Pig UDF
6. Work on a Real life project in Pig Latin and gain hands on Project Experience
Who should go for this Course?
The course is designed for all those who want to learn Pig and implement it in Hadoop. The following professionals can learn Pig:
1. Analytics Professionals
2. BI /ETL/DW Professionals
3. Project Managers
4. Testing Professionals
5. Mainframe Professionals
6. Software Developers and Architects
7. Graduates aiming to build a career in Big Data and Hadoop
Towards the end of the Course, you will be working on a live project where you will be using PIG to perform Big Data analytics. Here are the few Industry-wise Big Data case studies that you will work on:
Project #1: Analysing Aadhar Card Data
Industry: Government Sector
Data: The data set consists of the following fields: State:This field consists of the state names from all over India City:This field consists of city names in all states Approved:This field consists of the total count of approved cards in numbers Rejected:This field consists of the total count of rejected cards in numbers
Problem Statement: Below are few of the problem statements that we have chosen to work on this data set: 1.Find out the total number of cards approved by states. 2.Find out the total number of cards rejected by states. 3.Find out the total number of cards approved by cities. 4.Find out the total number of cards rejected by cities.
Project #2: Analysis of Afghan War Diaries
Industry: Government Sector
Data: The data was written by soldiers and intelligence officers of the United States Military. To keep it simple, we will analyse only four of the available columns (Type, Category, Region and Attack On) in the data set.
Problem Statement: Below are few of the problem statement that we have chosen to work on this data set: 1.To examine all the events that involve explosive hazards. 2.To examine explosive events that involves Improvised Explosive Devices (IEDs).
Why Learn Comprehensive Pig ?
Pig is a very important tool that helps professionals to manage and control Big Data with ease as it can be very effectively implemented in Hadoop. Working on Pig doesn’t require Java knowledge. Pig certified professionals are in huge demand in industries managing huge chunk of data.
Learning Objectives – In this module, you will learn the basics of Pig, types of use cases where Pig van be used, tight coupling between Pig and Map Reduce, and Pig Latin scripting.
Topics – About Pig, Map Reduce Vs Pig, Pig Use Cases, Programming Structure in Pig, Pig Running Modes, Pig components, Pig Execution, Pig Latin Program, Data Models in Pig, Pig Data Types, Project work
2. Advanced Pig
Learning Objectives -In this module you will learn more advanced concepts in Pig like Pig Latin and the different functions and tools like Operators and loaders etc.
Topics – Pig Latin : Relational Operators, File Loaders, Group Operator, CO GROUP Operator, Joins and CO GROUP, Union, Diagnostic Operators, Pig UDF, Pig Demo on Healthcare Data set, Project work.