This 4-day training course is designed for developers who need to create applications that analyze Big Data stored in Apache Hadoop using Pig and Hive. Topics include Hadoop, YARN, HDFS, MapReduce, data ingestion, workflow definition, using Pig and Hive to perform data analytics on Big Data, and an introduction to Spark Core and Spark SQL.
DAY 1: AN INTRODUCTION TO THE HADOOP DISTRIBUTED FILE SYSTEM
OBJECTIVES
Understanding Hadoop and HDFS
Ingesting Data into HDFS
The MapReduce Framework
LABS
Starting an HDP Cluster
Demonstration: Understanding Block Storage
Using HDFS Commands
Importing RDBMS Data into HDFS
Exporting HDFS Data to an RDBMS
Importing Log Data into HDFS Using Flume
Demonstration: Understanding MapReduce
Running a MapReduce Job
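The "Running a MapReduce Job" lab exercises the classic map/shuffle/reduce flow. A minimal sketch of that flow in the Hadoop Streaming style, written in plain Python so it runs locally without a cluster (the sample sentences are made up for illustration):

```python
# Word count in the Hadoop Streaming style: the mapper emits "word\t1"
# pairs, and the reducer sums counts per word, relying on the shuffle
# phase having sorted its input by key. Here both phases run locally.
from itertools import groupby

def mapper(lines):
    """Emit one 'word\t1' record per word, like a streaming mapper."""
    for line in lines:
        for word in line.strip().split():
            yield f"{word}\t1"

def reducer(records):
    """Sum counts per word; input records must be sorted by word."""
    keyed = (r.split("\t") for r in records)
    for word, group in groupby(keyed, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"

if __name__ == "__main__":
    data = ["big data on hadoop", "big data with pig and hive"]
    # sorted() stands in for the shuffle/sort between map and reduce.
    for line in reducer(sorted(mapper(data))):
        print(line)
```

On a real cluster the same mapper and reducer scripts would be submitted with the Hadoop Streaming jar, and HDFS plus the shuffle phase would replace the in-memory list and `sorted()` call.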
DAY 2: AN INTRODUCTION TO APACHE PIG
OBJECTIVES
Introduction to Apache Pig
Advanced Apache Pig Programming
LABS
Demonstration: Understanding Apache Pig
Getting Started with Apache Pig
Exploring Data with Apache Pig
Splitting a Dataset
Joining Datasets with Apache Pig
Preparing Data for Apache Hive
Demonstration: Computing Page Rank
Analyzing Clickstream Data
Analyzing Stock Market Data Using Quantiles
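The "Analyzing Stock Market Data Using Quantiles" lab is done in Apache Pig; the underlying idea, binning values by quartile cut points, can be sketched in plain Python. The closing prices below are made-up sample data, not from the course:

```python
# Quartile binning: compute the three cut points that split a set of
# prices into four equal-sized groups, then assign each price a quartile.
from statistics import quantiles
from bisect import bisect

closing_prices = [31.2, 30.8, 33.5, 29.9, 35.1, 34.7, 32.0, 30.1]

# Three cut points dividing the data into quartiles Q1..Q4.
cuts = quantiles(closing_prices, n=4)

def quartile(price):
    """Return the 1-based quartile a price falls into."""
    return bisect(cuts, price) + 1

for p in sorted(closing_prices):
    print(f"{p:>6.2f} -> Q{quartile(p)}")
```

In the Pig version of this analysis the cut points would typically come from a grouped aggregate over the full stock dataset rather than an in-memory list.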
DAY 3: AN INTRODUCTION TO APACHE HIVE
OBJECTIVES
Apache Hive Programming
Using HCatalog
Advanced Apache Hive Programming
LABS
Understanding Hive Tables
Understanding Partition and Skew
Analyzing Big Data with Apache Hive
Demonstration: Computing NGrams
Joining Datasets in Apache Hive
Computing NGrams of Emails in Avro Format
Using HCatalog with Apache Pig
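The "Computing NGrams" demonstration uses Hive's built-in n-gram support to find the most frequent n-word sequences in text. The same idea, sketched in plain Python over a made-up sample sentence:

```python
# Find the k most common n-word sequences (n-grams) in a text, the
# concept behind Hive's ngrams() aggregate shown in the demonstration.
from collections import Counter

def top_ngrams(text, n, k):
    """Return the k most common n-grams as (word-tuple, count) pairs."""
    words = text.lower().split()
    # Slide an n-word window across the text by zipping shifted copies.
    grams = zip(*(words[i:] for i in range(n)))
    return Counter(grams).most_common(k)

sample = "big data needs big tools and big data needs big ideas"
print(top_ngrams(sample, n=2, k=3))
```

In the Day 3 labs the input would be Hive tables (including emails stored in Avro format) rather than a Python string, but the counting logic is the same.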
DAY 4: WORKING WITH SPARK CORE, SPARK SQL AND OOZIE
OBJECTIVES
Advanced Apache Hive Programming (Continued)
Hadoop 2 and YARN
Introduction to Spark Core and Spark SQL
Defining Workflow with Oozie
LABS
Advanced Apache Hive Programming
Running a YARN Application
Getting Started with Apache Spark
Exploring Apache Spark SQL
Defining an Apache Oozie Workflow
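The "Defining an Apache Oozie Workflow" lab wires actions together in a workflow XML file. A minimal sketch of such a workflow, with a single Hive action; the workflow name, script name, and property placeholders are illustrative, not taken from the course materials:

```xml
<!-- Minimal Oozie workflow sketch: start -> Hive action -> end,
     with a kill node on failure. Names and paths are placeholders. -->
<workflow-app name="daily-analysis" xmlns="uri:oozie:workflow:0.5">
    <start to="hive-node"/>
    <action name="hive-node">
        <hive xmlns="uri:oozie:hive-action:0.2">
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <script>analysis.q</script>
        </hive>
        <ok to="end"/>
        <error to="fail"/>
    </action>
    <kill name="fail">
        <message>Hive action failed: ${wf:errorMessage(wf:lastErrorNode())}</message>
    </kill>
    <end name="end"/>
</workflow-app>
```

Oozie resolves the `${jobTracker}` and `${nameNode}` placeholders from a job properties file supplied when the workflow is submitted.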