Create an event

HDP Developer: Apache Pig and Hive - Hortonworks Official Curriculum

From Mon 10 December 2018 to Thu 13 December 2018
9:00 AM - 5:00 PM
Ended


COURE OVERVIEW THIS 4 DAY TRAINING COURSE IS DESIGNED FOR DEVELOPERS WHO NEED TO CREATE APPLICATIONS TO ANALYZE BIG DATA STORED IN APACHE HADOOP USING PIG AND HIVE. TOPICS INCLUDE: HADOOP, YARN, HDFS, MAPREDUCE, DATA INGESTION, WORKFLOW DEFINITION, USING PIG AND HIVE TO PERFORM DATA ANALYTICS ON BIG DATA AND AN INTRODUCTION TO SPARK CORE AND SPARK SQL. COURSE CONTENT DAY 1: AN INTRODUCTION TO THE HADOOP DISTRIBUTED FILE SYSTEM OBJECTIVES * Understanding Hadoop * The Hadoop Distributed File System * Ingesting Data into HDFS * The MapReduce Framework LABS * Starting an HDP Cluster * Demonstration: Understanding Block Storage * Using HDFS Commands * Importing RDBMS Data into HDFS * Exporting HDFS Data to an RDBMS * Importing Log Data into HDFS Using Flume * Demonstration: Understanding MapReduce * Running a MapReduce Job DAY 2: AN INTRODUCTION TO APACHE PIG OBJECTIVES * Introduction to Apache Pig * Advanced Apache Pig Programming LABS * Demonstration: Understanding Apache Pig * Getting Starting with Apache Pig * Exploring Data with Apache Pig * Splitting a Dataset * Joining Datasets with Apache Pig * Preparing Data for Apache Hive * Demonstration: Computing Page Rank * Analyzing Clickstream Data * Analyzing Stock Market Data Using Quantiles DAY 3: AN INTRODUCTION TO APACHE HIVE OBJECTIVES * Apache Hive Programming * Using HCatalog * Advanced Apache Hive Programming LABS * Understanding Hive Tables * Understanding Partition and Skew * Analyzing Big Data with Apache Hive * Demonstration: Computing NGrams * Joining Datasets in Apache Hive * Computing NGrams of Emails in Avro Format * Using HCatalog withApachePig DAY 4: WORKING WITH SPARK CORE, SPARK SQL AND OOZIE OBJECTIVES * Advanced Apache Hive Programming (Continued) * Hadoop 2 and YARN * Introduction to Spark Core and Spark SQL * Defining Workflow with Oozie LABS * Advanced Apache Hive Programming * Running a YARN Application * Getting Started with Apache Spark * Exploring Apache Spark SQL * Defining an Apache Oozie Workflow
culture sports courses
123 Views
14/12/2018 Last update

AXA Tower
8 Shenton Way, Singapore, 68811, Singapore, Singapore

View event details


Are you an event organizer?
Create events for free. They will be immediately recommended to interested users.

  1. Agilitics Pte. Ltd.
  2. HDP Developer: Apache Pig and Hive - Hortonworks Official Curriculum