Big Data Analyst

Big Data Analytics Course in Singapore

Duration: 4 Days

Venue: 10 Anson Road, 26-08A International Plaza, Singapore 079903

About Course: Hadoop is an open-source distributed computing framework that data scientists use to perform highly parallel operations on big data. In this course you will learn:

  • How to analyze data using Pig, Hive and YARN
  • How to configure the Hadoop Distributed File System (HDFS)
  • How to perform processing and ingestion using MapReduce

Apache Hive is a tool of choice for many data scientists because it lets them use a familiar SQL-like syntax to derive insights from data stored in Hadoop, the kind of information businesses need in order to plan effectively.

This course shows how to use Hive to process, structure, and optimize your data. It also shows how to use Hue, the Hadoop user interface, to run HiveQL queries when analyzing data.

Target Student: All professionals who work late or overtime, have backlogs of work, and are not punctual.

Big Data Training Course Outline

Module 1: Big Data Basics

  • Understanding Hadoop fundamentals
  • Introduction to Pig
  • Basic data analysis with Pig
  • Processing complex data with Pig
  • Multi-dataset operations with Pig

Module 2: Optimising Data

  • Pig Troubleshooting and Optimization
  • Introduction to Hive and Impala
  • Querying with Hive and Impala
  • Data Management
  • Data Storage and Performance

Module 3: Analysis of Data

  • Relational Data Analysis
  • Working with Impala
  • Analyzing Complex Data and Text with Hive
  • Hive Optimization

Module 4: Tools for Data Analysis

  • Extending Hive
  • Choosing the Best Tool for the Job
  • Use cases
  • Calculating the best possible location
  • Analyzing mp3 files

Our Clients