BIG DATA WORKSHOP

(Batches Starts from 6th Nov, 2020)

About The Program:

With the belief to build a healthy ecosystem as per the Industry Standards REGEX Software brings a Skill Development Program (SDP) on “BIG DATA”. We organize this Skill Development Program for improving the knowledge and skills of the Students/Professionals, so that they can become expert in the field of Big Data.

Timing: 06:00 PM – 08:00 PM [IST]

Duration: 7 Days [15 Hours]

Platform: Google Meet

What you will learn:

  • Big Data Analytics & Hadoop
  • HDFS [ Hadoop Distributed File System ]
  • Map-Reduce [ Data Processing ]
  • HIVE
  • Apache Spark on Azure DataBricks
  • Neo4j Graph Analytics & NoSQL DataBase
  • Amazon EMR
  • Learn how to use these tools in the field of Data Analytics

Study Material:

  • Live Session + Access of  Recorded Lecture Videos
  • E-Notes and an ISO Certified Certificate
  • Assignments per day
  • Poll test per day
  • 15 hours on demand Live Video Lectures
  • 24*7 Mentorship Support

Course Content

S.
No.
Topic
1
Day 1
Big Data Analytics & Hadoop :
-> Introduction to the Term Big Data
-> Work of Big Data Analytics
-> Journey to Start as Data Scientist
-> 6 V’s of Big Data
-> Configuring Hadoop Environment in your Local System
-> Cloudera Installation
-> Introduction to Hadoop
HDFS [ Hadoop Distributed File System ] :
-> Components of HDFS – NameNode & DataNode
-> Hadoop Daemons
-> Hadoop Admin & Commands
2
Day 2
Map-Reduce [ Data Processing ] :
-> Introduction to Map-Reduce
-> Map-reduce V1 vs YARN
-> Introduction to YARN
-> FS Image & Secondary Namenode
a) Shuffle, Sorting & Partitioning
b) Map reduce Word-Count Problem – Example
3
Day 3
HIVE
-> Introduction to Hive
-> Difference between SQL & Hive
-> Database & Table Creation
-> Internal vs External Table
-> Functions
a) Date & String Functions
-> PARTITIONING & BUCKETING
a) Static vs Dynamic Partitioning
b) Buckets in Hive
S.
No.
Topic
4
Day 4-5
Apache Spark on Azure DataBricks
-> Introduction to Apache Spark
-> Usage & Workflow of Spark
-> Trick – Account creation on Azure DataBricks
-> RDD – Resilient Distributed DataSet
a) Transformation & Action [Operation]
-> RDD Vs DataFrame
-> DataFrame –
a) Creating DataFrame with several file formats
b) Benefits of using Dataframes
c) Manipulating Data Frame
d) Group By operation on Data Frame
-> Introduction to MLLib
a) Linear Regression – Case study
5
Day 6
Neo4j Graph Analytics & NoSQL DataBase
-> Graph Analytics – Introduction
-> Understanding of Graphs
-> Installing & Running Neo4j
-> HBase – Introduction
-> SQL VS NoSQL [Which is better]
-> NoSQL – Introduction
a) Creating Table & column family
b) Create, retrieve, update & delete operation
-> Hands on learning with NoSQL from scratch
6
Day 7
Amazon EMR
-> Introduction to Cloud
-> Running Hadoop Eco-system on cloud
-> Creating 100 Notes cluster within seconds
Capstone Project

Benefits of attending this Program :

  • Get ISO Certified Certification
  • Update you skill set in the world of technology that moves quickly
  • Learn at Accelerated pace in your busy schedule
  • Opportunity to Learn from Industry Experts
  • Rediscovering your passion of learning new things

Output:

  • It will help you in Data Analytics Domain
  • Able to think out of the box
  • Expertise in different Big Data Tools like HDFS, Hive, Apache Spark, Amazon EMR
  • Able to solve many Interview Questions of Top MNCs
  • Package of Data Analyst in Big MNCs starts from 6 LPA

Fee Structure

Indian Fee: ₹1500/- (Flat 80% off) => ₹300/-
International Fee: 50 USD (Flat 80% off) => 10 USD

Registration Closed

Our New Batch will start from 16th Nov,2020. Enroll in that batch by clicking following Link:-

For detailed Big Data Workshop curriculum press following pdf button