Big Data Workshop

(Batches start from 23rd Nov, 2020)

About The Program:

To help build a healthy ecosystem in line with industry standards, REGEX Software offers a Skill Development Program (SDP) on “Big Data” for students, professionals, and faculty members. The program is designed to improve participants' knowledge and skills so that they can build expertise in the field of Big Data.

Timing: 06:00 PM – 08:00 PM [IST]

Duration: 7 Days [15 Hours]

Platform: Google Meet

What You Will Learn

  • Big Data Analytics & Hadoop
  • HDFS [ Hadoop Distributed File System ]
  • Map-Reduce [ Data Processing ]
  • HIVE
  • Apache Spark on Azure Databricks
  • Neo4j Graph Analytics & NoSQL Database
  • Amazon EMR
  • Learn how to use these tools in the field of Data Analytics

Study Material

  • Live sessions + access to recorded sessions
  • E-notes and a certificate from an ISO-certified company
  • Daily assignments
  • Daily poll test
  • 15 hours of live video lectures, also available on demand
  • 24/7 mentorship support

Outcomes

  • A foundation in the data analytics domain
  • Ability to think outside the box
  • Expertise in Big Data tools such as HDFS, Hive, Apache Spark, and Amazon EMR
  • Ability to solve many interview questions asked by top MNCs
  • Data Analyst packages at big MNCs start from 6 LPA

Live Sessions

Live sessions are led by expert trainers, and access to recorded sessions is also available.

Live Projects

Get a chance to work on industry-oriented projects to apply what you learn.

24/7 Support

24/7 mentorship support is available to all students to clear their doubts.

ISO Certification

Get a certificate of workshop completion from an ISO-certified company.

Course Content

1. Day 1

Big Data Analytics & Hadoop:
-> Introduction to the Term Big Data
-> Work of Big Data Analytics
-> Journey to Start as a Data Scientist
-> 6 V’s of Big Data
-> Configuring Hadoop Environment in Your Local System
-> Cloudera Installation
-> Introduction to Hadoop

HDFS [ Hadoop Distributed File System ]:
-> Components of HDFS – NameNode & DataNode
-> Hadoop Daemons
-> Hadoop Admin & Commands

2. Day 2

Map-Reduce [ Data Processing ]:
-> Introduction to Map-Reduce
-> Map-Reduce V1 vs YARN
-> Introduction to YARN
-> FS Image & Secondary NameNode
   a) Shuffle, Sorting & Partitioning
   b) Map-Reduce Word-Count Problem – Example

3. Day 3

HIVE:
-> Introduction to Hive
-> Difference between SQL & Hive
-> Database & Table Creation
-> Internal vs External Table
-> Functions
   a) Date & String Functions
-> Partitioning & Bucketing
   a) Static vs Dynamic Partitioning
   b) Buckets in Hive

4. Day 4-5

Apache Spark on Azure Databricks:
-> Introduction to Apache Spark
-> Usage & Workflow of Spark
-> Trick – Account Creation on Azure Databricks
-> RDD – Resilient Distributed Dataset
   a) Transformation & Action [Operation]
-> RDD vs DataFrame
-> DataFrame
   a) Creating DataFrames from Several File Formats
   b) Benefits of Using DataFrames
   c) Manipulating DataFrames
   d) Group By Operation on DataFrames
-> Introduction to MLlib
   a) Linear Regression – Case Study

5. Day 6

Neo4j Graph Analytics & NoSQL Database:
-> Graph Analytics – Introduction
-> Understanding of Graphs
-> Installing & Running Neo4j
-> HBase – Introduction
-> SQL vs NoSQL [Which is better]
-> NoSQL – Introduction
   a) Creating Table & Column Family
   b) Create, Retrieve, Update & Delete Operations
-> Hands-on Learning with NoSQL from Scratch

6. Day 7

Amazon EMR:
-> Introduction to Cloud
-> Running Hadoop Ecosystem on Cloud
-> Creating a 100-Node Cluster within Seconds

Capstone Project

Fee Structure

Indian Fee: ₹1500/- (Flat 80% off) => ₹300/-
International Fee: 50 USD (Flat 80% off) => 10 USD

Enroll Now
