About The Program:
With the belief to build a healthy ecosystem as per the Industry Standards REGEX Software brings a Skill Development Program (SDP) on “BIG DATA”. We organize this Skill Development Program for improving the knowledge and skills of the Students/Professionals, so that they can become expert in the field of Big Data.
Timing: 06:00 PM – 08:00 PM [IST]
Duration: 7 Days [15 Hours]
Platform: Google Meet
What you will learn:
Study Material:
24*7 Mentorship Support
S. No. | Topic |
---|---|
1 | Day 1 |
Big Data Analytics & Hadoop :
| |
-> Introduction to the Term Big Data -> Work of Big Data Analytics -> Journey to Start as Data Scientist -> 6 V’s of Big Data -> Configuring Hadoop Environment in your Local System -> Cloudera Installation -> Introduction to Hadoop | |
HDFS [ Hadoop Distributed File System ] :
| |
-> Components of HDFS – NameNode & DataNode -> Hadoop Daemons -> Hadoop Admin & Commands | |
2 | Day 2 |
Map-Reduce [ Data Processing ] : | |
-> Introduction to Map-Reduce -> Map-reduce V1 vs YARN -> Introduction to YARN -> FS Image & Secondary Namenode a) Shuffle, Sorting & Partitioning b) Map reduce Word-Count Problem – Example | |
3 | Day 3 |
HIVE | |
-> Introduction to Hive -> Difference between SQL & Hive -> Database & Table Creation -> Internal vs External Table -> Functions a) Date & String Functions -> PARTITIONING & BUCKETING a) Static vs Dynamic Partitioning b) Buckets in Hive |
S. No. | Topic |
---|---|
4 | Day 4-5 |
Apache Spark on Azure DataBricks | |
-> Introduction to Apache Spark -> Usage & Workflow of Spark -> Trick – Account creation on Azure DataBricks -> RDD – Resilient Distributed DataSet a) Transformation & Action [Operation] -> RDD Vs DataFrame -> DataFrame – a) Creating DataFrame with several file formats b) Benefits of using Dataframes c) Manipulating Data Frame d) Group By operation on Data Frame -> Introduction to MLLib a) Linear Regression – Case study | |
5 | Day 6 |
Neo4j Graph Analytics & NoSQL DataBase | |
-> Graph Analytics – Introduction -> Understanding of Graphs -> Installing & Running Neo4j -> HBase – Introduction -> SQL VS NoSQL [Which is better] -> NoSQL – Introduction a) Creating Table & column family b) Create, retrieve, update & delete operation -> Hands on learning with NoSQL from scratch | |
6 | Day 7 |
Amazon EMR | |
-> Introduction to Cloud -> Running Hadoop Eco-system on cloud -> Creating 100 Notes cluster within seconds | |
Capstone Project |
Benefits of attending this Program :
Output:
Indian Fee: ₹1500/- (Flat 80% off) => ₹300/-
International Fee: 50 USD (Flat 80% off) => 10 USD
WhatsApp us