Upload
anand-pandey
View
267
Download
5
Embed Size (px)
Citation preview
OBJECTIVE
At the end of today’s module, we will able to understand:-
What is Big-data What is Hadoop Eco-system & its featureIs it right time to learn Big data Career path through Hadoop & Big-data technologies Pre- requities for HadoopJob role in Big-data industry Complete Course curriculum
V’s of Big-Data
VOLUME
VARIETYVELOCITY
Definition Lots of data (TeraBytes or PetaBytes) Big-data is term for a collection of data
set so large & complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications.
The challenges includes capture, curation, storage, search, sharing, transfer, analysis and visualization.
Definition-HADOOP Apache hadoop is a framework for the distributed
processing of large data sets across cluster of commodity computers using simple programming model.
It is an ecosystem of software packages, including MapReduce, HDFS, and a whole host of other software packages
Characteristics:- Reliable Scalable Flexible Economical
Working Domain •Travel•Retail•Finance•Healthcare•Advertising•Manufacturing•Telecommunications•Life Sciences •Media and Entertainment•Natural •Resources•Trade and Transportation•Government
Job-Roles
Employers
Forecast
Big data Career Path
Developer/Testing• Java/Python/Ruby,Scoop• Hadoop Eco-System | Big Data Hadoop PIG FLUME Map Reduce Design• NoSQL DB , PIG SCRIPT | Apache Spark & Scala Cassandra• Spark , HIVE, Oozie,Flume |----------------------------------------------- | Administration • Linux Administration• Cluster Management |• Cluster Performance |- Linux Administration Hadoop Admnistraton • Virtualization |------------------------------------------------|
Data Analyst• Statistical Skills• Machine Learning• Hadoop Essentials | Data Science Business Analytics Using R Talend
For Bigdata• Expertise in R | Data Visualization using Tableu -----------------------------------------------|
Course Curriculum
Module-1 Module-2 Understanding Big Data & Hadoop Hadoop Architecutre & HDFS
Module-3 Module-4 Hadoop MR Framework-I Hadoop MR Framework-II
Module-5 Module-6Advance Map Reduce PIG Latine
Module-7 Module-8 HIVE Hbase
Module-9 Module-10 Zookepar, Oozie Project
THANK YOU!