In this module you will learn everything you need to know about big data fundamentals and Hadoop.
We have covered the following topics in this module.
Session 1: Introduction to big data and hadoop
Session 2: Comparision between spark and hadoop, on-prem and cloud
Session 3: Types of storage systems - Databases, Data warehouse and Data lake
Session 4: Distributes storage system (HDFS) - Explained in depth
Session 5: Linux and HDFS commands
Session 6: Distributed processing engine (MapReduce) - Explained in depth
Hi there,
Currently, I am working as a Data Engineer at ThoughtWorks. In my 3 years of experience I have worked on some amazing projects and I will be sharing here everything I have learnt so far as well as new tools coming in the market.