This course introduces the fundamentals of modern data processing for data engineers, analysts, and IT professionals. You will learn the basics of Hadoop MapReduce, including how it works, how to compile and run Java MapReduce programs, and how to debug and extend them using other languages. The course includes practical exercises such as word counts across multiple files, log file analysis, and large-scale text processing with datasets like Wikipedia. You will also cover advanced MapReduce features and use tools like Yarn and the Job Browser. The course then covers higher-level tools such as Apache Pig and Hive QL for managing data workflows and running SQL-like queries. Finally, you will work with Apache Spark and PySpark to gain experience with modern data analytics platforms. By the end of the course, you will have practical skills to work with big data in various environments.

This Labor Day, enjoy $120 off Coursera Plus. Unlock access to 10,000+ programs. Save today.


Hadoop and Spark Fundamentals: Unit 2
This course is part of Hadoop and Spark Fundamentals Specialization

Instructor: Pearson
Included with
Recommended experience
What you'll learn
Understand and implement Hadoop MapReduce for distributed data processing, including compiling, running, and debugging applications.
Apply advanced MapReduce techniques to real-world scenarios such as log analysis and large-scale text processing.
Utilize higher-level tools like Apache Pig and Hive QL to streamline data workflows and perform complex queries.
Gain hands-on experience with Apache Spark and PySpark for modern, scalable data analytics.
Skills you'll gain
Details to know

Add to your LinkedIn profile
August 2025
4 assignments
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There is 1 module in this course
This module introduces the core components of big data processing with Hadoop and Spark. It covers the fundamentals of Hadoop MapReduce, including its operation, programming, and debugging, followed by practical examples such as word count, log analysis, and benchmarking. The module then explores higher-level tools like Apache Pig and Hive for simplified data processing. Finally, it introduces Apache Spark and its Python interface, PySpark, highlighting Spark’s growing role in data analytics.
What's included
20 videos4 assignments
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Explore more from Data Management
Why people choose Coursera for their career





Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.
More questions
Financial aid available,