Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, widely used for big data and machine learning workloads. Coursera's Apache Spark skill catalogue covers the fundamentals of Spark's distributed computing model, its data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also work with Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning. Mastering these areas strengthens your data science skills and helps you solve complex computational problems across industries.
31 credentials
72 courses

Related roles

Gain the knowledge and skills you need to advance.

  • A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a $137,984 median salary ¹.

    Offered by

    DeepLearning.AI
    Amazon Web Services
    Google Cloud

Explore the Apache Spark Course Catalog

  • Status: Free Trial

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Scala Programming, Apache Spark, Apache Hadoop, User Interface (UI), Programming Principles, Big Data, Software Design, Data Structures, Software Design Patterns, Functional Design, Data Manipulation, Object Oriented Programming (OOP), Heat Maps, Data Visualization Software, Interactive Data Visualization, Distributed Computing, Computer Programming, Data Processing, Real Time Data, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: PySpark, Snowflake Schema, Databricks, Data Pipelines, Apache Spark, MLOps (Machine Learning Operations), Apache Hadoop, Big Data, Data Warehousing, Data Quality, Data Integration, Data Processing, DevOps, Data Transformation, SQL, Python Programming

  • Status: New
    Status: Free Trial

    Skills you'll gain: Databricks, CI/CD, Apache Spark, Microsoft Azure, Data Governance, Data Lakes, Data Architecture, Real Time Data, PySpark, Data Pipelines, Data Integration, Data Management, Automation, Data Storage, System Testing, Data Processing, Jupyter, Data Quality, User Provisioning, File Systems

  • Status: Free Trial

    Skills you'll gain: Big Data, Apache Hadoop, Serverless Computing, Applied Machine Learning, Apache Spark, Looker (Software), Cloud Computing, Artificial Intelligence and Machine Learning (AI/ML), Machine Learning, Jupyter, SQL, Data Transformation, Artificial Intelligence, Scalability, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Data Store, Extract/Transform/Load (ETL), Data Architecture, Data Pipelines, Big Data, Data Warehousing, Data Governance, Apache Hadoop, Relational Databases, Apache Spark, Data Lakes, Databases, SQL, NoSQL, Data Security, Data Science

  • Status: New
    Status: Free Trial

    Skills you'll gain: Responsible AI, MLOps (Machine Learning Operations), Artificial Intelligence and Machine Learning (AI/ML), Jenkins, CI/CD, Java, Continuous Deployment, Java Programming, Artificial Intelligence, Apache Spark, Applied Machine Learning, Decision Tree Learning, Deep Learning, Machine Learning, Fraud Detection, Spring Boot, Natural Language Processing, Regression Analysis, Reinforcement Learning, Debugging

  • Coursera Project Network

    Skills you'll gain: PySpark, Matplotlib, Apache Spark, Big Data, Data Processing, Distributed Computing, Data Management, Data Visualization, Data Analysis, Data Manipulation, Data Cleansing, Query Languages, Python Programming

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Data Warehousing, Extract/Transform/Load (ETL), IBM DB2, IBM Cognos Analytics, Big Data, Databases, PostgreSQL, Relational Databases, Data Infrastructure, Data Architecture, NoSQL, Data Pipelines, Applied Machine Learning, MongoDB, SQL, MySQL, Data Analysis, Dashboard, Python Programming

  • Status: Free Trial

    Skills you'll gain: Extract/Transform/Load (ETL), Apache Spark, Data Pipelines, Data Integration, Big Data, Data Infrastructure, Data Processing, Dataflow, Data Management, Data Architecture, Scalability

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Data Pipelines, MLOps (Machine Learning Operations), PySpark, Application Deployment, IBM Cloud, Machine Learning, Containerization, Data Science, Python Programming, Performance Tuning, Scalability

  • Status: New
    Status: Free Trial

    Skills you'll gain: Java, Java Programming, Apache Spark, Applied Machine Learning, Deep Learning, Data Processing, Application Deployment, Natural Language Processing, Data Cleansing, Machine Learning Algorithms, Machine Learning, Feature Engineering, Data Transformation, Scalability, Artificial Neural Networks, Regression Analysis, Interoperability

  • Status: Preview

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Apache Spark, Scala Programming, Apache Hadoop, Big Data, Data Manipulation, Distributed Computing, Data Processing, Performance Tuning, SQL, Programming Principles

Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • University of California San Diego
  • Amazon Web Services
  • Edureka
  • Pearson