Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning tasks. Coursera's Apache Spark skill catalogue teaches you about this powerful tool for handling big data analytics. You'll learn the fundamentals of Spark's distributed computing model, its powerful data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also delve into Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning tasks. Master these aspects to enhance your data science skills and solve complex computational problems in various industries.
31credentials
72courses

Related roles

Gain the knowledge and skills you need to advance.

  • This role has a $137,984 median salary ¹.

    description:

    A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a $137,984 median salary ¹.

    Offered by

    DeepLearning.AI_logo
    Amazon Web Services_logo
    Google Cloud_logo

Most popular

Trending now

New releases

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Apache Spark Course Catalog

  • Status: Preview

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Apache Spark, Scala Programming, Apache Hadoop, Big Data, Data Manipulation, Distributed Computing, Data Processing, Performance Tuning, SQL, Programming Principles

  • Status: Free

    Skills you'll gain: Apache Spark, Data Pipelines, PySpark, Real Time Data, Query Languages, Data Transformation, SQL, Data Processing, Data Analysis

  • Skills you'll gain: Databricks, Data Lakes, Data Pipelines, Data Integration, Dashboard, PySpark, SQL, Apache Spark, Data Management, Data Transformation, Version Control

  • Skills you'll gain: Apache Spark, Managed Services, Google Cloud Platform, Big Data, Apache Hadoop, Data Management, Servers

  • Status: Preview

    Skills you'll gain: Azure Synapse Analytics, Data Warehousing, Power BI, Data Integration, Data Architecture, Data Visualization Software, Microsoft Azure, Apache Spark, Database Management, Data Pipelines, Performance Tuning, Data Processing, Data Security, Scalability

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, PySpark, Big Data, Data Pipelines, Data Processing, Distributed Computing, JSON, Data Manipulation, Data Transformation, Performance Tuning

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Scala Programming, Apache Hadoop, Apache Maven, Real Time Data, Data Processing, Scalability, Data Structures, Object Oriented Programming (OOP), Systems Integration

  • Status: New
    Status: Free Trial

    Skills you'll gain: Extract, Transform, Load, Apache Spark, Data Pipelines, PySpark, Apache Hadoop, Data Transformation, MySQL, Data Manipulation, Java Platform Enterprise Edition (J2EE), Data Store, Data Import/Export, Development Environment, Software Installation, System Configuration

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Kafka, Apache Spark, Apache Hadoop, Scala Programming, Real Time Data, Apache Hive, Command-Line Interface, Distributed Computing, Data Processing, Big Data, Apache, Apache Cassandra, Applied Machine Learning, Data Pipelines, Java, Databases, MongoDB, IntelliJ IDEA, NoSQL, Application Deployment

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Snowflake Schema, Star Schema, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Data Modeling, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Analytics, Data Processing, Data Mapping, Text Mining, Distributed Computing, Java, Debugging, Java Programming

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Generative AI, LLM Application, Large Language Modeling, Predictive Modeling, Matplotlib, Keras (Neural Network Library), Generative Model Architectures, Deep Learning, ChatGPT, OpenAI, Generative AI Agents, Tensorflow, Seaborn, A/B Testing, Statistical Modeling, Data Visualization, Regression Analysis, Big Data, Machine Learning

What brings you to Coursera today?

Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • University of California San Diego
  • Amazon Web Services
  • Edureka
  • Pearson