Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, widely used for big data and machine learning workloads. Coursera's Apache Spark skill catalogue covers the fundamentals of Spark's distributed computing model, its data processing capabilities, and how to implement machine learning algorithms with Spark. It also covers Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning. Mastering these areas strengthens your data science skills and helps you solve complex computational problems across industries.
31 credentials
72 courses

Related roles

Gain the knowledge and skills you need to advance.

  • A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a $137,984 median salary ¹.

    Offered by DeepLearning.AI, Amazon Web Services, and Google Cloud

Explore the Apache Spark Course Catalog

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, Classification And Regression Tree (CART), Predictive Modeling, Applied Machine Learning, Statistical Machine Learning, Unsupervised Learning, Predictive Analytics, Random Forest Algorithm, Regression Analysis, Machine Learning Algorithms, Supervised Learning, Data Pipelines

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Analytics, Data Processing, Data Mapping, Text Mining, Distributed Computing, Java, Debugging, Java Programming

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Generative AI, LLM Application, Large Language Modeling, Predictive Modeling, Matplotlib, Keras (Neural Network Library), Generative Model Architectures, Deep Learning, ChatGPT, OpenAI, Generative AI Agents, Tensorflow, Seaborn, A/B Testing, Statistical Modeling, Data Visualization, Regression Analysis, Big Data, Machine Learning

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Hadoop, Apache Hive, Apache Spark, Big Data, Data Pipelines, Data Import/Export, Data Integration, Data Processing, Relational Databases, File Systems, Command-Line Interface, Configuration Management, Software Installation

  • Status: New
    Status: Free Trial

    Skills you'll gain: AI Personalization, Data Manipulation, Apache Spark, Tensorflow, Deep Learning, Artificial Intelligence and Machine Learning (AI/ML), PyTorch (Machine Learning Library), Natural Language Processing, AWS SageMaker, Scalability, Applied Machine Learning, Data Processing, Supervised Learning, Dimensionality Reduction, Machine Learning, Pandas (Python Package), Predictive Modeling, Python Programming, Time Series Analysis and Forecasting, Artificial Neural Networks

  • Status: New
    Status: Free Trial

    Skills you'll gain: Feature Engineering, AWS SageMaker, Data Cleansing, Apache Spark, Extract, Transform, Load (ETL), Data Pipelines, Data Transformation, Amazon Web Services, Responsible AI, Data Quality, Data Integrity, Data Validation, Personally Identifiable Information, Machine Learning Methods

  • Status: Free Trial

    Skills you'll gain: Big Data, Data Analysis, Statistical Analysis, Apache Hadoop, Data Wrangling, Apache Hive, Data Collection, Data Mart, Data Science, Data Warehousing, Data Visualization, Analytics, Data Cleansing, Apache Spark, Data Lakes, Data Visualization Software, Microsoft Excel

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Keras (Neural Network Library), Deep Learning, Tensorflow, A/B Testing, Big Data, Data Ethics, Applied Machine Learning, Data Processing, Machine Learning Software, Artificial Neural Networks, Machine Learning Algorithms, Data Cleansing, Machine Learning, MLOps (Machine Learning Operations), Artificial Intelligence, Supervised Learning, Statistical Hypothesis Testing, Dimensionality Reduction, Reinforcement Learning

  • Status: New

    Skills you'll gain: Docker (Software), CI/CD, Cloud Computing Architecture, Application Performance Management, Apache Spark, Google App Engine

  • Status: New
    Status: Preview

    Northeastern University

    Skills you'll gain: Data Governance, Database Management, Database Systems, NoSQL, SQL, MongoDB, Relational Databases, Big Data, Graph Theory, Data Storage, Apache Hadoop, Data Manipulation, Apache Spark

  • Status: Free Trial

    Skills you'll gain: PySpark, Databricks, Apache Spark, MLOps (Machine Learning Operations), Microsoft Azure, Big Data, Scikit Learn (Machine Learning Library), Applied Machine Learning, Data Processing, Deep Learning, Data Transformation, Machine Learning, Exploratory Data Analysis

  • Status: New
    Status: Free Trial

    Skills you'll gain: Databricks, Apache Spark, Microsoft Azure, PySpark, Data Lakes, Data Processing, Jupyter, File Systems, File Management, Cloud Storage, Cloud Computing Architecture

Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • University of California San Diego
  • Amazon Web Services
  • Edureka
  • Pearson