Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning tasks. Coursera's Apache Spark skill catalogue teaches you about this powerful tool for handling big data analytics. You'll learn the fundamentals of Spark's distributed computing model, its powerful data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also delve into Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning tasks. Master these aspects to enhance your data science skills and solve complex computational problems in various industries.
31credentials
72courses

Related roles

Gain the knowledge and skills you need to advance.

  • This role has a $137,984 median salary ¹.

    description:

    A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a $137,984 median salary ¹.

    Offered by

    DeepLearning.AI_logo
    Amazon Web Services_logo
    Google Cloud_logo

Most popular

Trending now

New releases

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Apache Spark Course Catalog

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Generative AI, LLM Application, Large Language Modeling, Predictive Modeling, Matplotlib, Keras (Neural Network Library), Generative Model Architectures, Deep Learning, ChatGPT, OpenAI, Generative AI Agents, Tensorflow, Seaborn, A/B Testing, Statistical Modeling, Data Visualization, Regression Analysis, Big Data, Machine Learning

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Hadoop, Apache Hive, Apache Spark, Big Data, Data Pipelines, Data Import/Export, Data Integration, Data Processing, Relational Databases, File Systems, Command-Line Interface, Configuration Management, Software Installation

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, MySQL, Data Pipelines, Apache Spark, Data Processing, SQL, Data Transformation, Data Manipulation, Distributed Computing, Programming Principles, Python Programming, Debugging

  • Status: New
    Status: Free Trial

    Skills you'll gain: AI Personalization, Data Manipulation, Apache Spark, Tensorflow, Deep Learning, Artificial Intelligence and Machine Learning (AI/ML), PyTorch (Machine Learning Library), Natural Language Processing, AWS SageMaker, Scalability, Applied Machine Learning, Data Processing, Supervised Learning, Dimensionality Reduction, Machine Learning, Pandas (Python Package), Predictive Modeling, Python Programming, Time Series Analysis and Forecasting, Artificial Neural Networks

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, Classification And Regression Tree (CART), Predictive Modeling, Applied Machine Learning, Statistical Machine Learning, Unsupervised Learning, Predictive Analytics, Random Forest Algorithm, Regression Analysis, Machine Learning Algorithms, Supervised Learning, Data Pipelines

  • Status: Free Trial

    Skills you'll gain: Big Data, Data Analysis, Statistical Analysis, Apache Hadoop, Data Wrangling, Apache Hive, Data Collection, Data Mart, Data Science, Data Warehousing, Data Visualization, Analytics, Data Cleansing, Apache Spark, Data Lakes, Data Visualization Software, Microsoft Excel

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Keras (Neural Network Library), Deep Learning, Tensorflow, A/B Testing, Big Data, Data Ethics, Applied Machine Learning, Data Processing, Machine Learning Software, Artificial Neural Networks, Machine Learning Algorithms, Data Cleansing, Machine Learning, MLOps (Machine Learning Operations), Artificial Intelligence, Supervised Learning, Statistical Hypothesis Testing, Dimensionality Reduction, Reinforcement Learning

  • Status: Free Trial

    University of Illinois Urbana-Champaign

    Skills you'll gain: Distributed Computing, Cloud Infrastructure, Cloud Services, Big Data, Apache Spark, Cloud Computing, Cloud Storage, Cloud Platforms, Network Architecture, Data Storage Technologies, Computer Networking, File Systems, Apache Hadoop, Network Infrastructure, Cloud Applications, Infrastructure As A Service (IaaS), Middleware, Data Storage, Software-Defined Networking, NoSQL

  • Status: Free Trial

    Skills you'll gain: NoSQL, Data Warehousing, SQL, Apache Hadoop, Extract, Transform, Load, Apache Airflow, Data Security, Linux Commands, Data Migration, Database Design, Data Governance, MySQL, Database Administration, Apache Spark, Data Pipelines, Apache Kafka, Database Management, Bash (Scripting Language), Data Store, Data Architecture

  • Status: New
    Status: Preview

    Northeastern University

    Skills you'll gain: Data Governance, Database Management, Database Systems, NoSQL, SQL, MongoDB, Relational Databases, Big Data, Graph Theory, Data Storage, Apache Hadoop, Data Manipulation, Apache Spark

  • Status: New
    Status: Free Trial

    Skills you'll gain: Feature Engineering, AWS SageMaker, Data Cleansing, Apache Spark, Extract, Transform, Load, Data Pipelines, Data Transformation, Amazon Web Services, Responsible AI, Data Quality, Data Integrity, Data Validation, Personally Identifiable Information, Machine Learning Methods

  • Status: Free Trial

    Skills you'll gain: Dataflow, Google Cloud Platform, Data Pipelines, Data Lakes, Looker (Software), Unstructured Data, Real Time Data, Cloud Engineering, Big Data, Data Warehousing, Data Infrastructure, Cloud Infrastructure, Cloud Storage, MLOps (Machine Learning Operations), Dashboard, Extract, Transform, Load, Tensorflow, Data Architecture, Data Processing, Apache Spark

What brings you to Coursera today?

Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • University of California San Diego
  • Amazon Web Services
  • Edureka
  • Pearson