Secure AI: Red-Teaming & Safety Filters

Enjoy unlimited growth with a year of Coursera Plus for $199 (regularly $399). Save now.

Secure AI: Red-Teaming & Safety Filters

Instructors: Brian Newman

Included with

Learn more

3 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

3 hours to complete

Flexible schedule

Learn at your own pace

3 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

3 hours to complete

Flexible schedule

Learn at your own pace

What you'll learn

Design red-teaming scenarios to identify vulnerabilities and attack vectors in large language models using structured adversarial testing.
Implement content-safety filters to detect and mitigate harmful outputs while maintaining model performance and user experience.
Evaluate and enhance LLM resilience by analyzing adversarial inputs and developing defense strategies to strengthen overall AI system security.

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 3 modules in this course

As large language models revolutionize business operations, sophisticated attackers exploit AI systems through prompt injection, jailbreaking, and content manipulation—vulnerabilities that traditional security tools cannot detect. This intensive course empowers AI developers, cybersecurity professionals, and IT managers to systematically identify and mitigate LLM-specific threats before deployment. Master red-teaming methodologies using industry-standard tools like PyRIT, NVIDIA Garak, and Promptfoo to uncover hidden vulnerabilities through adversarial testing. Learn to design and implement multi-layered content-safety filters that block sophisticated bypass attempts while maintaining system functionality. Through hands-on labs, you'll establish resilience baselines, implement continuous monitoring systems, and create adaptive defenses that strengthen over time.

This course is designed for AI engineers, security professionals, data scientists, and developers interested in ensuring the safety and robustness of AI models. It’s also ideal for technology leaders seeking to implement secure, responsible AI frameworks within their organizations. Learners should have a basic understanding of machine learning, AI model architecture, and programming concepts. No prior experience with AI red-teaming or safety systems is required. By end of this course, you'll confidently conduct professional AI security assessments, deploy robust safety mechanisms, and protect LLM applications from evolving attack vectors in production environments.

This module introduces participants to the systematic creation and execution of red-teaming scenarios targeting large language models. Students learn to identify common vulnerability categories including prompt injection, jailbreaking, and data extraction attacks. The module demonstrates how to design realistic adversarial scenarios that mirror real-world attack patterns, using structured methodologies to probe LLM weaknesses. Hands-on demonstrations show how red-teamers simulate malicious user behavior to uncover security gaps before deployment.

What's included

4 videos2 readings1 peer review

4 videosTotal 27 minutes

Welcome to Secure AI Red-Teaming & Safety Filters2 minutes
Understanding AI Attack Vectors and Vulnerability Categories5 minutes
Designing Effective Red-Teaming Scenarios6 minutes
Hands-On Vulnerability Discovery with Automated Tools12 minutes

2 readingsTotal 10 minutes

Welcome to the Course: Course Overview5 minutes
LLM Red Teaming Guide (Open Source): Systematically Testing Large Language Models for Vulnerabilities5 minutes

1 peer reviewTotal 15 minutes

Hands-On-Learning: Red-Team Assessment of ChatAssist Customer Service Bot15 minutes

This module covers the design, implementation, and evaluation of content-safety filters for LLM applications. Participants explore multi-layered defense strategies including input sanitization, output filtering, and behavioral monitoring systems. The module demonstrates how to configure safety mechanisms that balance security with functionality, and shows practical testing methods to validate filter effectiveness against sophisticated bypass attempts. Real-world examples illustrate the challenges of maintaining robust content filtering while preserving user experience.

What's included

3 videos1 reading1 peer review

3 videosTotal 25 minutes

Multi-Layered Content-Safety Filter Architecture7 minutes
Implementing and Configuring Safety Filters for Production8 minutes
Testing Filter Effectiveness Against Bypass Attempts9 minutes

1 readingTotal 5 minutes

The Landscape of LLM Guardrails: Intervention Levels and Techniques5 minutes

1 peer reviewTotal 20 minutes

Hands-On-Learning: Safety Filter Implementation for SecureChat Enterprise Bot20 minutes

This module focuses on comprehensive resilience testing and systematic improvement of AI system robustness. Students learn to conduct thorough security assessments that measure LLM resistance to adversarial inputs, evaluate defense mechanism effectiveness, and identify areas for improvement. The module demonstrates how to establish baseline security metrics, implement iterative hardening processes, and validate improvements through continuous testing. Participants gain skills in developing robust AI systems that maintain integrity under real-world adversarial conditions.

What's included

4 videos1 reading1 assignment2 peer reviews

4 videosTotal 30 minutes

Establishing Baseline Security Metrics and Resilience Benchmarks5 minutes
Continuous Testing and Automated Vulnerability Assessment6 minutes
Systematic Security Improvement and Adaptive Hardening14 minutes
Course Wrap-Up3 minutes

1 readingTotal 5 minutes

10 LLM Security Tools to Know in 20255 minutes

1 assignmentTotal 20 minutes

Secure AI: Red-Teaming & Safety Filters20 minutes

2 peer reviewsTotal 80 minutes

Hands-On-Learning: Resilience Assessment and Continuous Hardening of DataSecure AI Assistant20 minutes
Project: SecureBank AI Chatbot Security Audit & Implementation 60 minutes

Instructors

Brian Newman

Coursera

3 Courses925 learners

Offered by

Coursera

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Secure AI: Red-Teaming & Safety Filters

What you'll learn

Skills you'll gain

Details to know

See how employees at top companies are mastering in-demand skills

There are 3 modules in this course

Red-Teaming Scenarios for LLM Vulnerabilities

What's included

4 videosTotal 27 minutes

2 readingsTotal 10 minutes

1 peer reviewTotal 15 minutes

Content-Safety Filters: Implementation and Testing

What's included

3 videosTotal 25 minutes

1 readingTotal 5 minutes

1 peer reviewTotal 20 minutes

Testing LLM Resilience and Improving AI Robustness

What's included

4 videosTotal 30 minutes

1 readingTotal 5 minutes

1 assignmentTotal 20 minutes

2 peer reviewsTotal 80 minutes

Instructors

Offered by

Why people choose Coursera for their career

Open new doors with Coursera Plus

Advance your career with an online degree

Join over 3,400 global companies that choose Coursera for Business

Frequently asked questions

More questions

Secure AI: Red-Teaming & Safety Filters

What you'll learn

Skills you'll gain

Details to know

See how employees at top companies are mastering in-demand skills

There are 3 modules in this course

Red-Teaming Scenarios for LLM Vulnerabilities

What's included

Content-Safety Filters: Implementation and Testing

What's included

Testing LLM Resilience and Improving AI Robustness

What's included

Instructors

Offered by

Why people choose Coursera for their career

Open new doors with Coursera Plus

Advance your career with an online degree

Join over 3,400 global companies that choose Coursera for Business

Frequently asked questions

When will I have access to the lectures and assignments?

What will I get if I subscribe to this Specialization?

Is financial aid available?

More questions