This course teaches you to build AI-driven systems with Retrieval-Augmented Generation (RAG) and multimodal technology. You'll study the principles behind RAG and how it powers systems such as advanced search engines, chatbots, and recommendation systems. Through hands-on work, you'll create multimodal systems that use images, text, and other forms of data to deliver more intelligent, context-aware results.



Multimodal RAG with GPT – Build Smarter Search & AI Systems

Instructor: Packt - Course Instructors
What you'll learn
Learn how to build and implement Retrieval Augmented Generation (RAG) systems with multimodal capabilities.
Understand the core principles of multimodal search systems and their advantages in real-world applications.
Gain hands-on experience in building a multimodal recommender system that integrates both text and image data.
Develop a user-friendly interface using Streamlit to deploy your multimodal AI systems effectively.
Details to know

May 2025
5 assignments
There are 7 modules in this course
In this module, we will introduce the course’s objectives, explain the key concepts you'll need to understand, and give you a preview of the systems you'll build. We will also discuss the course structure, helping you prepare for what's ahead.
What's included
3 videos, 1 reading
In this module, we will guide you through the process of setting up the development environment for the course. You’ll ensure that all the necessary tools and dependencies are ready, setting you up for success in the hands-on sections.
What's included
1 video
In this module, we will dive into the fundamentals of RAG systems, their applications, and the benefits they bring. Additionally, we will introduce multimodal RAG systems, showcasing how they differ and how they function.
What's included
3 videos, 1 assignment
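The retrieve-then-generate pattern behind RAG can be sketched roughly as follows. This is a minimal illustration and not the course's own code: the toy documents, the word-overlap scoring, and the function names `retrieve` and `build_prompt` are all assumptions. A real system would typically rank documents with learned embeddings and send the resulting prompt to a language model.

```python
import re
from collections import Counter

# Hypothetical toy corpus; a real system would index far more content.
DOCUMENTS = [
    "RAG retrieves relevant documents before generating an answer.",
    "Streamlit is a Python library for building simple web interfaces.",
    "Image embeddings map pictures to numeric vectors.",
]

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def score(query, document):
    """Count shared word occurrences (a stand-in for embedding similarity)."""
    query_words = Counter(tokenize(query))
    document_words = Counter(tokenize(document))
    return sum((query_words & document_words).values())

def retrieve(query, documents, k=1):
    """Return the k documents that overlap most with the query."""
    ranked = sorted(documents, key=lambda doc: score(query, doc), reverse=True)
    return ranked[:k]

def build_prompt(query, context):
    """Combine retrieved context and the question into one prompt string."""
    joined = "\n".join(f"- {passage}" for passage in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"

question = "What does RAG do before generating an answer?"
context = retrieve(question, DOCUMENTS)
prompt = build_prompt(question, context)
print(prompt)
```

The generation step is deliberately left out: in practice the `prompt` string would be passed to whichever language model the system uses.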
In this module, we will break down how search is integrated within a multimodal RAG system. We will explore its power and versatility, showcasing its transformative potential through visual explanations.
What's included
3 videos, 1 assignment
In this module, we will guide you through setting up a multimodal search system, from creating image embeddings to finalizing the system's functionality. You'll get hands-on experience with the full process.
What's included
2 videos, 1 assignment
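As a rough sketch of how search over image embeddings can work, the example below ranks images by cosine similarity to a query vector. It assumes that some model has already mapped both the text query and the images into one shared vector space. The three-dimensional vectors, the file names, and the `search` function are made up for illustration and are not taken from the course.

```python
import math

# Made-up 3-dimensional vectors standing in for real image embeddings.
IMAGE_EMBEDDINGS = {
    "red_dress.jpg": [0.9, 0.1, 0.0],
    "blue_jeans.jpg": [0.1, 0.9, 0.1],
    "running_shoes.jpg": [0.0, 0.2, 0.9],
}

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(query_embedding, index, k=2):
    """Return the k (name, similarity) pairs closest to the query."""
    scored = [
        (name, cosine_similarity(query_embedding, vector))
        for name, vector in index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Hypothetical embedding of a text query such as "a red dress".
query_embedding = [0.8, 0.2, 0.1]
results = search(query_embedding, IMAGE_EMBEDDINGS)
for name, similarity in results:
    print(f"{name}: {similarity:.3f}")
```

A linear scan like this is only practical for small collections; larger systems usually store the vectors in a dedicated vector index.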
In this module, we will guide you through building a multimodal recommender system, from dataset retrieval to embedding generation. You’ll also learn how to set up the RAG flow and integrate a UI for better user interaction.
What's included
8 videos, 1 assignment
In this module, we will help you chart your path forward after completing the course. We’ll provide actionable next steps to continue your learning journey and explore how to apply your skills in real-world scenarios.
What's included
1 video, 1 assignment
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
Financial aid available.