Build a Multimodal RAG with Azure AI (Process Images + Text)

GenAI Azure AIFoundry

Автор: Tech with Kirk

Загружено: 2025-10-02

Просмотров: 1305

Описание: You’ve built your first RAG—congrats! 🎉 But what happens when your documents include more than just text? Real-world data often contains images, charts, financial reports, and even videos. In this tutorial, I’ll show you how to extend your RAG into a multimodal RAG using Azure AI Search and AI Foundry.

We’ll cover:

Indexing and processing images alongside text
Using image verbalization vs. text extraction (OCR)
Building pipelines to handle real-world data
Uploading extracted content into Azure AI Search
Attaching your multimodal knowledge base to an agent

By the end, you’ll understand how to give your RAG the ability to work with both text and images—and how to decide whether to index contextual meaning or actual extracted text.

00:00 Introduction
00:19 Overview and Context
01:28 Setting Up Azure AI Search
09:35 Setting Up the AI Agent
17:11 Code Implementation
22:09 Testing Results
24:12 Conclusion

🔗 Related Tutorials:
• Build Your First AI Agentic RAG in Azure A...
• Agentic RAG at Scale with Azure AI Search ...

🔗 GitHub repo:
https://github.com/kirkmcpherson/Azur...

#AzureAI #RAG #AIEngineering #VectorDatabase

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Build a Multimodal RAG with Azure AI (Process Images + Text)

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео