Build a Multimodal RAG with Azure AI (Process Images + Text)
Автор: Tech with Kirk
Загружено: 2025-10-02
Просмотров: 1305
Описание:
You’ve built your first RAG—congrats! 🎉 But what happens when your documents include more than just text? Real-world data often contains images, charts, financial reports, and even videos. In this tutorial, I’ll show you how to extend your RAG into a multimodal RAG using Azure AI Search and AI Foundry.
We’ll cover:
Indexing and processing images alongside text
Using image verbalization vs. text extraction (OCR)
Building pipelines to handle real-world data
Uploading extracted content into Azure AI Search
Attaching your multimodal knowledge base to an agent
By the end, you’ll understand how to give your RAG the ability to work with both text and images—and how to decide whether to index contextual meaning or actual extracted text.
00:00 Introduction
00:19 Overview and Context
01:28 Setting Up Azure AI Search
09:35 Setting Up the AI Agent
17:11 Code Implementation
22:09 Testing Results
24:12 Conclusion
🔗 Related Tutorials:
• Build Your First AI Agentic RAG in Azure A...
• Agentic RAG at Scale with Azure AI Search ...
🔗 GitHub repo:
https://github.com/kirkmcpherson/Azur...
#AzureAI #RAG #AIEngineering #VectorDatabase
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: