The Architecture of RAG Systems Part 01

Автор: Mohamad Aoude

Загружено: 2026-03-09

Просмотров: 27

Описание: In this lecture, we explore Retrieval-Augmented Generation, or RAG, as a full AI systems architecture rather than just a popular buzzword. The session explains why standalone large language models are not enough for many real-world applications, especially when answers must be grounded in current, private, or domain-specific knowledge.

We walk through the complete RAG pipeline step by step: data ingestion, chunking, embedding generation, vector databases, retrieval flow, augmented prompt construction, hybrid retrieval, evaluation, debugging, and system limitations. The lecture also shows how RAG fits into the broader evolution of modern AI systems and why it serves as a foundation for the move toward agentic RAG.

To keep the discussion practical, the lecture uses a recurring engineering example based on a corporate document collection, showing how a system can retrieve relevant evidence and generate grounded answers from real sources.

This lecture is designed for students, engineers, and practitioners who want a clear architectural understanding of how RAG systems work in production settings.

Topics covered

Why RAG matters

Offline and online RAG pipelines

Data ingestion and chunking strategies

Embeddings and vector databases

Dense, lexical, and hybrid retrieval

Augmented prompt design

Evaluation metrics and debugging

Strengths and limitations of RAG

Transition from traditional RAG to agentic RAG

This session is part of a broader course on modern AI systems architecture.

#RAG #AI #ArtificialIntelligence #LLM #GenerativeAI #MachineLearning #VectorDatabase #SemanticSearch #AgenticAI #AIEngineering

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

The Architecture of RAG Systems Part 01

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

The Architecture of RAG Systems Part 02

The Architecture of RAG Systems Part 02

Напали на Иран. Уничтожили весь мир.

Напали на Иран. Уничтожили весь мир.

Лучший Гайд по Kafka для Начинающих За 1 Час

Лучший Гайд по Kafka для Начинающих За 1 Час

3 причины, почему я перешел на Claude: Реальный пример от не программиста.

3 причины, почему я перешел на Claude: Реальный пример от не программиста.

Стандартная модель Вселенной под вопросом? — Семихатов, Горбунов

Стандартная модель Вселенной под вопросом? — Семихатов, Горбунов

Lecture Three part 02 From RAG to Agentic RAG / Retrieval Systems Evolve into Decision-Capable AI

Lecture Three part 02 From RAG to Agentic RAG / Retrieval Systems Evolve into Decision-Capable AI

I spoke to AI agent Claude

I spoke to AI agent Claude

⚡️ Окружение с трёх сторон началось || Крупнейший в мире объект поражён

⚡️ Окружение с трёх сторон началось || Крупнейший в мире объект поражён

Запуск нейросетей локально. Генерируем - ВСЁ

Запуск нейросетей локально. Генерируем - ВСЁ

КАК УСТРОЕН TCP/IP?

КАК УСТРОЕН TCP/IP?

Lecture Three part 01 From RAG to Agentic RAG / Retrieval Systems Evolve into Decision-Capable AI

Lecture Three part 01 From RAG to Agentic RAG / Retrieval Systems Evolve into Decision-Capable AI

1С: ИИ пишет весь код без человека: магия нейросетей

1С: ИИ пишет весь код без человека: магия нейросетей

Третья неделя конфликта: План Нетаньяху и ловушка для американцев | Ростислав Ищенко

Третья неделя конфликта: План Нетаньяху и ловушка для американцев | Ростислав Ищенко

ЛУЧШАЯ БЕСПЛАТНАЯ НЕЙРОСЕТЬ Google, которой нет аналогов

ЛУЧШАЯ БЕСПЛАТНАЯ НЕЙРОСЕТЬ Google, которой нет аналогов

Китай требует капитуляции соседа / Войска стянуты к границе

Китай требует капитуляции соседа / Войска стянуты к границе

From LLMs to AI Systems

From LLMs to AI Systems

Gemini Embedding 2 — КОНЕЦ Всему RAG?

Gemini Embedding 2 — КОНЕЦ Всему RAG?

CAPA in GMP: Corrective & Preventive Action (Lecture 6)

CAPA in GMP: Corrective & Preventive Action (Lecture 6)

Как представить 10 измерений? [3Blue1Brown]

Как представить 10 измерений? [3Blue1Brown]

Всё закончится через 2 года.

Всё закончится через 2 года.