GenAI For Application Developers - Part 5 | Architecting AI Systems: Beyond API Calls
Автор: Code And Joy
Загружено: 2026-01-23
Просмотров: 116
Описание:
00:00 - Introduction: Advanced Orchestration for AI Systems
00:55 - Distributed Systems Pattern: Quorum Reads & Self-Consistency
03:17 - Tuning Temperature for Deterministic vs. Creative Outputs
05:55 - Practical Example: Sentiment Analysis with Majority Voting
08:52 - Weighted Voting: Adding Confidence Scores to Decisions
10:27 - High-Stakes Architecture: Designing a Medical Symptom Bot
14:35 - What is an AI Agent? (State, Orchestrator, Executor)
17:50 - The Agent Loop: Solving Complex Reasoning Problems
20:53 - The Memory Challenge: Managing Context Window Overflow
23:42 - How Agents Know When to Stop (Termination Conditions)
25:55 - Real World Case Study: Architecting a Code Review Agent
27:18 - Memory Optimization Technique: Semantic Compression
35:00 - Handling Massive Data: Async Queues & Event-Driven Architecture
37:15 - Token Economics: Prompt Caching & System Prompts
39:40 - Cost Optimization: Model Routing & Token Limiting
43:06 - The "Cost Guard": Preventing Infinite Loop Billing Disasters
47:45 - AI Monitoring Stack: Prometheus, Latency & Token Metrics
50:40 - Backend Engineering: Implementing Streaming with SSE (Server-Sent Events)
54:25 - Parallelism: Async Gather for RAG & Compliance Checks
55:45 - DevOps Deep Dive: System RAM vs. GPU VRAM
1:00:07 - Platform Engineering: The AI Gateway Architecture
1:03:08 - Conclusion: Shifting form API Wrapper to System Architect
Excalidraw Link: https://excalidraw.com/#json=KmovHDlO...
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: