Building Multimodal Agents with Gemma 3 | Build with AI S-13 ft. Ehtisham Raza (AI Engineer)
Author: TensorFlow User Group Islamabad
Uploaded: 2026-03-07
Views: 62
Description:
Missed Session 13 of Build with AI? We’ve got you covered!
Catch the highlights from Building Multimodal Agents with Gemma 3, featuring Ehtisham Raza, AI Engineer at Data Techcon 🎥
In this hands-on session, we explored how developers can move beyond traditional text-based AI systems and use Gemma 3 and on-device AI technologies to build multimodal agents that understand visual and audio inputs.
🎤 Featured Speaker: Ehtisham Raza (AI Engineer at Data Techcon)
Connect with Ehtisham on LinkedIn: / ehtisham-raza-0a5479163
💡 What we covered:
• Using Gemma 3 for multimodal AI beyond text-only bots
• Processing audio and visual inputs directly on-device
• Implementing on-device inference with LiteRT (formerly TFLite)
• Building a real-time Visual Assistant that describes its surroundings
This practical workshop walked participants through building intelligent agents that combine vision, audio, and language capabilities, and showed how on-device AI and multimodal models can enable real-time applications.
Notebook Link: https://drive.google.com/drive/folder...
Stay connected for more updates and future events on our Social Media:
• Commudle: https://www.commudle.com/communities/...
• Discord: / discord
• Facebook Group: / tfugislamabad
• Instagram: / tfugisl
• LinkedIn: / tfugisl
• Slack: https://join.slack.com/t/tfugislamaba...
• Twitter: / tfugislamabad
• WhatsApp Community: https://chat.whatsapp.com/HCAoCKBHahi...
🌐 Explore previous sessions, speakers, blogs, and upcoming campaigns on our official website:
https://TFUGIslamabad.tech
Don’t miss future Build with AI sessions, and join our growing community!