Invited talk
in
Workshop: The 3rd DL4C Workshop: Emergent Possibilities and Challenges in Deep Learning for Code

Inited Talk: The Future of Multimodal AI Applications by Stefania Druga

Stefania Druga

2025 Invited talk
in
Workshop: The 3rd DL4C Workshop: Emergent Possibilities and Challenges in Deep Learning for Code

Abstract

Move beyond text-based paradigms and explore the frontier of Artificial Intelligence: multimodal applications that see, hear, and interact with the world in real-time. This technical keynote explores the practical implementation and potential of systems integrating and synthesizing information from diverse data streams – including live input from webcams, audio feeds, video, mobile sensors, and bespoke hardware. Stefania Druga (Google DeepMind) will provide a researcher's perspective, diving into the technical challenges – from data fusion and latency reduction to context modeling and robust interaction design – inherent in building AI that leverages a richer understanding of the physical world. Expect compelling live demonstrations showcasing interactive AI systems designed to understand context, anticipate user needs, and respond dynamically through multi-sensory feedback loops. Discover the future trajectory of intelligent systems and the potential for innovation in human-computer interaction, personalized assistance, and human-AI collaboration enabled by this shift towards truly multimodal AI.

Speaker

Stefania Druga

Stefania Druga is a Research Scientist at Google DeepMind, where she works on novel multimodal AI applications. She has a master's degree from MIT, PhD from UW and has been doing research on AI education since 2015. During graduate school, she built the first open-source platform for K12 AI Education - Cognimates . When she is not coding and writing papers, she enjoys trail running, yoga, and riding her bicycle.

Video

Chat is not available.