Skip to yearly menu bar Skip to main content


Invited talk
in
Workshop: The 3rd DL4C Workshop: Emergent Possibilities and Challenges in Deep Learning for Code

Inited Talk: The Future of Multimodal AI Applications by Stefania Druga

Stefania Druga


Abstract:

Move beyond text-based paradigms and explore the frontier of Artificial Intelligence: multimodal applications that see, hear, and interact with the world in real-time. This technical keynote explores the practical implementation and potential of systems integrating and synthesizing information from diverse data streams – including live input from webcams, audio feeds, video, mobile sensors, and bespoke hardware. Stefania Druga (Google DeepMind) will provide a researcher's perspective, diving into the technical challenges – from data fusion and latency reduction to context modeling and robust interaction design – inherent in building AI that leverages a richer understanding of the physical world. Expect compelling live demonstrations showcasing interactive AI systems designed to understand context, anticipate user needs, and respond dynamically through multi-sensory feedback loops. Discover the future trajectory of intelligent systems and the potential for innovation in human-computer interaction, personalized assistance, and human-AI collaboration enabled by this shift towards truly multimodal AI.

Chat is not available.