Deadline extended Submission deadline extended from Mar 7 to Mar 8, 2026 (AoE).

Speakers

Kristen Grauman

Kristen Grauman

University of Texas at Austin

Leads work on egocentric video understanding and retrieval at scale, with a focus on long-horizon grounding and efficient perception.
Mohit Bansal

Mohit Bansal

University of North Carolina at Chapel Hill

Researches multimodal language/vision agents, grounded reasoning, and controllable generation for real-world tasks.
Dan Roth

Dan Roth

University of Pennsylvania / Oracle AI

Pioneer in structured, grounded reasoning and robust inference for language/vision systems deployed in real settings.
Scott Wen-Tau Yih

Scott Wen-Tau Yih

Meta

Research Scientist at Meta FAIR and affiliate professor at the University of Washington. His work spans NLP, ML, and information retrieval, including DPR and RAG, and he was named an ACL Fellow in 2024.
Sujith Ravi

Sujith Ravi

Oracle

Vice President of GenAI at Oracle and former founder of SliceX AI, leading frontier models and agentic systems for enterprise reliability, safety, and large‑scale deployment.
Vijay Krishnan

Vijay Krishnan

Turing

Co-Founder and CTO of Turing, where he leads technical strategy for frontier AI systems across reasoning, coding, multimodality, and agentic workflows. His work focuses on scalable human-AI systems that advance practical and reliable intelligence.

More to be announced

Leaders from Industry & Academia

Panel

From Retrieval to Action: What Should Agentic Vision Systems Verify?

Diverse perspectives across industry and academia.

Workshop Organizers

Amit Agarwal

Amit Agarwal

Oracle AI

Builds agentic vision systems and retrieval pipelines with grounded evidence and data enrichment at scale.
Vivek Gupta

Vivek Gupta

Arizona State University

Researches heterogeneous retrieval over structured and unstructured sources, focusing on robust, grounded search.
Vivek Srikumar

Vivek Srikumar

University of Utah

Works on grounding, reliability, and structured prediction for language/vision systems.
Tao Sheng

Tao Sheng

Oracle AI

Focuses on agentic planning, tool use, and multimodal system integration for production deployments.
Alice Oh

Alice Oh

KAIST

Leads work on multilingual attribution, grounding, and socially responsible AI across modalities.
Sara Hooker

Sara Hooker

Adaption Labs

Researches efficient and responsible ML (distillation, compression) to make large models deployable.
Jyotika Singh

Jyotika Singh

Oracle AI

Works on agentic memory, data quality, and human-in-the-loop interaction for grounded systems.
Hitesh Patel

Hitesh Patel

Oracle AI

Focuses on multilingual and multimodal responsible AI with grounded retrieval and safety.

Program Committee

Karan Dua

Karan Dua

Expertise in scaling multimodal data pipelines and synthetic generation for generative model training. Advances the field through the design of calibrated metrics and comprehensive benchmarking for generative AI.
Hansa Meghwani

Hansa Meghwani

Holds an MSc from LJMU, UK. Specializes in the architecture of enterprise-grade RAG and agentic workflows with a focus on attribution. Expertise spans the rigorous evaluation of multilingual LLMs and VLMs.
Meizhu Liu

Meizhu Liu

Holds a PhD from the University of Florida. Specializes in the development of multimodal retrieval architectures and generative tools for image editing. Over 70+ publications in top-tier conferences.
Michael Avendi

Michael Avendi

Holds a PhD from UC Irvine. Focuses on generative AI, vision-language-action models, and robotics learning to bridge perception and physical autonomy.
Matthew Rowe

Matthew Rowe

Researches multimodal and hybrid search with a focus on enterprise-scale RAG. Specializes in architecting retrieval systems for high-throughput production deployments.
Yassi Abbasi

Yassi Abbasi

Holds a PhD from USC. Specializes in multimodal AI with an emphasis on image-text representation learning, generative modeling, and the design of robust evaluation metrics.
Peerat Limkonchotiwat

Peerat Limkonchotiwat

Research Fellow at AI Singapore, NUS. Leads SEA-LION, SEA-HELM, and SEA-Guard; collaborates on LLM, multimodal, safety, and dataset research. Contributor to SEACrowd and SEA-VL; invited researcher at Chulalongkorn University and SIGSEA advisory board member.
Taki Hasan

Taki Hasan

PhD candidate at Hanyang University, South Korea. Research focuses on trustworthy AI, including efficient test-time adaptation of vision-language and multimodal models for handling distribution shifts.
Praneet Pabolu

Praneet Pabolu

Holds a Master’s from Cornell and leads foundation model development at Splunk, focusing on representation learning and generative reasoning for complex environments.
Dr. Pao-Ann Hsiung

Dr. Pao-Ann Hsiung

Professor of Computer Science and Dean of the College of Engineering. His work spans AI, IoT, and smart city applications, and he previously led smart city initiatives in Chiayi City that earned international recognition.
Bronson Bakunga

Bronson Bakunga

Ugandan ML engineer, NLP researcher, and co-founder of Crane AI Labs. He builds compact offline-first language models for low-power devices to improve education access across Sub-Saharan Africa.

Stay connected

General inquiries

Reach out to the organizers with questions about submissions, sponsorship, or program.