Speakers

Kristen Grauman

University of Texas at Austin

Leads work on egocentric video understanding and retrieval at scale, with a focus on long-horizon grounding and efficient perception.

Mohit Bansal

University of North Carolina at Chapel Hill

Researches multimodal language/vision agents, grounded reasoning, and controllable generation for real-world tasks.

Dan Roth

University of Pennsylvania / Oracle AI

Pioneer in structured, grounded reasoning and robust inference for language/vision systems deployed in real settings.

Group Vice President of GenAI at Oracle, former founder of SliceX AI and ex-Head of AI at Google AI and Amazon Alexa AI, currently leading frontier models and agentic systems for enterprise reliability, safety, and large‑scale deployment.

Vijay Krishnan

Founder and CTO, Turing

Founder and CTO of Turing, where he leads technical strategy for frontier AI systems across reasoning, coding, multimodality, and agentic workflows. His work focuses on scalable human-AI systems that advance practical and reliable intelligence.

Kenneth Marino

University of Utah / Ex-DeepMind

Kenneth Marino is an Assistant Professor at the Kahlert School of Computing at the University of Utah and former Research Scientist at Google DeepMind. His research focuses on integrating multimodal language models into embodied agent problems, including computer use, games, and robotics.

Ming-Hsuan Yang

UC Merced / DeepMind

Ming-Hsuan Yang is a Professor in Electrical Engineering and Computer Science at the University of California, Merced. His research interests include computer vision, pattern recognition, artificial intelligence, robotics, and machine learning.

Panel

From Understanding to Action: Building the next AI frontier with Multimodal Agents, World Models, and Real-world Intelligence

Diverse perspectives across industry and academia.

Workshop Organizers

Amit Agarwal

Oracle AI

Builds agentic vision systems and retrieval pipelines with grounded evidence and data enrichment at scale.

Vivek Gupta

Arizona State University

Researches heterogeneous retrieval over structured and unstructured sources, focusing on robust, grounded search.

Vivek Srikumar

University of Utah

Works on grounding, reliability, and structured prediction for language/vision systems.

Tao Sheng

Oracle AI

Focuses on agentic planning, tool use, and multimodal system integration for production deployments.

Alice Oh

KAIST

Leads work on multilingual attribution, grounding, and socially responsible AI across modalities.

Sara Hooker

Adaption Labs

Researches efficient and responsible ML (distillation, compression) to make large models deployable.

Jyotika Singh

Oracle AI

Works on agentic memory, data quality, and human-in-the-loop interaction for grounded systems.

Hitesh Patel

Oracle AI

Focuses on multilingual and multimodal responsible AI with grounded retrieval and safety.

Program Committee

Karan Dua

Builds large-scale multimodal data pipelines, synthetic generation workflows, and evaluation benchmarks for generative AI systems.

Hansa Meghwani

Holds an MSc from LJMU, UK. Specializes in the architecture of enterprise-grade RAG and agentic workflows with a focus on attribution. Expertise spans the rigorous evaluation of multilingual LLMs and VLMs.

Meizhu Liu

Holds a PhD from the University of Florida. Specializes in the development of multimodal retrieval architectures and generative tools for image editing. Over 70+ publications in top-tier conferences.

Michael Avendi

Holds a PhD from UC Irvine. Focuses on generative AI, vision-language-action models, and robotics learning to bridge perception and physical autonomy.

Matthew Rowe

Researches multimodal and hybrid search with a focus on enterprise-scale RAG. Specializes in architecting retrieval systems for high-throughput production deployments.

Yassi Abbasi

Holds a PhD from USC. Specializes in multimodal AI with an emphasis on image-text representation learning, generative modeling, and the design of robust evaluation metrics.

Peerat Limkonchotiwat

Research Fellow at AI Singapore, NUS. Leads SEA-LION, SEA-HELM, and SEA-Guard; collaborates on LLM, multimodal, safety, and dataset research. Contributor to SEACrowd and SEA-VL; invited researcher at Chulalongkorn University and SIGSEA advisory board member.

Taki Hasan

PhD candidate at Hanyang University, South Korea. Research focuses on trustworthy AI, including efficient test-time adaptation of vision-language and multimodal models for handling distribution shifts.

Praneet Pabolu

Holds a Master’s from Cornell and leads foundation model development at Splunk, focusing on representation learning and generative reasoning for complex environments.

Dr. Pao-Ann Hsiung

Professor of Computer Science and Dean of the College of Engineering. His work spans AI, IoT, and smart city applications, and he previously led smart city initiatives in Chiayi City that earned international recognition.

Bronson Bakunga

Ugandan ML engineer, NLP researcher, and co-founder of Crane AI Labs. He builds compact offline-first language models for low-power devices to improve education access across Sub-Saharan Africa.

Sai Ashish Somayajula

Holds a PhD from UC San Diego and works on NL2SQL, enterprise analytics, and robust reasoning across structured and unstructured data. Served as SRW Chair for EACL 2026, where the workshop expanded submissions, mentorship, and reviewer participation.

Bhargava Kumar

TD Securities

Director at TD Securities, where he leads the AI Practice and supports AI strategy to production delivery for financial markets. He is currently focused on building agentic solutions to support business workflows across the firm. He has co-authored papers at ICLR, ICML, and ACL, among others, and reviews for major ML and NLP conferences. He holds an MS in Operations Research from Columbia University.

Tejaswini Kumar

Apple

Senior Engineering Program Manager at Apple with a strong interest in applied AI, particularly generative AI and agentic systems. Her published research appears at ACL, NAACL, and AACL, among other venues. She holds an MS from the Industrial Engineering and Operations Research (IEOR) department at Columbia University.

Eun Woo Im

Arizona State University

Ph.D. student at ASU focused on multimodal reasoning, retrieval, and its application to video understanding.

Tampu Ravi Kumar

Arizona State University

Ph.D. student at ASU focused on multimodal reasoning, QA over structured and unstructured data, and speech disfluency and audio reasoning.

Manan Roy Choudhury

Arizona State University

Ph.D. student at ASU focused on LLM reasoning and planning, anomaly and discrepancy detection, multi-modal robustness and perturbation analysis, and adversarial attacks in agentic frameworks.

Tejas Anvekar

Arizona State University

Ph.D. student at ASU specializing in the personalized and trustworthy evaluation of MLLMs. Develops agent-driven, explainable frameworks to identify risks and bias across diverse tasks.

Abhijit Chakraborty

MongoDB / Arizona State University

Technical Sales Director for North America Solution Consulting at MongoDB and a Computer Science Ph.D. candidate at Arizona State University's CoRAL Lab, with nearly two decades of experience building data platforms and AI systems across Fortune 500 enterprises. His research spans knowledge graph embeddings, federated retrieval-augmented generation, and trustworthy multi-agent AI, with recent work submitted to venues like NeurIPS and EMNLP. He serves the community as a NeurIPS and EMNLP 2025 reviewer, a Harvard Business Review Board of Advisors member, and an open-source contributor for Apache.

Yash Shah

JPGlobal / Arizona State University

Applied AI Intern at JPGlobal and an M.S. Computer Science thesis candidate at Arizona State University, working on scalable AI systems. His research spans diffusion language models, federated RAG, and reliable agentic AI systems, with work accepted at ACL and EACL.

Ashish Raj Shekhar

Arizona State University

M.S. Computer Science student at Arizona State University's CoRAL Lab, advised by Dr. Vivek Gupta. His research spans AI in education, AI safety, and multi-agent systems, with work accepted at EACL and ACL. He previously worked as a Data Engineer at Amazon and LendingKart.

Shiven Agarwal

Arizona State University

Master's student in Computer Science at Arizona State University conducting research in the CoRAL Lab under Dr. Vivek Gupta. His work spans multi-agent systems, LLM evaluation and robustness, AI safety, and privacy-preserving machine learning, with a focus on taking research systems from paper to deployed product. He is the first author of GamED.AI, a multi-agent framework for automated educational game generation accepted to the ACL 2026 System Demonstrations track, and has additional work published at EACL 2026. Prior to graduate studies, he spent three years as a software engineer building healthcare technology platforms.

Adarsh Singh

Arizona State University

Specializing in Information Retrieval over structured data and retrieval-augmented question answering. Focuses on retrieval, reranking, and representation learning methods that enable robust and scalable reasoning over structured knowledge.

Stay connected

General inquiries

Reach out to the organizers with questions about submissions, sponsorship, or program.

Email organizers See updates