Foreign objects unintentionally retained during minimally invasive surgery can cause serious complications that require reoperation (Badiee et al., 2025). SAVE FOCUS addresses this patient safety challenge by benchmarking AI on tasks surgeons face intraoperatively: continuous counting and tracking of surgical items, real-time localization of unaccounted objects, and verification of complete retrieval before closure.
At the same time, the challenge targets a critical technical frontier: can vision-language models maintain accurate understanding across hours-long procedures when safety decisions depend on remembering what happened much earlier in the operation?
The challenge was designed in accordance with BIAS, the EQUATOR reporting guideline for biomedical image analysis challenges. The full challenge design can be found here.
SAVE FOCUS – Foreign Object Contextual Understanding for Safe Surgical AI – is organized in three tracks.
Together, these tracks enable a systematic characterization of where current VLMs succeed and fail as task complexity transitions from instantaneous perception to long-context intraoperative reasoning.
ORena SAVE FOCUS is a joint initiative of the Wellcome Leap SAVE program and the ORena initiative of the DKFZ Intelligent Medical Systems (IMSY) lab, co-sponsored by Helmholtz Imaging.
The SAVE program is a US$50 million research and innovation initiative launched by Wellcome Leap, a nonprofit that aims to accelerate breakthroughs in global human health. SAVE stands for "Surgery: Assess, Validate, Expand", and the program focuses on critical challenges in global surgical care. The program is led by Prof. Thomas G. Weiser, clinical coordinator of the SAVE FOCUS challenge.
Helmholtz Imaging is an interdisciplinary platform of the Helmholtz Association of German Research Centres designed to unlock the scientific potential of imaging data and methods across all disciplines and scales of research. It serves as a central hub for expertise, tools, funding, and collaboration in imaging science. Within this framework, the DKFZ IMSY lab leads the benchmarking activities, driving the development of standards and evaluation strategies for imaging methods.
ORena is an umbrella framework for surgical AI competitions designed specifically for Operating Room challenges. It provides standardized benchmarks, curated datasets, and community leaderboards to drive progress on AI systems that can support real clinical workflows. SAVE FOCUS is the inaugural challenge, with future editions planned to address other critical problems in surgical safety and quality assurance.