Any time a patient hears the word cancer as a potential diagnosis, it’s a highly stressful situation. For many people, the time it takes to get an official diagnosis can feel like an eternity. They may require more tests to get a clearer picture, or the location where those tests need to be performed may be far away, and the experts needed to examine scans or cells may be in short supply in their area. A collaborative group of researchers, using their U.S. National Science Foundation ACCESS allocation on NCSA’s DeltaAI supercomputer, has created OPENPROS, the first large-scale, public dataset designed to improve prostate cancer detection using a specialized imaging technique called Ultrasound Computed Tomography (USCT).
Speaking on behalf of the large research group, Youzuo Lin is an associate professor in the School of Data Science and Society at the University of North Carolina at Chapel Hill (UNC). The work on OPENPROS is very promising, potentially solving problems such as access to high-end prostate imaging (e.g., MRI) and helping overcome “blind spots” caused by the pelvic bone in images. The OPENPROS project was co-led by Johns Hopkins University (JHU), and the team’s paper regarding this work was recently accepted for presentation at the 2026 International Conference on Learning Representations (ICLR).
Mapping the Body Through Sound
Imagine you’re trying to take a high-quality photo of an object, but you can only see it from one or two specific angles because there are walls in the way. This is the challenge doctors face when trying to get a clear image of the prostate. The pelvic bone creates a barrier, making it difficult to get a clear picture.
The UNC and JHU research group approached this problem in a novel way. They created a “digital training ground” to teach computers how to create a high-resolution image of the prostate.
“Our methodology combines realistic medical imaging data, physics-based simulation and machine learning to create a large, reliable benchmark for prostate ultrasound computed tomography (USCT),” said Lin. “To our knowledge, this is the first large-scale dataset designed specifically for learning-based and physics-informed reconstruction of prostate USCT under clinically realistic imaging constraints.”
To train the computers, they needed to create ideal scans from “patients” for the machine to learn from. Hanchen Wang, the co-first author from UNC, explained, “We begin with clinically derived MRI and CT scans of the prostate, which are carefully annotated by medical experts to produce anatomically accurate 3D digital models. These models incorporate realistic tissue properties, including speed-of-sound measurements obtained from real ex vivo prostate samples.”

“From these 3D models,” Wang explained, “we systematically extract hundreds of thousands of clinically relevant 2D slices that reflect the limited-angle geometry imposed by real prostate imaging. For each slice, we simulate ultrasound wave propagation using high-fidelity physics solvers based on the acoustic wave equation, generating full waveform data that closely mimics clinical measurements.”
Faster Results Equal Faster Detection and Treatment
With cancer, getting a timely diagnosis is often key to treatment and long-term positive health outcomes. Lin’s group created a dataset that can meaningfully affect the time between testing and treatment.
“OPENPROS is designed to accelerate clinically realistic prostate ultrasound computed tomography (USCT) specifically under the limited-angle constraints in real prostate imaging – transrectal and transabdominal access, nearby bones, heterogeneous tissue,” said Yixuan Wu, the co-first author from JHU.
For decades, doctors have relied on standard gray-scale ultrasounds, which can be like looking at a grainy, black-and-white photo where tumors are easily hidden in the shadows of objects like pelvic bones. The OPENPROS approach instead focuses on more than just the image. The AI measures how fast sound travels through tissue – a specific “biomarker” that can pinpoint cancer much more accurately, even in hard-to-reach areas where traditional tests often fail.
A major breakthrough is the speed at which these tests can be performed. While older, more complex methods could take hours, this system can do so in a fraction of a second. “The baselines show learned reconstruction can be milliseconds per sample, versus hours for iterative physics-based inversion,” said Wu. “It points toward real-time or near-real-time imaging that could fit biopsy/therapy workflows.”
By testing these tools against a massive, realistic database, researchers are ensuring these algorithms aren’t just lab experiments – they’re reliable, robust tools that clinicians can actually trust to make life-saving decisions on the spot.
“OPENPROS explicitly emphasizes benchmarking for generalization, robustness and uncertainty-aware reconstruction, all central to translating early-detection algorithms into tools clinicians can trust for treatment decisions,” said Wu.
Using HPC to Create More Affordable Care
It’s no secret that treating major conditions like cancer can be costly. While the work done on OPENPROS required the use of a large supercomputer like DeltaAI, Lin’s research group hopes their work helps alleviate some of the costlier aspects of diagnosis.
“A major barrier to widespread clinical adoption of Ultrasound Computed Tomography (USCT) is the computational cost of image reconstruction,” said Lin. “Traditional high-resolution USCT reconstruction relies on full-waveform inversion (FWI), an iterative physics-based optimization method that repeatedly solves large-scale wave equations. In practice, a single high-quality FWI reconstruction can require hundreds to thousands of forward and adjoint simulations, translating to days or even weeks of computation on GPUs or HPC clusters for one patient.”
Once a deep learning model is trained, reconstruction becomes a single forward pass through a neural network. In our benchmarks, deep learning inference takes seconds to a few minutes on a single GPU, and can even run on commodity hardware.
–Youzuo Lin, Associate Professor, University of North Carolina at Chapel Hill
This means it’s possible that, at some point, clinicians will be able to use this resource at smaller health institutions as well.
“This dramatic reduction in computational cost has important implications for accessibility. Faster reconstruction lowers the need for expensive computing infrastructure, reduces operational costs and makes it feasible to deploy USCT systems in a wider range of clinical settings, including community hospitals and outpatient clinics,” explains Lin. “By enabling high-quality imaging without specialized hardware or long processing delays, our approach helps move advanced cancer screening closer to being affordable and accessible to the average patient.”
As a partner in the ACCESS program, HPC resources like DeltaAI help advance society at large by supporting important research. Lin’s research group was able to create OPENPROS, a dataset designed to improve prostate cancer detection, thanks to their allocation through the ACCESS program. Without access to HPC resources, this kind of research would take far longer to complete.
“Data generation itself was computationally intensive,” said Lin. “Each sample requires numerically solving acoustic wave equations under realistic anatomical and clinical constraints. We addressed this by leveraging the DeltaAI supercomputing cluster supported by the U.S. National Science Foundation, which provided large-scale GPU resources and high-throughput job scheduling. Efficient parallelization, careful memory management and robust job orchestration were essential to avoid node failures and wasted compute time.”
The payoff for using HPC resources was like going from a sketch to a three-dimensional painting in seconds. Without this technology, calculating even a single medical image could take a computer up to 24 hours because the physics involved are so complex. By training AI on the DeltaAI supercomputer, Lin’s research group created a system that can now analyze a patient’s ultrasound and produce a clear, accurate map of the prostate in just five to nine milliseconds.
Leveraging the power of a supercomputer, the researchers did more than just make a better map of the human body – they turned a day-long waiting game into an instant result, bringing all of us one step closer to life-saving, real-time cancer detection in every doctor’s office.
If you have a research project that could benefit from HPC resources, you can get started with ACCESS here. To read more about this research project, you can find the original story here: From Image to Diagnosis in Milliseconds.
Resource Provider Institution(s): National Center for Supercomputing Applications (NCSA)
Resources Used: DeltaAI
Affiliations: University of North Carolina at Chapel Hill, Johns Hopkins University, Iowa State University
Funding Agency: NSF
Grant or Allocation Number(s): CIS250282
The science story featured here was enabled by the U.S. National Science Foundation’s ACCESS program, which is supported by National Science Foundation grants #2138259, #2138286, #2138307, #2137603, and #2138296.
