MetaVoxel: Joint Diffusion Modeling of Imaging and Clinical Metadata
Yihao Liu and 11 other authors
Abstract: Modern deep learning methods have achieved impressive results across tasks ranging from disease classification and continuous biomarker estimation to realistic medical image generation. Most of these approaches are trained to model conditional distributions defined by a specific predictive direction with a specific set of input variables. We introduce MetaVoxel, a generative joint diffusion modeling framework that models the joint distribution over imaging data and clinical metadata by learning a single diffusion process spanning all variables. By capturing the joint distribution, MetaVoxel unifies tasks that traditionally require separate conditional models and supports flexible zero-shot inference using arbitrary subsets of inputs without task-specific retraining. Using more than 10,000 T1-weighted MRI scans paired with clinical metadata from nine datasets, we show that a single MetaVoxel model can perform image generation, age estimation, and sex prediction, achieving performance comparable to established task-specific baselines. Additional experiments highlight its capabilities for flexible inference. Together, these findings demonstrate that joint multimodal diffusion offers a promising direction for unifying medical AI models and enabling broader clinical applicability.
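The abstract's central idea, one diffusion process over a concatenation of image voxels and metadata, with any observed subset conditioning the rest, can be illustrated with a toy sketch. This is not the paper's method: the denoiser here is a hand-written stand-in for a trained network, the noise schedule is a simple linear blend, and the inpainting-style clamping of observed variables is one common way (assumed here) to realize arbitrary-subset conditioning.

```python
import numpy as np

rng = np.random.default_rng(0)
T = 100  # number of diffusion steps (illustrative)

def forward_noise(x0, t):
    """Forward process under a linear schedule: blend clean data with Gaussian noise."""
    alpha = 1.0 - t / T
    return np.sqrt(alpha) * x0 + np.sqrt(1.0 - alpha) * rng.standard_normal(x0.shape)

def toy_denoiser(xt, t):
    """Stand-in for a learned denoising network: shrink toward the data mean (zero).
    In the real framework this would be a network trained jointly on all variables."""
    alpha = 1.0 - t / T
    return np.sqrt(alpha) * xt

def sample(observed, mask):
    """Sample the joint vector, re-clamping observed entries at each step so that
    any subset of variables (image, age, sex, ...) can condition the remainder."""
    x = rng.standard_normal(observed.shape)
    for t in range(T, 0, -1):
        x0_hat = toy_denoiser(x, t)            # predict clean data
        x = forward_noise(x0_hat, t - 1)       # step to a less-noisy level
        x[mask] = forward_noise(observed, t - 1)[mask]  # clamp known variables
    return x

# Joint vector: [flattened "image" (8 toy voxels) | age | sex] -- all illustrative.
joint = np.concatenate([rng.standard_normal(8), [0.5, 1.0]])
mask = np.zeros(10, dtype=bool)
mask[:8] = True  # condition on the image, infer the metadata entries
out = sample(joint, mask)
```

Swapping the mask (e.g. observing age and sex instead of the image) turns the same sampler into an image generator, which is the sense in which a single joint model subsumes the separate conditional models.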
Submission history
From: Yihao Liu
[v1] Wed, 10 Dec 2025 19:47:52 UTC (760 KB)
[v2] Fri, 12 Dec 2025 02:15:39 UTC (760 KB)


