← All Phases

Phase 1: Case Database ExplorerDEMO

Liver CDS System — Data Foundation
323
Total Cases
3 data sources
54%
Malignant
174 cases
71%
Avg Completeness
hospital: 90%, public: 48%
12
Diagnostic Errors
flagged for Phase 4
Case Browser
Data Completeness
Distributions
Data Sources
Showing 18 of 18 cases
Case ID Age Sex Lesion Type LI-RADS Source Completeness Malignancy

Data Completeness Matrix

Green = available, Yellow = partial, Red = missing
Available Partial Missing

Lesion Type Distribution

LI-RADS Category Distribution

Malignancy Classification

Data Source Comparison

Public Datasets

122
Total Cases
LiverHccSeg: 17 cases · Duke Liver: 105 cases
Completeness:
~48%
Imaging data, segmentation masks
No lab values, clinical notes, or treatment data
Limited demographics

Hospital Data

201
Total Cases
Full EHR integration · IRB-approved · De-identified
Completeness:
~90%
Full imaging + labs + clinical notes
Treatment and outcome data available
Complete demographics and history
Why hospital data matters: Public datasets provide imaging and annotations (~50% complete), but only hospital data delivers the full clinical picture needed for multimodal retrieval and error-aware reasoning. The completeness gap directly impacts retrieval quality in Phase 2 and VLM reasoning accuracy in Phase 3.