323
Total Cases
3 data sources
54%
Malignant
174 cases
71%
Avg Completeness
hospital: 90%, public: 48%
12
Diagnostic Errors
flagged for Phase 4
Case Browser
Data Completeness
Distributions
Data Sources
| Case ID | Age | Sex | Lesion Type | LI-RADS | Source | Completeness | Malignancy |
|---|
Data Completeness Matrix
Green = available, Yellow = partial, Red = missing
Available
Partial
Missing
Lesion Type Distribution
LI-RADS Category Distribution
Malignancy Classification
Data Source Comparison
Public Datasets
122
Total Cases
LiverHccSeg: 17 cases · Duke Liver: 105 cases
Completeness:
~48%
Imaging data, segmentation masks
No lab values, clinical notes, or treatment data
Limited demographics
Hospital Data
201
Total Cases
Full EHR integration · IRB-approved · De-identified
Completeness:
~90%
Full imaging + labs + clinical notes
Treatment and outcome data available
Complete demographics and history
Why hospital data matters:
Public datasets provide imaging and annotations (~50% complete), but only hospital data delivers the full clinical picture needed for multimodal retrieval and error-aware reasoning. The completeness gap directly impacts retrieval quality in Phase 2 and VLM reasoning accuracy in Phase 3.