Deep learning for semi-automated unidirectional measurement of lung tumor size in CT

Cancer Imaging

Table 2 Reader statistics and inter-observer variability between human radiologist and DL algorithm

Reader	Image Use	Number of Annotated Images	Average Measurement (cm), mean ± SD (range)
Radiologist 1	Test	244	2.99 ± 0.93 (1.57–4.91)
Radiologist 2	Training	734	3.17 ± 0.96 (1.51–5.00)
Radiologist 2	Validation	159	3.21 ± 0.98 (1.50–4.99)
Radiologist 3	Training	395	2.92 ± 0.88 (1.49–4.94)
Radiologist 3	Validation	85	3.35 ± 0.89 (1.56–4.79)
DL Algorithm	Test	244	3.07 ± 0.91 (1.37–5.44)
Radiologist 1 & DL Algorithm ICC: 0.959 (95% CI: 0.947, 0.967)

Note – Average measurements are reported as mean ± standard deviation. Numbers in parentheses give the range (minimum observed value–maximum observed value). ICC denotes intraclass correlation coefficient; the ICC is based on a two-way random-effects model. CI denotes confidence interval.
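
To illustrate how an ICC under a two-way random-effects model can be computed from paired measurements, a minimal sketch follows. The note does not state whether the reported ICC is the single-measure or average-measure form, or whether it is defined for absolute agreement or consistency. The sketch assumes the single-measure, absolute-agreement form, often written ICC(2,1) in the Shrout and Fleiss convention. The function name `icc_2_1` and the sample values are hypothetical and are not the study's data or code.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    `ratings` holds one row per subject (e.g. one tumor) and one column
    per reader (e.g. radiologist, DL algorithm). This form is an assumption;
    the table note only specifies a two-way random-effects model.
    """
    n = len(ratings)      # number of subjects
    k = len(ratings[0])   # number of readers
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares (no replication).
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in ratings for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)              # between-subject mean square
    msc = ss_cols / (k - 1)              # between-reader mean square
    mse = ss_err / ((n - 1) * (k - 1))   # residual mean square

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


if __name__ == "__main__":
    # Illustrative values only (cm): column 0 = reader A, column 1 = reader B.
    pairs = [[2.1, 2.2], [3.4, 3.3], [4.8, 4.9], [1.6, 1.5], [3.0, 3.2]]
    print(round(icc_2_1(pairs), 3))
```

The absolute-agreement form penalizes a systematic offset between readers through the between-reader mean square, which is why it is a common choice when the question is whether an algorithm's measurements can stand in for a radiologist's.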

ISSN: 1470-7330