TY - JOUR
T1 - Prediction of upstaged ductal carcinoma in situ using forced labeling and domain adaptation
AU - Hou, Rui
AU - Mazurowski, Maciej A.
AU - Grimm, Lars J.
AU - Marks, Jeffrey R.
AU - King, Lorraine M.
AU - Maley, Carlo C.
AU - Hwang, Eun Sil Shelley
AU - Lo, Joseph Y.
N1 - Funding Information:
This work was supported in part by NIH/NCI R01-CA185138
Publisher Copyright:
© 1964-2012 IEEE.
PY - 2020/6
Y1 - 2020/6
N2 - Objective: The goal of this study is to use adjunctive classes to improve a predictive model whose performance is limited by the common problems of small numbers of primary cases, high feature dimensionality, and poor class separability. Specifically, our clinical task is to use mammographic features to predict whether ductal carcinoma in situ (DCIS) identified at needle core biopsy will be later upstaged or shown to contain invasive breast cancer. Methods: To improve the prediction of pure DCIS (negative) versus upstaged DCIS (positive) cases, this study considers the adjunctive roles of two related classes: atypical ductal hyperplasia (ADH), a non-cancer type of breast abnormity, and invasive ductal carcinoma (IDC), with 113 computer vision based mammographic features extracted from each case. To improve the baseline Model A's classification of pure vs. upstaged DCIS, we designed three different strategies (Models B, C, D) with different ways of embedding features or inputs. Results: Based on ROC analysis, the baseline Model A performed with AUC of 0.614 (95% CI, 0.496-0.733). All three new models performed better than the baseline, with domain adaptation (Model D) performing the best with an AUC of 0.697 (95% CI, 0.595-0.797). Conclusion: We improved the prediction performance of DCIS upstaging by embedding two related pathology classes in different training phases. Significance: The three new strategies of embedding related class data all outperformed the baseline model, thus demonstrating not only feature similarities among these different classes, but also the potential for improving classification by using other related classes.
AB - Objective: The goal of this study is to use adjunctive classes to improve a predictive model whose performance is limited by the common problems of small numbers of primary cases, high feature dimensionality, and poor class separability. Specifically, our clinical task is to use mammographic features to predict whether ductal carcinoma in situ (DCIS) identified at needle core biopsy will be later upstaged or shown to contain invasive breast cancer. Methods: To improve the prediction of pure DCIS (negative) versus upstaged DCIS (positive) cases, this study considers the adjunctive roles of two related classes: atypical ductal hyperplasia (ADH), a non-cancer type of breast abnormity, and invasive ductal carcinoma (IDC), with 113 computer vision based mammographic features extracted from each case. To improve the baseline Model A's classification of pure vs. upstaged DCIS, we designed three different strategies (Models B, C, D) with different ways of embedding features or inputs. Results: Based on ROC analysis, the baseline Model A performed with AUC of 0.614 (95% CI, 0.496-0.733). All three new models performed better than the baseline, with domain adaptation (Model D) performing the best with an AUC of 0.697 (95% CI, 0.595-0.797). Conclusion: We improved the prediction performance of DCIS upstaging by embedding two related pathology classes in different training phases. Significance: The three new strategies of embedding related class data all outperformed the baseline model, thus demonstrating not only feature similarities among these different classes, but also the potential for improving classification by using other related classes.
KW - Breast cancer
KW - Domain adaptation
KW - Ductal carcinoma in situ
KW - Forced labeling
KW - Mammographic features
UR - http://www.scopus.com/inward/record.url?scp=85085257231&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85085257231&partnerID=8YFLogxK
U2 - 10.1109/TBME.2019.2940195
DO - 10.1109/TBME.2019.2940195
M3 - Article
C2 - 31502960
AN - SCOPUS:85085257231
SN - 0018-9294
VL - 67
SP - 1565
EP - 1572
JO - IEEE Transactions on Biomedical Engineering
JF - IEEE Transactions on Biomedical Engineering
IS - 6
M1 - 8827297
ER -