A deep learning framework to classify breast density with noisy labels regularization


Por: Lopez-Almazan H, Javier Pérez-Benito F, Larroza A, Perez-Cortes JC, Pollan M, Perez-Gomez B, Salas Trejo D, Casals M and Llobet R

Publicada: 1 jun 2022 Ahead of Print: 1 may 2022
Resumen:
Background and Objective: Breast density assessed from digital mammograms is a biomarker for higher risk of developing breast cancer. Experienced radiologists assess breast density using the Breast Image and Data System (BI-RADS) categories. Supervised learning algorithms have been developed with this objective in mind, however, the performance of these algorithms depends on the quality of the ground truth information which is usually labeled by expert readers. These labels are noisy approximations of the ground truth, as there is often intra-and inter-reader variability among labels. Thus, it is crucial to provide a reliable method to obtain digital mammograms matching BI-RADS categories. This paper presents RegL (Labels Regularizer), a methodology that includes different image pre-processes to allow both a correct breast segmentation and the enhancement of image quality through an intensity adjustment, thus allowing the use of deep learning to classify the mammograms into BI-RADS categories. The Confusion Matrix (CM) CNN network used implements an architecture that models each radiologist's noisy label. The final methodology pipeline was determined after comparing the performance of image pre-processes combined with different DL architectures.Methods: A multi-center study composed of 1395 women whose mammograms were classified into the four BI-RADS categories by three experienced radiologists is presented. A total of 892 mammograms were used as the training corpus, 224 formed the validation corpus, and 279 the test corpus.Results: The combination of five networks implementing the RegL methodology achieved the best results among all the models in the test set. The ensemble model obtained an accuracy of (0.85) and a kappa index of 0.71. Conclusions: The proposed methodology has a similar performance to the experienced radiologists in the classification of digital mammograms into BI-RADS categories. This suggests that the pre-processing steps and modelling of each radiologist's label allows for a better estimation of the unknown ground truth labels.(c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )

Filiaciones:
Lopez-Almazan H:
 Instituto Tecnológico de la Informática, Universitat Politècnica de València,Camino de Vera, s/n, 46022 València, Spain

Javier Pérez-Benito F:
 Instituto Tecnológico de la Informática, Universitat Politècnica de València,Camino de Vera, s/n, 46022 València, Spain

Larroza A:
 Instituto Tecnológico de la Informática, Universitat Politècnica de València,Camino de Vera, s/n, 46022 València, Spain

Perez-Cortes JC:
 Instituto Tecnológico de la Informática, Universitat Politècnica de València,Camino de Vera, s/n, 46022 València, Spain

Pollan M:
 National Center for Epidemiology, Carlos III Institute of Health, Monforte de lemos, 5, 28029 Madrid, Spain

 Consortium for Biomedical Research in Epidemiology and Public Health (CIBER en Epidemiología y Salud Pública - CIBERESP), Carlos III Institute of Health, Monforte de Lemos 5, 28029 Madrid, Spain. Electronic address:

Perez-Gomez B:
 National Center for Epidemiology, Carlos III Institute of Health, Monforte de lemos, 5, 28029 Madrid, Spain

:
 Valencian Breast Cancer Screening Program, General Directorate of Public Health, Valencia, Spain

 Centro Superior de Investigación en Salud Pública CSISP, FISABIO, València, Spain

:
 Valencian Breast Cancer Screening Program, General Directorate of Public Health, Valencia, Spain

 Centro Superior de Investigación en Salud Pública CSISP, FISABIO, València, Spain

Llobet R:
 Instituto Tecnológico de la Informática, Universitat Politècnica de València,Camino de Vera, s/n, 46022 València, Spain
ISSN: 01692607





COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE
Editorial
Elsevier BV, ELSEVIER HOUSE, BROOKVALE PLAZA, EAST PARK SHANNON, CO, CLARE, 00000, IRELAND, Irlanda
Tipo de documento: Article
Volumen: 221 Número:
Páginas: 106885-106885
WOS Id: 000806506100005
ID de PubMed: 35594581
imagen hybrid, Green Published

FULL TEXT

imagen Published Version CC BY

MÉTRICAS