Overview
In this study, we propose a breast-sentence dataset and investigate its usefulness in computer-aided diagnosis. Starting from a conventional breast mammography dataset, the Digital Database for Screening Mammography (DDSM) [2], we annotated cases with natural-language sentences that follow the standardized terms defined in the Breast Imaging-Reporting and Data System (BI-RADS). This web page provides the dataset of paper [1]. All images on this web page are from [2]. The dataset provides only caseID-sentence pairs. If you use our dataset, please cite both [1] and [2]. The dataset is provided for academic research only, and any use of it for commercial applications is prohibited.
The references to cite are:
[1] Hyebin Lee, Seong Tae Kim, and Yong Man Ro (2019). Building a Breast-Sentence Dataset: Its Usefulness for Computer-Aided Diagnosis. ICCV 2019 Workshop on Visual Recognition for Medical Images (ICCV-VRMI 2019).
[2] Michael Heath, Kevin Bowyer, Daniel Kopans, Richard Moore, and W. Philip Kegelmeyer (2001). The Digital Database for Screening Mammography. In Proceedings of the Fifth International Workshop on Digital Mammography, M.J. Yaffe, ed., pp. 212-218. Medical Physics Publishing. ISBN 1-930524-00-5.