Preprocessing for classification of thermograms in breast cancer detection

Łukasz Neumann , Robert Marek Nowak , Rafał Okuniewski , Witold Oleszkiewicz , Paweł Cichosz , Dariusz Jagodziński , Mateusz Matysiewicz

Abstract

Performance of binary classification of breast cancer suffers from high imbalance between classes. In this article we present the preprocessing module designed to negate the discrepancy in training examples. Preprocessing module is based on standardization, Synthetic Minority Oversampling Technique and undersampling. We show how each algorithm influences classification accuracy. Results indicate that described module improves overall Area Under Curve up to 10% on the tested dataset. Furthermore we propose other methods of dealing with imbalanced datasets in breast cancer classification.
Author Łukasz Neumann (FEIT / ICS)
Łukasz Neumann,,
- The Institute of Computer Science
, Robert Marek Nowak (FEIT / PE)
Robert Marek Nowak,,
- The Institute of Electronic Systems
, Rafał Okuniewski
Rafał Okuniewski,,
-
, Witold Oleszkiewicz (FEIT / IN)
Witold Oleszkiewicz,,
- The Institute of Computer Science
, Paweł Cichosz (FEIT / PE)
Paweł Cichosz,,
- The Institute of Electronic Systems
, Dariusz Jagodziński (FEIT / PE)
Dariusz Jagodziński,,
- The Institute of Electronic Systems
, Mateusz Matysiewicz
Mateusz Matysiewicz,,
-
Pages100313A-1-100313A-8
Publication size in sheets0.5
Book Romaniuk Ryszard (eds.): Proc. SPIE. 10031, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2016, vol. 10031, 2016, P.O. Box 10, Bellingham, Washington 98227-0010 USA , SPIE , ISBN 9781510604858, [781510604865 (electronic) ], 1170 p., DOI:10.1117/12.2257157
DOIDOI:10.1117/12.2249307
URL http://dx.doi.org/10.1117/12.2249307
Languageen angielski
File
100313A_neumann.pdf 271.75 KB
Score (nominal)15
Score sourceconferenceIndex
ScoreMinisterial score = 15.0, 06-07-2020, BookChapterMatConfByConferenceseries
Ministerial score (2013-2016) = 15.0, 06-07-2020, BookChapterMatConfByConferenceseries
Publication indicators WoS Citations = 0; Scopus Citations = 4; GS Citations = 8.0
Citation count*8 (2020-08-28)
Cite
Share Share

Get link to the record


* presented citation count is obtained through Internet information analysis and it is close to the number calculated by the Publish or Perish system.
Back
Confirmation
Are you sure?