Detecting reliable non interacting proteins (NIPs) significantly enhancing the computational prediction of protein-protein interactions using machine learning methods

Atul K. Srivastava , G. Mazzocco , A. Kel , Lucjan S Wyrwicz , Dariusz Plewczyński


Protein-protein interactions (PPIs) play a vital role in most biological processes. Hence their comprehension can promote a better understanding of the mechanisms underlying living systems. However, besides the cost and the time limitation involved in the detection of experimentally validated PPIs, the noise in the data is still an important issue to overcome. In the last decade several in silico PPI prediction methods using both structural and genomic information were developed for this purpose. Here we introduce a unique validation approach aimed to collect reliable non interacting proteins (NIPs). Thereafter the most relevant protein/protein-pair related features were selected. Finally, the prepared dataset was used for PPI classification, leveraging the prediction capabilities of well-established machine learning methods. Our best classification procedure displayed specificity and sensitivity values of 96.33% and 98.02%, respectively, surpassing the prediction capabilities of other methods, including those trained on gold standard datasets. We showed that the PPI/NIP predictive performances can be considerably improved by focusing on data preparation.

Author Atul K. Srivastava - [Maria Sklodowska-Curie Institute – Oncology Center]
Atul K. Srivastava,,
, G. Mazzocco - [University of Warsaw, Centre of New Technologies]
G. Mazzocco,,
, A. Kel - [GeneXplain GmbH]
A. Kel,,
, Lucjan S Wyrwicz - [Maria Sklodowska-Curie Institute – Oncology Center]
Lucjan S Wyrwicz,,
, Dariusz Plewczyński (FMIS / DIPS)
Dariusz Plewczyński,,
- Department of Information Processing Systems
Journal seriesMolecular BioSystems, [Molecular BioSystems], ISSN 1742-206X, e-ISSN 1742-2051
Issue year2016
ASJC Classification1312 Molecular Biology; 1305 Biotechnology
Languageen angielski
Score (nominal)30
Score sourcejournalList
ScoreMinisterial score = 25.0, 04-06-2020, ArticleFromJournal
Ministerial score (2013-2016) = 30.0, 04-06-2020, ArticleFromJournal
Publication indicators Scopus Citations = 2; WoS Citations = 1; Scopus SNIP (Source Normalised Impact per Paper): 2016 = 0.753; WoS Impact Factor: 2016 = 2.781 (2) - 2016=2.823 (5)
Citation count*
Share Share

Get link to the record

* presented citation count is obtained through Internet information analysis and it is close to the number calculated by the Publish or Perish system.
Are you sure?