Please use this identifier to cite or link to this item:
Title: ParSMURF-NG: A Machine Learning High Performance Computing System for the Analysis of Imbalanced Big Omics Data
Authors: Petrini, Alessandro
Notaro, Marco
Gliozzo, Jessica
Castrignanò, Tiziana 
Robinson, Peter N.
Casiraghi, Elena
Valentini, Giorgio
Issue Date: 2022
In the context of Genomic and Precision Medicine, prediction problems are often characterized by a high imbalance between classes and Big Data. This requires specialized tools, as traditional Machine Learning approaches may struggle with big datasets and often fail to predict the minority class with unbalanced classification problems. In this work we present ParSMURF-NG, a High Performance Computing-oriented Machine Learning approach designed to scale well on big omics data. We measured its performance capabilities on three current-generation HPC systems and we showed its usefulness in the context of Genomic Medicine, providing a powerful model for the detection of pathogenic single nucleotide variants in the non-coding regions of the human genome.
ISBN: 9783031083402
DOI: 10.1007/978-3-031-08341-9_34
Appears in Collections:D1. Contributo in Atti di convegno

Files in This Item:
File Description SizeFormat Existing users please
18th_AIAI_2022_CAMERA_READY (2).pdf343.98 kBAdobe PDF    Request a copy
Show full item record

Page view(s)

Last Week
Last month
checked on Jun 7, 2023


checked on Jun 7, 2023

Google ScholarTM



All documents in the "Unitus Open Access" community are published as open access.
All documents in the community "Prodotti della Ricerca" are restricted access unless otherwise indicated for specific documents