Please use this identifier to cite or link to this item:
http://hdl.handle.net/2067/48596
Title: | ParSMURF-NG: A Machine Learning High Performance Computing System for the Analysis of Imbalanced Big Omics Data | Authors: | Petrini, Alessandro Notaro, Marco Gliozzo, Jessica Castrignanò, Tiziana Robinson, Peter N. Casiraghi, Elena Valentini, Giorgio |
Issue Date: | 2022 | Abstract: | In the context of Genomic and Precision Medicine, prediction problems are often characterized by a high imbalance between classes and Big Data. This requires specialized tools, as traditional Machine Learning approaches may struggle with big datasets and often fail to predict the minority class with unbalanced classification problems. In this work we present ParSMURF-NG, a High Performance Computing-oriented Machine Learning approach designed to scale well on big omics data. We measured its performance capabilities on three current-generation HPC systems and we showed its usefulness in the context of Genomic Medicine, providing a powerful model for the detection of pathogenic single nucleotide variants in the non-coding regions of the human genome. |
URI: | http://hdl.handle.net/2067/48596 | ISBN: | 9783031083402 | DOI: | 10.1007/978-3-031-08341-9_34 |
Appears in Collections: | D1. Contributo in Atti di convegno |
Files in This Item:
File | Description | Size | Format | Existing users please |
---|---|---|---|---|
18th_AIAI_2022_CAMERA_READY (2).pdf | 343.98 kB | Adobe PDF | Request a copy |
Page view(s)
37
Last Week
0
0
Last month
0
0
checked on Jun 7, 2023
Download(s)
3
checked on Jun 7, 2023
Google ScholarTM
Check
Altmetric
All documents in the "Unitus Open Access" community are published as open access.
All documents in the community "Prodotti della Ricerca" are restricted access unless otherwise indicated for specific documents