References of "Gómez de Lope, Elisa 50040349"
     in
Bookmark and Share    
Full Text
Peer Reviewed
See detailGraph neural networks for investigating complex diseases: A case study on Parkinson's Disease
Gómez de Lope, Elisa UL; Viñas Torné, Ramón; Liò, Pietro et al

Poster (2023, July 25)

Omics data analysis is a critical component in the study of complex diseases, but the high dimension and heterogeneity of the data often pose challenges that are difficult to address by classical ... [more ▼]

Omics data analysis is a critical component in the study of complex diseases, but the high dimension and heterogeneity of the data often pose challenges that are difficult to address by classical statistical and machine learning methods. Recently, structured data analyses using graph neural networks (GNNs) have emerged as a promising complementary approach, particularly for investigating the relational information between samples. However, it is still unclear which strategies for designing and optimizing GNNs are most effective when working with real-world data from complex disorders, such as Parkinson's disease (PD). Our study addresses this gap by examining the application of various GNN models, including Graph Convolutional Network, ChebyNet, and Graph Attention Network, to identify and interpret discriminative patterns between PD patients and controls using omics data. The developed pipeline integrates Lasso penalty-based feature selection, similarity graph construction, and final modeling for sample classification. Through an end-to-end model building and evaluation process, we assess the practical utility of the pipeline on independent PD omics datasets. Overall, our analyses highlight some of the benefits and challenges of using graph structure data for machine learning analysis of disease-related omics data and provide directions for further research. [less ▲]

Detailed reference viewed: 134 (0 UL)
Full Text
Peer Reviewed
See detailMachine learning applied to higher order functional representations of omics data reveals biological pathways associated with Parkinson‘s Disease
Gómez de Lope, Elisa UL; Glaab, Enrico UL

Poster (2022, September 18)

Background: Despite the increasing prevalence of Parkinson’s Disease (PD) and research efforts to understand its underlying molecular pathogenesis, early diagnosis of PD remains a challenge. Machine ... [more ▼]

Background: Despite the increasing prevalence of Parkinson’s Disease (PD) and research efforts to understand its underlying molecular pathogenesis, early diagnosis of PD remains a challenge. Machine learning analysis of blood-based omics data is a promising non-invasive approach to finding molecular fingerprints associated with PD that may enable an early and accurate diagnosis. Description: We applied several machine learning classification methods to public omics data from PD case/control studies. We used aggregation statistics and Pathifier’s pathway deregulation scores to generate higher order functional representations of the data such as pathway-level features. The models’ performance and most relevant predictive features were compared with individual feature level predictors. The resulting diagnostic models from individual features and Pathifier’s pathway deregulation scores achieve significant Area Under the Curve (AUC, a receiver operating characteristic curve) scores for both cross-validation and external testing. Furthermore, we identify plausible biological pathways associated with PD diagnosis. Conclusions: We have successfully built machine learning models at pathway-level and single-feature level to study blood-based omics data for PD diagnosis. Plausible biological pathway associations were identified. Furthermore, we show that pathway deregulation scores can serve as robust and biologically interpretable predictors for PD. [less ▲]

Detailed reference viewed: 266 (11 UL)
Full Text
Peer Reviewed
See detailTen Quick Tips for Biomarker Discovery and Validation Analyses Using Machine Learning
Diaz-Uriarte, R.; Gómez de Lope, Elisa UL; Giugno, R. et al

in PLoS Computational Biology (2022), 18(8), 1010357

High-throughput experimental methods for biosample profiling and growing collections of clinical and health record data provide ample opportunities for biomarker discovery and medical decision support ... [more ▼]

High-throughput experimental methods for biosample profiling and growing collections of clinical and health record data provide ample opportunities for biomarker discovery and medical decision support. However, many of the new data types, including single-cell omics and high-resolution cellular imaging data, also pose particular challenges for data analysis. A high dimensionality of the data in relation to small numbers of available samples, influences of additive and multiplicative noise, large numbers of uninformative or redundant data features, outliers, confounding factors and imbalanced sample group numbers are all common characteristics of current biomedical data collections. While first successes have been achieved in developing clinical decision support tools using multifactorial omics data, there is still an unmet need and great potential for earlier, more accurate and robust diagnostic and prognostic tools for many complex diseases. Here, we provide a set of broadly applicable tips to address some of the most common pitfalls and limitations for biomarker signature development, including supervised and unsupervised machine learning, feature selection and hypothesis testing approaches. In contrast to previous guidelines discussing detailed aspects of quality control, statistics or study reporting, we give a broader overview of the typical challenges and sort the quick tips to address them chronologically by the study phase (starting with study design, then covering consecutive phases of biomarker signature discovery and validation, see also the overview in Fig. 1). While these tips are not comprehensive, they are chosen to cover what we consider as the most frequent, significant, and practically relevant issues and risks in biomarker development. By pointing the reader to further relevant literature on the covered aspects of biomarker discovery and validation, we hope to provide an initial guideline and entry point into the more detailed technical and application-specific aspects of this field. [less ▲]

Detailed reference viewed: 173 (13 UL)