Reciprocal Perspective: A Cascaded Semi-Supervised Machine Learning Framework to Improve Pairwise Classification & Regression
Public Deposited- Resource Type
- Creator
- Contributors
- James Green (Supervisor)
- Abstract
Many real-world problems can be represented as a network, with nodes representing elements and the edges (i.e. links; or lack thereof) capturing the relationship between elements. An example domain that leverages link prediction algorithms to elucidate relationships between pairs of nodes is the task of protein-protein interaction (PPI) prediction. Leveraging high-performance computing and optimized PPI predictors, it is recently possible to evaluate every possible combination of paired nodes enabling the generation of a comprehensive prediction matrix (CPM). We introduce a novel semi-supervised machine learning method, denoted Reciprocal Perspective (RP), which leverages this new wealth of information by extracting context-based features from this CPM by considering reciprocal views of pairwise elements for use in a cascaded classifier which has demonstrated significant improvement in predictive performance. Historically, this achievable wealth of information has been ignored due to computational intractability. We demonstrate that expending compute resources to generate CPMs is a worthy investment given the improvement in predictive performance in both classification- and regression-type tasks. This thesis makes contributions at all stages of a prototypical prediction pipeline. We demonstrate that RP is applicable to a variety of application domains within bioinformatics (PPI, microRNA-target, and drug-target interaction prediction) as well as within Network Science with Recommendation Systems. Furthermore, RP is demonstrated to improve individual model performance as well as function as an ensemble method to combine multiple experts. Taken together, these contributions demonstrate that RP can be broadly applied for pairwise prediction problems across different domains, problem formulations, and varying scales of data.
- Subject
- Language
- Publisher
- Thesis Degree Level
- Thesis Degree Name
- Thesis Degree Discipline
- Identifier
- Rights Notes
- Copyright © 2022 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.
- Date Created
- 2022
Relations
- In Collection:
Items
Thumbnail | Title | Date Uploaded | Visibility | Actions |
---|---|---|---|---|
|
dick-reciprocalperspectiveacascadedsemisupervised_r.pdf | 2023-05-05 | Public | Download |