2021
DOI: 10.1007/978-3-030-65351-4_7
|View full text |Cite
|
Sign up to set email alerts
|

Experimental Evaluation of Train and Test Split Strategies in Link Prediction

Abstract: In link prediction, the goal is to predict which links will appear in the future of an evolving network. To estimate the performance of these models in a supervised machine learning model, disjoint and independent train and test sets are needed. However, objects in a realworld network are inherently related to each other. Therefore, it is far from trivial to separate candidate links into these disjoint sets.Here we characterize and empirically investigate the two dominant approaches from the literature for cre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…Many different classification algorithms are known to work well in link prediction. Commonly used classifiers include logistic regression (Potgieter et al 2007;O'Madadhain et al 2005), support vector machines (Al Hasan et al 2006;Öczan A, Öğüdücü 2017), k-nearest neighbours (Al Hasan et al 2006;Bütün et al 2018Bütün et al , 2016, and random forests (Öczan A, Öğüdücü 2017;Bütün et al 2016Bütün et al , 2018Ghasemian et al 2020;de Bruin et al 2021de Bruin et al , 2020. We report performances using the logistic regression classifier.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Many different classification algorithms are known to work well in link prediction. Commonly used classifiers include logistic regression (Potgieter et al 2007;O'Madadhain et al 2005), support vector machines (Al Hasan et al 2006;Öczan A, Öğüdücü 2017), k-nearest neighbours (Al Hasan et al 2006;Bütün et al 2018Bütün et al , 2016, and random forests (Öczan A, Öğüdücü 2017;Bütün et al 2016Bütün et al , 2018Ghasemian et al 2020;de Bruin et al 2021de Bruin et al , 2020. We report performances using the logistic regression classifier.…”
Section: Related Workmentioning
confidence: 99%
“…A common approach in temporal link prediction is to employ a supervised machine learning model that utilises multiple features to classify which links are missing or, in case of temporal link prediction, will appear in the future (de Bruin et al 2021). Features are typically computed for every pair of nodes that is not (yet) connected, based on the topology of the network (Kumar et al 2020).…”
Section: Introduction and Problem Statementmentioning
confidence: 99%
“…These models struggle with over-fitting in link prediction because of the abundance of features. In other words, the designed machine learning model performs well on educational data but less well on experimental data [6]. In homogeneous networks, there is also the issue of an unbalanced data set.…”
Section: Introductionmentioning
confidence: 96%