“…To summarise, the majority of studies have evaluated the performance of their model using microaveraged F1-score, followed by macro-averaged F1-score, and standard F1-score. [65] Flat SVM, Hierarchy-based SVM -Marafino et al [55] SVM -Subotin and Davis [86] Two-level hierarchical classification -Kavuluru et al [43] SVM, LR, MNB BR, copy transformation, ECC Ayyar and Oliver [4] LSTM -Prakash et al [67] C-MemNN and A-MemNN End-to-End Memory Network, KV-MemNNs Lin et al [52] CNN SVM, RF, GBM Berndorfer and Henriksson [8] Flat SVM and Hierarchical SVM -Amoia et al [3] LR and CNN -Catling et al [13] RNN-GRU -Baumel et al [5] SVM, CBOW, CNN, HA-GRU -Mullenbach et al [61] CAML, DR-CAML CNN, LR, Bi-GRU, Flat SVM [65], HA-GRU [5], C-MemNN [67], [80],…”