“…TC algorithms require that text features are formatted before they can be interpreted by the specified classifier, this process is also referred to as term weighting because each term is entered together with a weight value. Included papers show the most used technique is the Term Frequency-Inverse Document Frequency (TF-IDF) as in [27,32,37,40,43,45,48,51,53,55,57,58,[60][61][62]67]. It is a statistical method to indicate the significance of a word within a given corpus.…”