Abstract: Author identification is the process of recognizing an author based on a sample of text. Feature selection is the process of selecting the most salient features required for recognition. In many cases, this results in an increase in recognition accuracy. In this paper, we apply Genetic and Evolutionary Feature Selection with Machine Learning (GEFeS ML) to author identification. We then introduce Genetic Heuristic Development (GHD), a process to improve the matching process. GHD uses subsets of features found …
“…The validation set was used for cross-validation in an effort to reduce overfitting [16]. GEFeS was an instance of a Steady-State Genetic Algorithm implemented in X-TOOLSS [15], [19]. GEFeS evolved a population of 20 FMs.…”
Section: A Results Of Experiments I: English To English
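
The quoted passage describes GEFeS as a steady-state genetic algorithm evolving a population of 20 FMs. The following is a minimal sketch of that general idea, assuming "FMs" denotes binary feature masks. Only the steady-state scheme and the population size of 20 come from the quoted text; the selection, crossover, and mutation operators, the function names, and the toy fitness function are illustrative assumptions and do not reproduce the X-TOOLSS implementation.

```python
import random


def steady_state_ga(fitness, n_features, pop_size=20, evaluations=500,
                    mutation_rate=0.05, seed=0):
    """Evolve binary feature masks with a steady-state GA (illustrative sketch).

    Operators here (binary tournament, uniform crossover, bit-flip mutation)
    are assumptions, not the operators used by GEFeS or X-TOOLSS.
    """
    rng = random.Random(seed)
    # Each individual is a feature mask: 1 keeps a feature, 0 drops it.
    population = [[rng.randint(0, 1) for _ in range(n_features)]
                  for _ in range(pop_size)]
    scores = [fitness(mask) for mask in population]

    def pick_parent():
        # Binary tournament: the fitter of two distinct random individuals.
        a, b = rng.sample(range(pop_size), 2)
        return population[a] if scores[a] >= scores[b] else population[b]

    for _ in range(evaluations):
        p1, p2 = pick_parent(), pick_parent()
        # Uniform crossover followed by bit-flip mutation.
        child = [rng.choice(pair) for pair in zip(p1, p2)]
        child = [1 - g if rng.random() < mutation_rate else g for g in child]
        child_score = fitness(child)
        # Steady-state step: one offspring per iteration, and it replaces
        # the current worst individual only if it is at least as fit.
        worst = min(range(pop_size), key=scores.__getitem__)
        if child_score >= scores[worst]:
            population[worst] = child
            scores[worst] = child_score

    best = max(range(pop_size), key=scores.__getitem__)
    return population[best], scores[best]


# Hypothetical stand-in for recognition accuracy: rewards three "useful"
# features and lightly penalizes the number of features kept.
USEFUL = (0, 3, 5)


def toy_fitness(mask):
    return sum(mask[i] for i in USEFUL) - 0.1 * sum(mask)


best_mask, best_score = steady_state_ga(toy_fitness, n_features=8)
```

In a real setting, `toy_fitness` would be replaced by a measure such as validation-set recognition accuracy computed with only the masked-in features. Replacing a single individual per iteration, rather than the whole generation, is what makes the scheme steady-state.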