Introduction: The large amount of digital textual information available on the Internet makes the organization, analysis and extraction of knowledge essential both in the academic world and in the job market, making automatic text classification increasingly important. Question classification is a subgroup of text classification and basically consists of associating one or more labels with each question, according to a predetermined criterion, but with less text available than the general documents. The main applications of automatic question classification systems are: QA (Question/Answering), IR (Information Retrieval), educational environment, and specific languages processing. The QA and IR systems have as their starting point a question written in natural language and, from there, search a collection of documents in the web that are compatible with the subject described. Considering specifically the educational environment, the automatic generation of assessment tests has immediate practical application in e-learning systems by enabling the personalization of teaching through the search for questions that are appropriate to a particular learning profile, the so-called adpative learning systems. To enable personalization, it is essential to classify questions within a representative range of appropriate competencies and skills. Large-scale evaluations (ENEM, SAEB, Prova Brasil) could be a source of information for this generation, as they use evaluation reference matrices to classify questions according to the areas of knowledge, disciplines, competencies and expected skills of students. One way to perform this classification is through Machine Learning algorithms that are able to extract patterns or generalize classes by generating mathematical models from the available data. Examples of Machine Learning algorithms are: neural networks, decision trees, support vector machines (SVM), naive bayes, among others. The different forms of text representation and Machine Learning algorithms have extensive research done when it comes to classifying documents with large amounts of text; when it comes to short excerpts (such as questions), this task becomes more complex because the amount of text available for analysis is reduced when compared to other types of textual documents. In addition, the majority of current research addresses the problem of QA or IR, and there is not a lot of research available considering the educational environment. Objectives: (i) Identify the architecture of a classifier or set of classifiers in order to maximize the performance of the question classification process in the educational context; (ii) perform an empirical evaluation to compare the performance of the different combinations used; (iii) make available representations, algorithms, source codes and tools developed for the scientific community to evaluate and replicate results; and (iv) make available tools for integration and application of content developed for use by other platforms and institutions (schools, companies) interes...