Patent classification is a task performed in patent offices around the world by experts, where they assign category codes to a patent application based on its technical content. Nowadays, the number of applications is constantly growing and there is an economical interest on developing accurate and fast models to automate the classification task. In this paper, we present a methodology to systematically analyze the effect of three patent data properties and two classification details on the patent classification task: patent section to use for training/testing, document representation, patent codes to use for training, use of the hierarchy of categories, and the base classifier. For the analysis we create a diversity of models by combining different options for the properties. We evaluate the models in detail using standard patent datasets in two languages, English and German, considering three performance metrics, using statistical tests to validate the results and comparing them with other models in the literature. Our research findings indicate that it is important to follow a methodology to properly choose the options for the data properties to build a model according to our goal, considering classification accuracy and computational efficiency. Some combinations of options build models with good results but with high computational cost, whilst other build model that produce slightly worst results but at a fraction of the training time.