2018
DOI: 10.21105/joss.00501
|View full text |Cite
|
Sign up to set email alerts
|

Category Encoders: a scikit-learn-contrib package of transformers for encoding categorical data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 28 publications
(13 citation statements)
references
References 2 publications
0
12
0
1
Order By: Relevance
“…Some common categorical encoding methods are listed in Table 1 . For a more extensive list of encoding techniques, one can refer to the work of other researchers [ 89 , 90 , 91 ]. It has to be emphasised that the cardinality of variables has to be taken into account, since most of them are not suitable for high cardinality due to the so-called curse of dimensionality.…”
Section: ML Methods For Financial Fraud Data Classificationmentioning
confidence: 99%
“…Some common categorical encoding methods are listed in Table 1 . For a more extensive list of encoding techniques, one can refer to the work of other researchers [ 89 , 90 , 91 ]. It has to be emphasised that the cardinality of variables has to be taken into account, since most of them are not suitable for high cardinality due to the so-called curse of dimensionality.…”
Section: ML Methods For Financial Fraud Data Classificationmentioning
confidence: 99%
“…The sum of the latent representations of their features represents each user, which enables us to generalize to new users and new source-destinations. For hybrid matrix factorization, we have a set of features encoding customer information using one-hot encoding [ 51 ] to denote the various customer features, such as gender, age range, nationality, and airline membership tier. The next set of features identifies the source-destination pair.…”
Section: Methodsmentioning
confidence: 99%
“…This research is mainly completed with the help of Scikit-leam (sklearn) toolkit based on Python language [17]. The toolkit contains a variety of commonly used machine learning methods, and regression, clustering and preprocessing modules.…”
Section: Construction Of Academic Performance Prediction Model Of Online Education Based On Rf Algorithmmentioning
confidence: 99%