2019
DOI: 10.1186/s40537-019-0181-8
|View full text |Cite
|
Sign up to set email alerts
|

The effects of class rarity on the evaluation of supervised healthcare fraud detection models

Abstract: The United States healthcare system produces an enormous volume of data with a vast number of financial transactions generated by physicians administering healthcare services. This makes healthcare fraud difficult to detect, especially when there are considerably less fraudulent transactions (documented and readily available) than non-fraudulent. The ability to successfully detect fraudulent activities in healthcare, given such discrepancies, can garner up to $350 billion in recovered monetary losses. In machi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 32 publications
(17 citation statements)
references
References 46 publications
0
16
0
Order By: Relevance
“…We feel using these parameters in addition to SMOTE or ADASYN would be redundant, since these algorithms balance the data before it is presented to a classifier. For further studies on application of techniques for addressing classs imbalance see [ 65 ] and [ 6 ].…”
Section: Catboost Applications By Fieldmentioning
confidence: 99%
“…We feel using these parameters in addition to SMOTE or ADASYN would be redundant, since these algorithms balance the data before it is presented to a classifier. For further studies on application of techniques for addressing classs imbalance see [ 65 ] and [ 6 ].…”
Section: Catboost Applications By Fieldmentioning
confidence: 99%
“…These extreme imbalance and rarity conditions provide a problem statement as to whether RUS treatments can improve classification performance. In addition to these severe class imbalances, the XSS and SQL Injection web attacks exhibit rarity [53] with a low PCC as indicated in Table 2.…”
Section: Sampling Techniquesmentioning
confidence: 99%
“…We feel using these parameters in addition to SMOTE or ADASYN would be redundant, since these algorithms balance the data before it is presented to a classifier. For further studies on application of techniques for addressing classs imbalance see [48] and [56]. Yang and Beth report that the data they use in their study is imbalanced with 142 out of 9,666 records in the positive class for one dataset they use, and 109 out of 8,445 records in the positive class for another dataset.…”
Section: Medicinementioning
confidence: 99%
“…We feel using these parameters in addition to SMOTE or ADASYN would be redundant, since these algorithms balance the data before it is presented to a classifier. For further studies on application of techniques for addressing classs imbalance see [48] and [56].…”
Section: Financementioning
confidence: 99%