2018
DOI: 10.1007/s13755-018-0051-3
|View full text |Cite
|
Sign up to set email alerts
|

The effects of varying class distribution on learner behavior for medicare fraud detection with imbalanced big data

Abstract: Healthcare in the United States is a critical aspect of most people's lives, particularly for the aging demographic. This rising elderly population continues to demand more cost-effective healthcare programs. Medicare is a vital program serving the needs of the elderly in the United States. The growing number of Medicare beneficiaries, along with the enormous volume of money in the healthcare industry, increases the appeal for, and risk of, fraud. In this paper, we focus on the detection of Medicare Part B pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
41
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 64 publications
(50 citation statements)
references
References 37 publications
0
41
0
Order By: Relevance
“…In this example, the majority group of healthy patients is referred to as the negative class. Learning from these imbalanced data sets can be very difficult, especially when working with big data [8,9], and non-standard machine learning methods are often required to achieve desirable results. A thorough understanding of the class imbalance problem and the methods available for addressing it is indispensible, as such skewed data exists in many real-world applications.…”
mentioning
confidence: 99%
“…In this example, the majority group of healthy patients is referred to as the negative class. Learning from these imbalanced data sets can be very difficult, especially when working with big data [8,9], and non-standard machine learning methods are often required to achieve desirable results. A thorough understanding of the class imbalance problem and the methods available for addressing it is indispensible, as such skewed data exists in many real-world applications.…”
mentioning
confidence: 99%
“…Therefore, developing well-balanced training data is an important step. There are two categories of methods that handle this class imbalance problem: Data-Level methods (e.g., data sampling), and Algorithm-Level methods (e.g., cost-sensitive and hybrid/ensemble) [ 56 , 57 , 58 ].…”
Section: Resultsmentioning
confidence: 99%
“…3 Using real-time data collection, the Office of the Inspector General can compare patient volume for similar professional claims to identify abnormally high reimbursement submissions, unnatural practice growth patterns, or unusually high numbers of procedures based on specialty and practice size or to flag suspect patient visits patterns (such as an excessive number of patients during a 24-hour window.) 22,23 This artificial intelligence-based system for identifying potential program integrity anomalies is relatively new. But CMS is also directed to cases by whistleblowers, who are incentivized to report fraud under the False Claims Act and Stark Law (ie, prohibition on self-referral), which entitle them to receive a percentage of any government recoveries.…”
Section: Solutions To Mitigate Fraud and Abusementioning
confidence: 99%