Use of responsible artificial intelligence to predict health insurance claims in the USA using machine learning algorithms

Ashrafe Alam; Victor R. Prybutok

doi:10.37349/edht.2024.00009

Open Access

Original Article

Use of responsible artificial intelligence to predict health insurance claims in the USA using machine learning algorithms

Affiliation:

¹Department of Information Science, University of North Texas, Denton, TX 76207, USA

Email: AshrafeAlam@my.unt.edu

ORCID: https://orcid.org/0009-0009-5182-5006

Ashrafe Alam ^1*

Affiliation:

²Department of Information Technology and Decision Science, G. Brint Ryan College of Business, University of North Texas, Denton, TX 76201, USA

ORCID: https://orcid.org/0000-0003-3810-9039

Victor R. Prybutok ²

Explor Digit Health Technol. 2024;2:30–45 DOI: https://doi.org/10.37349/edht.2024.00009

Received: September 20, 2023 Accepted: November 28, 2023 Published: February 28, 2024

Academic Editor: Atanas G. Atanasov, Medical University of Vienna, Austria; Shariful Islam, Deakin University, Australia

The article belongs to the special issue Data-informed Decision Making in Healthcare

Abstract

Aim: This study investigates the potential of artificial intelligence (AI) in revolutionizing healthcare insurance claim processing in the USA. It aims to determine the most effective machine learning (ML) model for predicting health insurance claims, leading to cost savings for insurance companies.

Methods: Six ML algorithms were used to predict health insurance claims, and their performance was evaluated using various metrics. The algorithms examined include support vector machine (SVM), decision tree (DT), random forest (RF), linear regression (LR), extreme gradient boosting (XGBoost), and k-nearest neighbors (KNN). The research involves a performance assessment that encompasses key metrics. Additionally, a feature importance analysis is conducted to illuminate the critical variables that exert influence on the prediction of insurance claims.

Results: The findings demonstrate that the XGBoost and RF models outperformed the other algorithms, displaying the highest R-squared values of 79% and 77% and the lowest prediction errors. The feature importance analysis underscores the pivotal role of variables such as smoking habits, body mass index (BMI), and blood pressure levels in the domain of insurance claim prediction. These results emphasize the degree to which these variables should be included in the formulation of insurance policies and pricing strategies.

Conclusions: This study supports the transformative potential of AI, with specific emphasis on the XGBoost model, in extending the precision and efficiency of healthcare insurance claim processing. The identification of key variables and the mitigation of prediction errors not only signal the potential for substantial cost savings but also affirm the potential to integrate AI into healthcare insurance processes. This research supports the value of the utilization of AI as an emerging tool for process optimization and data-informed decision-making within the healthcare insurance domain.

Keywords

Insurance claim, responsible artificial intelligence, machine learning

Introduction

A sizeable amount of healthcare spending is allocated to processing insurance claims, making the healthcare sector in the USA one of the world’s largest and most complex enterprises. National health expenditure (NHE) grew 4.6% to $3.6 trillion in 2018, accounting for 17.7% of the gross domestic product (GDP) [1]. Medicare spending grew 6.4% to $750.2 billion, Medicaid expenditure grew 3.0% to $597.4 billion, private health insurance spending grew 5.8% to $1.243 trillion, and out-of-pocket spending grew 2.8% to $375.6 billion (these are 21%, 16%, 34%, and 10% of total NHE) in 2018 [1]. According to the 2021 national health statistics report, 28.1 million individuals, or 8.6% of the total population of the USA, lack health insurance coverage, 65.4% have private health insurance, 56.9% are employer-based, and 7.2% have directly purchased coverage for the population under age 65 [2]. Among this age group, nearly two of every five children and one of every five adults relied on public health coverage through Medicaid and children’s health insurance programs. These estimates are characterized based on specific sociodemographic attributes, such as age, gender, race, Hispanic origin, family income, education, employment situation, and marital status [2]. A study explored that healthcare access is a fundamental right for every American citizen [3]. Unfortunately, the USA is confronted with a substantial and expanding segment of its population lacking insurance coverage. After hitting a low point of 28.7 million individuals in 2016, the projected trajectory indicates a rise in the number of uninsured individuals to 37.2 million by 2028 [3]. This concerning trend coincides with a period in which an increasing wealth of research establishes a strong connection between having insurance coverage and notable enhancements in financial stability, overall well-being, and lifespan [3].

Health insurance claims processing is crucial in this rapidly changing environment, as it holds the potential to revolutionize the healthcare landscape by providing prompt and accurate reimbursements and optimizing insurance provider cost management and risk assessment. The processing of health insurance claims entails looking over and verifying medical records, billing data, and payment authorization. The insurance market in the USA provides a vital safety net for both individuals and corporations. Insurance companies employ data and predictive modeling to measure risk and establish premiums. However, the insurance sector’s adoption of responsible artificial intelligence (AI) ensures that forecasting models are impartial, fair, and open. In the insurance sector, predicting insurance claims is essential because it aids insurers in calculating the likelihood of an event occurring and the potential cost. The employment of AI in this process has prompted questions about bias and discrimination, particularly against specific demographic groups. Developing and deploying prediction models can be guided by responsible AI principles to alleviate all the concerns. This includes ensuring the model is transparent, understandable, constantly monitored, and updated to prevent bias and discrimination.

This study explores how the potential application of responsible AI affects claims processing accuracy and identifies the most useful model for forecasting a customer’s insurance claim process. Various methods have been explored to address concerns about AI algorithm interpretability [4]. Recent studies have created metrics for the relevance of specific covariates based on their contribution to model prediction accuracy [4]. Furthermore, it investigates the variables that can be investigated to forecast insurance claims with greater precision and address any potential ethical issues by using predictive models for processing insurance claims. To prevent prejudice and ensure that clients receive the coverage they require, it is crucial to ensure that AI systems used in insurance are developed and implemented transparently, equitably, and ethically.

The use of AI in the insurance sector has raised moral questions regarding transparency, fairness, and bias, particularly in the prediction of insurance claims. Consequently, there is an increasing focus on employing responsible AI within this sector. Ensuring the responsible development and implementation of AI systems involves transparent and ethical consideration of their potential impacts on individuals and society. This policy must be followed to protect consumers from unfair treatment and prevent discrimination against specific categories of individuals. In the insurance sector, predicting insurance claims is essential because it aids insurers in calculating the likelihood of an event occurring and the potential cost. The use of AI in this process has prompted questions about bias and discrimination, particularly against specific demographic groups. Responsible AI principles can direct the creation and use of prediction models to allay all worries. The data used to train the model must also be varied and representative of the community.

A crucial area of concentration for the insurance business is the use of responsible AI in the forecasting of insurance claims in the USA. Moreover, the use of responsible AI for forecasting insurance claims can add to the body of knowledge regarding AI ethics and governance. AI technologies must be used responsibly, ethically, and within legal and regulatory frameworks as they become more common and significant. This study seeks to answer the following questions:

(1)
What model can be used to predict an efficient insurance claim process for a customer?
(2)
Which variable could be analyzed to anticipate insurance claims more accurately?
(3)
What potential ethical considerations are associated with using such predictive models?

To identify best practices and prospective areas for development, this study also analyzes the existing literature on AI and insurance claim processing. In recent years, several businesses have started using AI to automate jobs that people are typically hired to complete, such as detecting fraudulent movements, choosing resumes, processing credit-related requests, and releasing those people for high-level duties [5]. According to a survey of 2,360 business executives about the use of AI, more than 62% of executives said that AI solutions increased their revenues, decreased their costs, and enhanced customer satisfaction [6]. AI has proven its value in different business sectors by quickly establishing automated environments that are controlled and digitally upgraded for maximum efficiency in 2017 [7]. The use of AI in Tanzania’s healthcare sector includes applications for disease prediction and diagnosis, vaccine stock optimization, and health supply chain management [8]. AI can assist in the methods mentioned above to improve customer satisfaction and revenues and cut down on fraud, inefficient time use, and operational complexity [9]. One of the important areas across Europe where AI can address many issues with the health system is the health sector [10]. Several problems prevent medical AI from being used properly and effectively. These confrontations include data privacy, intellectual property rights, accountability, openness, cybersecurity, accuracy, performance, bias, and discrimination. Thus, it is advised that strategic decisions should be made when putting AI-based innovations into practice for companies to make sure that i) they are responsible, ii) the challenges are properly addressed, and iii) there is a balance between opposing interests and values [10, 11]. Besides the hazards, the growing application of AI exacerbates intrinsic problems with trust and accountability. Enterprises must be aware of the difficulties and dangers associated with AI and take these into full consideration when presenting suggested plans to effectively address these issues [11].

Adaptive boosting (AdaBoost), a whitebox algorithm, outperforms all additional models in terms of performance, aids in lowering operating costs for providers, improves the speed and accuracy of the insurance claim process and allows patients to concentrate on their recovery rather than navigating the insurance claim appeals process [12]. Because of the unstable nature of intelligent applications, the theoretical paradigm for the growth of responsible AI was founded on perceived risk theory. Digital healthcare AI risks are inversely correlated with responsible AI [13]. There should be a complete framework for responsible AI that businesses can use to emphasize and address important issues when developing and implementing responsible AI applications. Governance provides an ongoing foundation for all other aspects and assists businesses in creating AI that complies with the relevant regulations and upholds ethical standards [14]. The operation of the Apriori algorithm consists of two steps: First, obtain frequent item sets of the largest possible size, and then, use these frequent item sets to generate rules by locating all their subsets [14]. Both supervised and unsupervised machine learning (ML) algorithms were used, including support vector machine (SVM), logistic regression, naive Bayes, random forest (RF) classifiers, deep neural networks, and AdaBoost. They examined the efficacy of every algorithm for detecting blockchain fraud and discovered that RF, AdaBoost, and SVM generated effective outcomes [15].

The aftereffects of the different processes reduced the number of claims filed as a text message was sent to the insured individual that full or a portion of a claim may be denied. Furthermore, it might impose conditions on the claimant as the decision is being made, which could prolong the process [16]. The paper’s primary goal is to use the Apriori algorithm to find similarities between medical bills and purchasing bills. This method searches a database of frequently occurring item sets to identify item sets whose occurrences exceed a specified threshold [17]. The main objective while building an AI solution is to identify the easiest model that performs the best. Using ML to solve problems in trading and investment management has made things more useful and opened new options for the economy. This is possible because computers are getting faster, data storage costs are going down, and big data is ready to use [18, 19]. A supervised learning rule first completes a foundational task using sample data and then attempts to build a temporary performance, leading to the plotting of new input vectors. In several application areas, supervised learning algorithms are used. A comparable goal is for the supervised learning rule to cut back superbly from the knowledge to the contained objects in the best possible setting, helping the rule to appropriately index the class labels for near occurrences [20–22]. It is necessary to continuously test various AI algorithms because the performance of an AI model varies with the core data structures. AI systems’ accuracy, simplicity, and inter-portability can all be traded off. To choose the finest-performing AI algorithm with the least amount of complexity and the greatest degree of interpretability, it is crucial to investigate several AI algorithms. The intended study uses six AI algorithms: two interpretable, two whiteboxes, and two blackboxes [23].

Materials and methods

Necessary data were gathered to construct ML models based on USA health insurance claims by people aged 18 to 60 in four different regions [24]. The data covers information about insurance claims, including age, gender, body mass index (BMI), blood pressure, diabetic status, number of children, smoking status, and region of the insured person. It must be cleaned up and sorted before the data is used to build ML models. Data cleaning eliminates erroneous data by extracting input and output features that contribute to effectively fitting the best model. A description of the features is given in Table 1. Data preparation is the vital process of refining and adapting data to make it well-suited for ML algorithms. The quality of this process significantly influences the model’s performance. It encompasses tasks such as data cleansing, exploratory data analysis (EDA), standardization, and reducing dimensionality. Data cleaning is the key step of the following retrieval, which involves identifying and eliminating erroneous, misleading, incomplete, and corrupt data. By using a mean substitution, a dummy value that treats the missing feature values as missing values can take their place.

Table 1. Description of health insurance database

Number	Variable name	Explanation
1.	Age	Age of primary beneficiary
2.	Gender	Gender of the beneficiary
3.	BMI	BMI of the beneficiary (kg/m²) using the ratio of height to weight, ideally 18.5 kg/m² to 25 kg/m²
4.	Blood pressure	Whether the insured person has blood pressure (mmHg) or not
5.	Diabetic	Whether the insured person is diabetic or not
6.	Children	Number of children of the insured person
7.	Smoker	Whether the insured person is a smoker or not
8.	Region	The residential areas of the beneficiary in the USA are Northeast USA, Southeast USA, Southwest USA, and Northwest USA
9.	Claim	Amount of the insurance claim

Variable	Insurance claim (%)
Gender
Male	50.3
Female	49.7
Diabetic
Yes	47.8
No	52.2
Smoker
Yes	20.6
No	79.4

Region	Insurance claim (%)
Southeast USA	33.2
Northeast USA	17.3
Southwest USA	23.6
Northwest USA	25.9

Statistics	Age (years)	BMI	Blood pressure (mmHg)	Children	Claim ($)
Minimum	18	16.00	80	0	1,121.87
Maximum	60	53.10	140	5	63,770.43
Mean	38.09	30.66	94	1	13,325.25
Standard deviation	11.11	6.12	11	1	12,109.62

Performance measures	Algorithm
Performance measures	SVM	DT	RF	LR	XGBoost	KNN
R-square	0.10	0.57	0.77	0.68	0.79	0.31
Adjusted R-square	0.09	0.57	0.77	0.68	0.78	0.30
Mean square error (MSE)	149,194,459.22	58,612,767.52	30,736,317.10	43,614,936.74	29,099,812.49	94,609,424.94
Root MSE (RMSE)	12,214.52	7,595.89	5,544.03	6,604.16	5,394.42	9,726.74
Mean absolute error (MAE)	8,188.16	5,170.15	4,066.94	5,072.64	3,870.03	6,769.37
Mean absolute percentage error (MAPE)	1.01	0.72	0.66	0.71	0.63	0.98

Abstract

Keywords

Introduction

Materials and methods

Results

Discussion

Abbreviations

Declarations

Author contributions

Conflicts of interest

Ethical approval

Consent to participate

Consent to publication

Availability of data and materials

Funding

Copyright

References

Data science techniques to gain novel insights into quality of care: a scoping review of long-term care for older adults

Do you need a blockchain in healthcare data sharing? A tertiary review

Digital twin technology training and research in health higher education: a review

Developing a multi-variate prediction model for COVID-19 from crowd-sourced respiratory voice data

Quantifying and mapping population response to the COVID-19 pandemic in different countries for the period 2020–2022