Prediction Model for COVID-19 Patient Visits in the Ambulatory Setting

Research Article

J Fam Med. 2022; 9(1): 1283.

Prediction Model for COVID-19 Patient Visits in the Ambulatory Setting

Li RC1*, Harrison CK4, Jurkovitz CT1, Ndura K1, Papas MA1, Kerzner RK2, Teal C3 and Chiam T5

1Institute for Research on Equity and Community Health (iREACH), ChristianaCare Health Services, USA

2Medical Group, ChristianaCare Health Services, USA

3Family and Community Medicine, ChristianaCare Health Services, USA

4Connecticut Children’s and UConn School of Medicine, USA

5Infection Prevention & Control, Children’s Hospital of Philadelphia, USA

*Corresponding author: Li RC, Institute for Research on Equity and Community Health (iREACH), ChristianaCare Health Services, Avenue North, 4000 Nexus Drive, Wilmington, DE 19803, USA

Received: October 29, 2021; Accepted: December 29, 2021; Published: January 05, 2022

Abstract

Objective: Healthcare systems globally were shocked by coronavirus disease 2019 (COVID-19). Policies put in place to curb the tide of the pandemic resulted in a decrease of patient volumes throughout the ambulatory system. The future implications of COVID-19 in healthcare are still unknown, specifically the continued impact on the ambulatory landscape. The primary objective of this study is to accurately forecast the number of COVID-19 and non-COVID-19 weekly visits in primary care practices.

Materials and Methods: This retrospective study was conducted in a single health system in Delaware. All patients’ records were abstracted from our electronic health records system (EHR) from January 1, 2019 to July 25, 2020. Patient demographics and comorbidities were compared using t-tests, Chi square, and Mann Whitney U analyses as appropriate. ARIMA time series models were developed to provide an 8-week future forecast for two ambulatory practices (AmbP) and compare it to a naïve moving average approach.

Results: Among the 271,530 patients considered during this study period, 4,195 patients (1.5%) were identified as COVID-19 patients. The best fitting ARIMA models for the two AmbP are as follows: AmbP1 COVID-19+ ARIMAX(4,0,1), AmbP1 nonCOVID-19 ARIMA(2,0,1), AmbP2 COVID-19+ ARIMAX(1,1,1), and AmbP2 nonCOVID-19 ARIMA(1,0,0).

Discussion and Conclusion: Accurately predicting future patient volumes in the ambulatory setting is essential for resource planning and developing safety guidelines. Our findings show that a time series model that accounts for the number of positive COVID-19 patients delivers better performance than a moving average approach for predicting weekly ambulatory patient volumes in a short-term period.

Keywords: Ambulatory; COVID-19; ARIMA; Time series analysis; Family medicine

Introduction

Healthcare systems were globally shocked by a novel coronavirus, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the resulting disease, coronavirus disease 2019 (COVID-19) [1-3]. On March 11, 2020, COVID-19 was declared a global pandemic by the World Health Organization (WHO) [4]. By mid-March, transmission of COVID-19 had rapidly accelerated, increasing case counts throughout the United States, and it was found that many patients with severe disease also had common comorbidities such as hypertension, obesity and diabetes [5,6]. In the state of Delaware, the first presumptive positive case of COVID-19 was reported by the Delaware Division of Public Health on March 11, 2020 [7]. In order to mitigate the spread of the virus, the Governor of Delaware declared a state of emergency on March 13, 2020. The weeks that followed included several modifications to the original state of emergency to minimize the spread of the virus.

In response to the growing pandemic, ChristianaCare Health Services, Inc. (ChristianaCare), which serves the majority catchment area of Northern Delaware and the most populous county in the state, followed suit with its own measures to mitigate spread, postponing all elective procedures in hospitals and all ambulatory practices effective March 17, 2020 to adhere to state and CDC guidelines. The ambulatory services at ChristianaCare adjusted the delivery of healthcare services by reducing the number of in-person visits to minimize the risk to patients and healthcare providers, redirecting patients to telehealth when appropriate. By April 2020, more than 80% of ambulatory visits were telehealth visits (Phone or video). This proportion rapidly decreased the following months to approximately 35% in September 2020. Primary care practices were screening patients according to CDC guidelines to determine eligibility for in person versus virtual visits [8]. The majority of patients who were suspected of Covid had telehealth visits. Although our proportion of Telehealth visits are somewhat larger, the trend is similar to the trend described by the Vizient organization from a cohort of 39 large organizations including ChristianaCare [9]. This resulted in a decrease of patient volumes throughout the ambulatory system. With the uncertainty that COVID-19 presented then, the Phase 1 reopening that occurred on June 1, 2020, and the rise in cases occurring, it is essential to understand how the ambulatory setting will continue to be affected in order to develop proper guidelines.

To understand the impact of the novel virus, scientists rely on community spread models to predict possible transmission. The popular susceptible, infected, and recovered (SIR) epidemiologic model and variations of this model have been used to gauge community spread of a variety of infectious diseases such as influenza and dengue fever [10-14]. SIR models have also been applied to inpatient settings to predict hospital capacity regarding admissions, ICU beds, and ventilators [11,13,15,16]. In addition to SIR models, the current literature on predicting patient volume varies from descriptive statistics to Discrete-event Simulation (DES), Markov modeling, and advanced time series models, with most of the studies that have used time series forecasting models focusing on emergency department and hospital admissions [17].

Time series forecasting in ambulatory visits prior to the COVID19 pandemic have been described in a few reports but other types of modeling for both in-person and telehealth visits are lacking [18-25]. The most used method for time series forecasting is the Box-Jenkins method otherwise known as the AutoRegressive Integrated Moving Average (ARIMA) model [26]. The ARIMA model has been used for its simplicity and flexibility in capturing linear patterns in a time series [17,19-22].

Significance

The future implications of COVID-19 in healthcare are still unknown, specifically how it will continue to affect the ambulatory landscape. This work aims to inform COVID-19 and nonCOVID-19 ambulatory resources allocation as well as guide ambulatory practices for in-person visits as in-person care might have been delayed. Integrated health systems, such as ours, could benefit from having insights into both ambulatory and inpatient predictions to optimize resources throughout the health system. We propose an ARIMA time series model to capture the changes in ambulatory patient volumes as a result of COVID-19.

Objective

The primary objective of this study is to accurately forecast the number of COVID-19 and nonCOVID-19 weekly visits in primary care practices for both Telehealth and in-person visits. The ability to forecast patient volumes in primary care locations by accurately evaluating the dynamic changes in patient visits and fitting these data to a statistical model is useful for the appropriate allocation of human and material resources for future planning. With the uncertainty that COVID-19 presents, healthcare systems have been adapting their ambulatory practices to adhere to state guidelines. Therefore, we developed a time series model that provides an 8-week future forecast for ambulatory practices and compared it to a moving average approach.

Materials and Methods

Study design

This retrospective study was conducted in primary care practices that are part of a single integrated health care system in Delaware (ChristianaCare), serving the primary catchment area of New Castle County. New Castle County is in the northernmost region of Delaware and as of 2019 has an estimated population of 558,753, accounting for nearly two-third of the entire state population [27]. We selected the patients’ records from the two practices that had the highest historical patient volumes among all clinics affiliated with ChristianaCare. Although the two practices were conducting both Telehealth and in-person, due to the small number of visits we combined them to obtain a total overall weekly visit count. Our study population included (1) COVID-19 patients who had prior family medicine ambulatory services within ChristianaCare in 2019 and had been previously hospitalized and discharged for COVID-19 or COVID-19 positive who were self-monitoring at home and had not been hospitalized. (2) Any patient who utilized ambulatory services from the same practices during the same time period and were not diagnosed with COVID-19. COVID-19 patients currently hospitalized were excluded from the population.

We extracted all patients’ records from our electronic health records system (EHR) from January 1, 2019 to July 25, 2020 and built two datasets. One included patient-level data (e.g. age, gender, race, ethnicity, insurance, marital status, and Elixhauser comorbidities) and the other ambulatory practice-related data (e.g. encounter location, encounter providers, and weekly patient volumes) [28]. Patient-level data were used for characterizing the study population and were not included in the forecasting model. Ambulatory practice-related data were primarily used for our time series models. For model development, we used one year of data between January 2019 and December 2019, and for model validation data from January 2020 until July 2020.

Statistical and forecasting methods

Descriptive statistics: Patient demographics and comorbidities were compared using t-tests, Chi square, Mann Whitney U analyses as appropriate according to the distribution.

Time-series: A time series is a sequential set of data points, measured typically over successive times. The ARIMA model was created for auto-correlated and non-stationary time series data [11]. The framework for ARIMA is displayed in Table 1.