How are digital health technologies advancing clinical research?
Digital health technologies are enhancing clinical research in the UK, streamlining participant experience, improving accessibility, and increasing...
Finding ways to diversify ethnic representation in clinical research is crucial for creating a more equitable, accessible, and impactful scientific future.
The COVID-19 pandemic has highlighted the importance of collecting and interpreting health data. Not only does it help shape our understanding of infection but it allows us to track health trends and devise effective disease management strategies that can positively advance public health. But health policies and measures are only as effective as the diversity of the data that they are informed by. If certain demographic groups are not included or underrepresented in research datasets, they risk being underserved by the solutions, medication or technology that the research underpinning them intends to develop.
Unfortunately, health inequities are perpetuated by a lack of diversity in clinical research, which is further compounded by unconscious biases in the development of new medical devices and technologies. However, there are measures that can be taken to ensure that no one is left behind.
In this article, I will focus on the ways to improve the ethnic diversity of clinical data. It's important to note that this focus on ethnicity does not diminish the importance of other identity determinants such as gender, sexuality, or neurodivergence. Each of these deserves its own dedicated piece in isolation as there is much to explore on each topic.
What did COVID-19 teach us?
The COVID-19 pandemic highlighted the uncomfortable truth of ethnic inequalities in healthcare. A recent UK analysis of 17 million primary care and death records revealed that individuals of Black or South Asian ancestry had a disproportionately higher risk of death, with an approximate twofold increase compared to those who identified as White (after adjusting for age and sex differences).
And while the magnitude of disparities underscored by this COVID-19 dataset may be challenging to accept, collecting data along ethnicity lines has provided valuable insights into the underlying issue and it is hoped that this could lead to the creation of new policies and practices to address these inequalities in the future.
At a surface level, the pandemic exposed the need for better data collection practices. A research study conducted by the Nuffield Trust revealed that ‘poor data about ethnicity has obscured the true extent of ethnic disparities in the impact of the pandemic’. The study further discovered that 13% of patient records in the UK do not contain a valid ethnic group classification, while 8.5% are categorised as 'not stated', and 8.8% as 'other', which are legal categories but not useful for analytical purposes.
Gaps in patient data like this have far-reaching implications, including the efficacy of clinical trials. Research has shown that ethnicity can play a significant role in the effectiveness of medicines. However, for COVID-19 vaccine trials, ethnicity data was not considered a priority. A study showed that out of 20,692 US-based trials with reported results representing ∼4.76 million enrollees, only 43% reported any ethnicity data. Moreover, where the data was available, the majority of enrollees were White (79.7%), while Black participants accounted for just 3% of major vaccine trial participation, despite representing 21% of COVID-19 related deaths.
An all-too-familiar challenge
Unfortunately, the COVID-19 pandemic starkly revealed a longstanding and systematic failure within clinical data to accurately represent ethnicity. Let’s consider ischemic heart disease (IHD) as an example. IHD stands as the most common cause of premature death in the UK; and of those that die from IHD, diabetes is the main contributing factor. However, our understanding of these conditions is largely derived from seminal studies like the Framingham Heart Study, the Physicians Health Study, and the Nurses Health Study, wherein 90% of participants were categorised as ‘White’.
Perhaps more alarming, is a review of 12 diabetes trials conducted in the UK. It revealed that despite comprising 11.2% of the UK diabetes population, South Asians accounted for only 5.5% of the participants. Additionally, it is concerning that ethnicity was not even reported in four of these studies. While diabetes is a complex condition that cannot be generalised based on observations in one population, it is often the case that biased views of what is normal' are established. To better advance our understanding of the disease and mitigate the risk for a broader range of the world's population, it is crucially important to consider diverse populations and avoid making universal assumptions.
Similarly the 'triglyceride paradox' vividly highlights the oversight within metabolic syndrome criteria — a set of indicators used to identify individuals at risk of type 2 diabetes and heart disease. Typically, metabolic syndrome encompasses at least three of the following biomarkers: elevated triglyceride levels, low high-density lipoprotein levels (HDL), increased waist circumference, high blood pressure, and fasting hyperglycemia. However, during the formulation of these criteria, the unique physiology of Black individuals was disregarded.
As a consequence, the distinct lack of association between insulin resistance and hypertriglyceridemia in Black individuals went unrecognised. Subsequently, it's now understood that hypertriglyceridemia, a hallmark of insulin resistance, occurs less frequently in Black individuals, leading to a lower prevalence of metabolic syndrome among this ethnicity. In essence, a critical diagnostic test for predicting the onset of type 2 diabetes and heart disease proves inadequate for Black individuals, resulting in under-diagnosis and heightened long-term health risks.
The dangers of health data uniformity are not limited to diabetes and metabolic conditions. For instance, a primary preventative method for heart disease in the UK involves the use of a validated risk-prediction algorithm, the QRISK score, to guide treatment decisions for individuals. This score synthesises various risk factors, including blood pressure, lipids, and ethnicity, to generate a 'risk score' for developing heart disease, and people who reach a certain threshold are identified for treatment. However, the QRISK score is based on large epidemiological datasets and, while effective at predicting risk at a population level, it cannot account for all the unique factors that impact an individual's risk of disease.
Incorporating genetic data into risk scores can enable a more personalised and precise approach. Large genome-wide association studies (GWAS) conducted over the last two decades have confirmed that cardiovascular disease is polygenic, meaning it is caused by the coexistence of thousands of gene variants. However, the polygenic risk scores that have been developed so far are derived mainly from a cohort of European ancestry. This is because Europeans represent 78% of people in GWAS studies, while only 2% of participants are of African descent. Consequently, polygenic risk scores are 4.5 times more accurate for people with European ethnicity than African.
Race correction in algorithms
Ethnic and racial categories are socially constructed, and are poor proxies for the relationship between geographical ancestry and genetic makeup. Indeed, studies have shown that there is more genetic variation within ethnic groups than between them. When race is conflated with genetic makeup, it can lead to subtle biases in medical practices, resulting in widening health disparities and worsened outcomes for ethnically diverse people. This issue is particularly noticeable within diagnostic algorithms.
A notable example can be observed in nephrology, where the CKD-EPI algorithm had been considered a reliable measure of estimated glomerular filtration rate (eGFR), an important marker for kidney health. The algorithm used race, along with age, gender, and levels of creatinine, and regularly reported higher eGFR levels in Black people, suggesting they had better kidney function. This finding was originally justified with evidence of higher than average serum creatinine concentrations among Black people, incorrectly explained by the idea that they are more muscular. However, studies have since cast doubt on this explanation, leading to one analysis that found removing race from the algorithm may increase the prevalence of Black adults with chronic kidney disease from 14.9% to 18.4%. After a review of the evidence, the National Institute for Health and Care Excellence (NICE) recommended the removal of race correction factors from the CKD-EPI algorithm.
Algorithms must take into account the impact of socioeconomic factors on health conditions and outcomes. Social deprivation is often correlated with ethnicity and may be part of the causal nexus linking ethnicity to poor health. For instance, in the United States, the 'Vaginal Birth After Caesarean' (VBAC) algorithm is designed to evaluate the risk associated with attempting a vaginal delivery for women who have previously undergone a caesarean section. The algorithm predicts that, all other factors being equal, African American or Hispanic women have a lower chance of a successful delivery.
As a result, these women are less likely to choose (or be offered) a vaginal birth. However, in its predictive analysis, the algorithm neglects crucial factors such as marital status and medical insurance, both of which significantly influence a woman's overall health and her access to quality healthcare. This oversight can directly impact their birthing experience. Conversely, White women, who statistically belong to higher socioeconomic groups, are deemed at lower risk. This is particularly noteworthy because vaginal deliveries offer distinct health advantages, including reduced rates of surgical complications, shorter recovery periods, and fewer complications during subsequent pregnancies. By inadequately accounting for socioeconomic variables, the VBAC algorithm inadvertently exacerbates health disparities among different ethnic groups, rather than mitigating them.
Charting a more diverse pathway forward
The NEJM also rightly emphasises the need for clinicians to take responsibility for the algorithms they use and ‘discern whether the correction is likely to relieve or exacerbate inequities. By critically evaluating the use of race in clinical algorithms, clinicians can ensure that they are not perpetuating health inequities and that they are providing equitable healthcare to all individuals, regardless of their race or ethnicity.
Conclusion
COVID-19 brought to light the failure of clinical data to accurately represent ethnicity. The implications of perpetuating these failures extend far beyond our ability to manage one pandemic. When ethnic groups are excluded from clinical research and go unrecorded in medical data, they are underserved by healthcare initiatives. This includes the potential of polygenic risk scores to provide personalised risk predictions with precision.
It is essential to address these issues by ensuring that clinical data accurately represents ethnic diversity. By doing so, we can improve our understanding of how diseases manifest in different populations and tailor healthcare initiatives to meet the needs of all individuals equally".
Researchers and clinicians alike must be prepared to call out these shortcomings for what they really are; systemic racism. Accepting the current levels of healthcare inequality is unacceptable. We must hold ourselves accountable, ensure diverse representation in data and research practices, and advocate for policies that promote diversity and inclusion.
The good news is that we are not ignorant to the issues surrounding clinical data bias and inaccuracy. We know what needs to be done to address it, and the key is collaboration. The problems and solutions are multifaceted, and we need to bring together experts from various domains, including researchers, practitioners, data scientists, public health bodies, social inclusion organisations, and businesses with innovative incentives in the remote diagnostics space. Collaboration is crucial to achieving our shared goals, and we must continue to work together to address these issues and promote diversity, equity, and inclusion in healthcare.
Get in touch with a member of the Thriva Solutions team today to learn more.
Digital health technologies are enhancing clinical research in the UK, streamlining participant experience, improving accessibility, and increasing...
From alleviating NHS pressure to driving innovations and clinical research, pharmacies can lead the way towards a more accessible, patient-centric,...
Remote diagnostics are revolutionising population health monitoring with profound implications for participant engagement, data handling, and the...
Be the first to know about new insights in today's healthcare industry.