Translation of ethnicity lists in multi-country real-world studies

When entering demographic information, participants registering for multi-country real-world studies typically select their ethnicity from a pre-determined list. Ethnicity can be taken to represent a self-claimed identity encompassing nationality, history, cultural origin, and possibly religion¹. As a result, the definition of what constitutes an ethnic group can become complex, particularly when considered in international studies.

Why is it important to collect ethnicity data?

Collecting demographic data allows us to understand our participant populations in a structured way, so we can analyze variability in the disease experience based on factors such as age, gender, and ethnicity. Analyzing differences in the pattern of ill health and disease in specified populations is the driving force for shaping public health policies², and robust demographic data is thus essential for visibility of diversity. To improve approval timelines, access to innovative therapies, and screening and treatment guidelines, it is necessary to critically assess the equity of healthcare delivery across ethnicity, race, and other socioeconomic factors³.

What are the obstacles to collecting ethnicity data in multi-country studies?

To put it simply, it is impossible to have an international standard for collecting ethnicity or ethnic group data.

For instance, the classifications of “American Indian” and “Alaska Native” in the US, or subclassifications of “white British” and “white Irish” in the UK, will be much less relevant in most other countries⁴. Alternatively, a classification used in one country (e.g., “Gypsy” in the UK) may be inappropriate in another, where a different term may be used to capture people who identify in a similar way. While the collection of ethnicity data is well established in certain countries, others, such as France and Germany, actively avoid collecting data on race or ethnicity of their citizens.

Naturally, these differences in classifications between countries can complicate various stages in the development of international real-world studies, in particular:

1. Localization

Clients may choose to extend ethnicity lists to cover all bases for a particular country, which—if certain categorizations have no direct equivalent in one country compared with another—can confound the translation process. This creates the need for cultural adaptation and localization, as unique ethnicity lists may be required for each country.

2. Data capture

To facilitate the comparison of data across a patient population, questions in surveys tend to have the same answer options, regardless of the language. When it comes to ethnicity lists, investigators may end up with different datasets depending on the country.

What is the solution?

There are ways to handle issues in collecting ethnicity data, to ensure generation of real-world evidence that paints a full and meaningful picture of disease impact:

Research and plan studies carefully to minimize issues with data capture further down the line. Ethnicity lists are likely to be study-specific, taking into account the countries involved and their associated national regulations, while capturing the main disease-relevant population groups in accordance with clinical trials or prior research.
Involve localization experts, particularly those specialized in life science translation, who will be able to guide the localization process to meet national standards as well as study endpoints.
Guarantee regulatory compliance by going through the appropriate ethical approval pathway and ensuring that ethnicity data can be collected in a particular country for that study.

At Vitaccess, we have experience in running real-world studies around the globe. To learn how our in-house experts can help you reach your patient-centered outcomes goals, contact us at info@vitaccess.com.

References

Connelly R et al. Method Innov 2016;9:1–10.
Bhopal R. J Epidemiol Community Health 2004;58(6):441–5.
Ju-Young JS et al. International and global issues – differences in health systems, patient populations, and medical practice. In: Girman CJ and Ritchey ME eds. Pragmatic Randomized Controlled Trials. Academic Press;2021:257–72.
GOV.UK. Data in government. 2022. Available at: https://dataingovernment.blog.gov.uk/2022/01/25/comparing-ethnicity-data-for-different-countries/. Accessed: Aug 2022.

By Fatemeh Amini and Anna Richards

Fatemeh Amini, MScR - Associate Medical Writer

Anna Richards, MA - Commercial Lead

Supporting evidence-generation with expertise

See All Publications

Estimating health-state utility values for family-caregivers of patients with Duchenne muscular dystrophy using time trade-off valuation

Borecka O, Llewellyn S, Woollacott I, Richardson L, Fellows A, Lawrence J, Bottomley C, Biggane AM (JPRO, 2026)

A Novel International Patient Registry in Chronic Inflammatory Demyelinating Polyneuropathy Linking Clinical and Patient-Reported Outcomes Data: The Vitaccess Real CIDP (VRCIDP) Registry

Bowmar E, Larkin M, Habib A, Di Salvo N, Rajabally Y (AAN 2026)

Impact of Caring for Patients with Monocarboxylate Transporter 8 (MCT8) Deficiency on Caregivers’ Sleep: Results from an International Cross-Sectional Study

Ofori A, Larkin MJ, Georges N (PES 2025)

What is the consistency of response to rimegepant at a group level in the acute treatment of migraine in UK adult patients? Protocol for a real-world, patient-centred study

O’Neil G, Abraham L, Pawinski R, Nakajima K, Bagshaw E, Fellows A, Llewellyn S, Lambru G (EHC, 2025)

Assessment of stable chronic bronchitis improvement with adjunctive long-term Mucinex® use via the cough and sputum assessment questionnaire (CASA-Q)

Divel C, Borecka O, Llewellyn S, Spangenthal S (CHEST, 2025)

Adjunctive long-term use of Mucinex® leading to improvement in stable chronic bronchitis and patient’s quality of life: A case report.

Spangenthal S, Divel C, Borecka O, Llewellyn S (Medical Reports, 2025)

Real-world use of Vitaccess Real™ platform to assess quality of life impact with long-term use of Mucinex® in stable chronic bronchitis

Spangenthal S, Divel C, Borecka O, Llewellyn S (ISOQOL, 2025)

Multiple myeloma caregiver costs and disabilities data for economic modelling and HTA submissions

Alsawady M, Tarnowska R, Kudlac A, Vincent S, Melrose D, Lied-Lied A (ISPOR Europe, 2025)

A targeted review of caregiver experiences of CAR-T therapy in ambulatory settings

Ringland C, Bagshaw E, Llewellyn S, Pugh G (ISPOR Europe, 2025)

Extrapulmonary disease burden and impact of cystic fibrosis (CF) on productivity in people with CF (pwCF) aged >12 years not treated with CF transmembrane conductance regulator modulators (CFTRm): interim analysis (IA) of the HUBBLE study

Elborn S, Mainz J, Abbott J, Carr S, Sole A, Costa S, Ganapathy V, Arteaga-Solis E, Liu J, Yuan J, Thorat T, Llewellyn S, Larkin M, De Iorio F, Kaplowitz H (ITS Annual Scientific Meeting, 2021)

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.

Necessary

Always Enabled

Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-marketing	1 year	This cookie is set by the GDPR Cookie Consent plugin to store the user consent for the cookies in the category "Marketing".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Necessary" category .
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
__hssrc	session	This cookie is set by Hubspot whenever it changes the session cookie. The __hssrc cookie set to 1 indicates that the user has restarted the browser, and if the cookie does not exist, it is assumed to be a new session.

Analytics

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.

Cookie	Duration	Description
hubspotutk	5 months 27 days	HubSpot sets this cookie to keep track of the visitors to the website. This cookie is passed to HubSpot on form submission and used when deduplicating contacts.
iutk	5 months 27 days	This cookie is used by Issuu analytic system to gather information regarding visitor activity on Issuu products.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_99254427_1	1 minute	Set by Google to distinguish users.
_ga_B2GQFRT399	2 years	This cookie is installed by Google Analytics.
_ga_JKZYG9CKXQ	2 years	This cookie is installed by Google Analytics.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
__hstc	5 months 27 days	This is the main cookie set by Hubspot, for tracking visitors. It contains the domain, initial timestamp (first visit), last timestamp (last visit), current timestamp (this visit), and session number (increments for each subsequent session).

Marketing

Others

Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.

Cookie	Duration	Description
AnalyticsSyncHistory	1 month	No description
li_gc	5 months 27 days	No description
ln_or	1 day	No description

Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.

Cookie	Duration	Description
mc	1 year 1 month	Quantserve sets the mc cookie to anonymously track user behaviour on the website.

Functional

Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.

Cookie	Duration	Description
bcookie	1 year	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	1 year	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
__hssc	30 minutes	HubSpot sets this cookie to keep track of sessions and to determine if HubSpot should increment the session number and timestamps in the __hstc cookie.

Join the Registry

Advanced patient-centric, science-driven research

Observational Studies

Qualitative Research

Registries

Health Utilities & Preference Studies

Quantitative Research

Analytics & Data Visualization

Patient-reported Outcomes

Translation & Linguistic Validation

Publication Support

Patient & Caregiver Burden Studies

Publications

Blogs

Case studies

White papers

Webinars

Study datasets

Advanced patient-centric, science-driven research

Observational Studies

Qualitative Research

Registries

Health Utilities & Preference Studies

Quantitative Research

Analytics & Data Visualization

Patient-reported Outcomes

Translation & Linguistic Validation

Publication Support

Patient & Caregiver Burden Studies

Publications

Blogs

Case studies

White papers

Webinars

Study datasets

Translation of ethnicity lists in multi-country real-world studies

Supporting evidence-generation with expertise

Cookies

Discover more from Vitaccess