6. Data available for sharing
Details of data - available to researchers worldwide
For full details of available data please review the online MCPS Data Showcase. The Showcase displays all the data types currently available, in a grouped format (i.e. not at the individual participant level), along with further information about each data field (for example, background information about how measures were taken). Genetic variation can be viewed using the MCPS online Variant Browser.
Baseline data (1998-2004)
Available for 159,517 participants
|
Month and year of recruitment Socio-demographic
Lifestyle characteristics
|
Prior diseases and medication Reproductive history (women)
Physical measurements
|
Blood samples
|
Resurvey data (2015-2019)
Available for 10,143 participants. Similar data to that collected at baseline plus:
|
Additional questionnaire data
|
Additional samples
|
Additional measurements
- Bioimpedance (fat mass, fat free mass, muscle mass, muscle score, bone mass, body water, degree of obesity, visceral fat rating, basal metabolic rate, metabolic age, Rohrer’s index)
- Pulse rate
Baseline NMR metabolomic data using the Nightingale Health platform
First release: 40,297 participants
|
14 Lipoprotein subclasses
|
Lipoprotein mean particle sizes and apolipoproteins
|
Cholines and glycolysis-related
|
|
7 Lipid measures for each subclass
|
Fatty acids
|
Amino acids
|
|
Ketone bodies, inflammation and kidney function
|
||
Genomic Data
Described fully in Ziyatdinov et al. 2023
|
Genome–wide genotyping with the Illumina Global Screening Array (GSA) version 2 • Non-filtered dataset (140,831 participants) • Quality controlled dataset (138,511 participants) Whole Exome Sequencing (WES) • Non-filtered dataset (141,046 participants) Whole Genome Sequencing (WGS) • Non-filtered dataset (9950 participants) • Phased WGS Imputation Reference Panel (MCPS10k)9,948 whole genome sequenced phased samples • Total of 134,337,444 variants distributed across 22 autosomes and chromosome X • Data available in four file formats. TopMed Imputed • Non-filtered dataset (140,831 participants) |
Mortality data (up to 30th September 2022)
- Date of death
- ICD-10 underlying cause
- ICD-10 contributory causes
- Timing/duration of diseases
- Location of death
- Seen by doctor before death
Apo A1=apolipoprotein A1; Apo B=apolipoprotein B; HDL=high density lipoproteins; HDL−D=high density lipoprotein particle diameter; IDL=intermediate density lipoproteins; L=large; LDL=low density lipoproteins; LDL−D=low density lipoprotein particle diameter; M=medium; S=small; VLDL=very low density lipoproteins; VLDL−D=very low density lipoprotein particle diameter; XL=very large; XS=very small; XXL=extremely large.
Additional data currently available only to researchers in Mexico
NMR metabolomic data using the Nightingale Health platform
Second release: 152,833 participants at baseline and 9657 participants at resurvey
All metabolites as listed for the first release plus Aceto-acetate, Clinical LDL-C, and, Glycine and Pyruvate.

