6. Data available for sharing
For full details of available data please review the online MCPS Data Showcase. The Showcase displays all the data types currently available, in a grouped format (i.e. not at the individual participant level), along with further information about each data field (for example, background information about how measures were taken). Genetic variation can be viewed using the MCPS online Variant Browser.
Details of data available to researchers worldwide
Baseline data (1998 - 2004) available for 159,517 participants
Month and year of recruitment
Socio-demographic
- age
- sex
- area of residence
- marital status
- educational achievement
- occupation
- income
- health service provider
Lifestyle characteristics
- smoking
- passive smoking
- alcohol consumption
- physical activity
- sleep duration
- fruit/vegetable intake
- fried food intake
- type of cooking oil used
Prior diseases and medication
Reproductive history (women)
- menopausal status
- hysterectomy
- oophorecomy
- hormone replacement therapy
- contraceptive use
- age at first sexual relationship
- age at first pregnancy
- number of pregnancies
Physical measurements
- height
- weight
- waist circumference
- hip circumference
- systolic blood pressure
- diastolic blood pressure
Blood samples
- time of blood sampling
- time since last meal
- glycosylated haemoglobin
Resurvey data (2015 - 2019) available for 10,143 participants. Similar data to that collected at baseline plus
Additional questionnaire data
- diabetes control questions
- diabetes consequences (e.g eyes, amputations, dialysis)
- fractures/falls
- treatment for breast cancer
- additional dietary questions (e.g sugary drinks, added salt, meat, fish, desserts, diets)
- cognitive function (MMSE)
Additional measurements
|
Additional samples
|
Baseline NMR metabolomic data using the Nightingale Health platform First release: 40,297 participants |
14 Lipoprotein subclasses
|
Lipoprotein mean particle sizes and apolipoproteins
|
Cholines and glycolysis-related
|
7 Lipid measures for each subclass
|
Fatty acids
|
Amino acids
|
Ketone bodies, inflammation and kidney function
|
Genomic DataDescribed fully in Ziyatdinov et al. 2023
Genome–wide genotyping with the Illumina Global Screening Array (GSA) version 2 Non-filtered dataset (140,831 participants) Quality controlled dataset (138,511 participants) Non-filtered dataset (141,046 participants) Non-filtered dataset (9950 participants) Phased WGS Imputation Reference Panel (MCPS10k) 9,948 whole genome sequenced phased samples Total of 134,337,444 variants distributed across 22 autosomes and chromosome X Data available in four file formats. mortality data (up to 30 september 2022)
|
Apo A1=apolipoprotein A1; Apo B=apolipoprotein B; HDL=high density lipoproteins; HDL−D=high density lipoprotein particle diameter; IDL=intermediate density lipoproteins; L=large; LDL=low density lipoproteins; LDL−D=low density lipoprotein particle diameter; M=medium; S=small; VLDL=very low density lipoproteins; VLDL−D=very low density lipoprotein particle diameter; XL=very large; XS=very small; XXL=extremely large. |
additional data currently available only to researchers in mexico
NMR metabolomic data using the Nightingale Health platform second release: 152833 participants at baseline and 9657 participants at resurvey
All metabolites as listed for the first release plus Phosphoglycerides, Pyruvate, Aceto-acetate and Clinical LDL-C.