University College London
Browse
ARCHIVE
corrupted_synthetic_ALSPAC.zip (7.84 MB)
ARCHIVE
uncorrupted_synthetic_ALSPAC.zip (1.05 MB)
1/0
2 files

Uncorrupted and Corrupted Synthetic Identifiers (based on ALSPAC)

dataset
posted on 2024-07-01, 15:50 authored by Joseph LamJoseph Lam, Andy Boyd, Katie HarronKatie Harron, Ruth BlackburnRuth Blackburn, Robin Linacre

These are synthetic identifiers generated based on gold-standard identifiers from participants in the Avon Longitudinal Study of Parents And Children. The datasets have undergone checks and are approved to be released by ALSPAC executive board. The synthetic data includes Mother surnames, Baby surnames and forenames, sex, ethnicity, age of mother at giving birth, ethnicity, index of multiple deprivation, and baby's date of birth.

Please refer to my GitHub Repo for full codes and steps to reproduce the process. Please read my paper for full methodology and appendix.

Paper Title: Generating Synthetic Identifiers to support development and evaluation of data linkage methods.

https://ijpds.org/article/view/2389


Funding

The Avon Longitudinal Study of Parents and Children (ALSPAC): A multi- generation, longitudinal resource focusing on life course health and well- being.

Wellcome Trust

Find out more...

Efficient and transparent methods for linking and analysing longitudinal population studies and administrative data

Wellcome Trust

Find out more...

History

Usage metrics

    University College London

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC