These are synthetic identifiers generated based on gold-standard identifiers from participants in the Avon Longitudinal Study of Parents And Children. The datasets have undergone checks and are approved to be released by ALSPAC executive board. The synthetic data includes Mother surnames, Baby surnames and forenames, sex, ethnicity, age of mother at giving birth, ethnicity, index of multiple deprivation, and baby's date of birth.
Please refer to my GitHub Repo for full codes and steps to reproduce the process. Please read my paper for full methodology and appendix.
Paper Title: Generating Synthetic Identifiers to support development and evaluation of data linkage methods.
The Avon Longitudinal Study of Parents and Children (ALSPAC): A multi- generation, longitudinal resource focusing on life course health and well- being.