University College London
Download file
Download file
2 files

Dawnn training set simulation

posted on 2023-05-04, 16:03 authored by George HallGeorge Hall, Sergi Castellano HerezaSergi Castellano Hereza

This project is a collection of files to allow users to reproduce the model development and benchmarking in "Dawnn: single-cell differential abundance with neural networks" (Hall and Castellano, under review). Dawnn is a tool for detecting differential abundance in single-cell RNAseq datasets. It is available as an R package here. Please contact us if you are unable to reproduce any of the analysis in our paper.

The files in this collection correspond to the code and resulting dataset from the training set generation procedure.


  • autogen4_code.R R code to generate training set (250,000 independent random walks are simulated, with each random walk constituting an instance within the training set). Saves output to labels_df.csv.
  • labels_df.csv Training dataset generated by autogen4_code.R (Each row corresponds to a training instance, with the first column containing the simulated probability of the cell at the centre of a simulated trajectory having been drawn from Condition_1, and the remaining columns containing the labels corresponding to its 1000 neighbouring cells, with labels drawn according to the random walks simualted in autogen4_code.R).


NIHR Great Ormond Street Hospital Biomedical Research Centre


Usage metrics

    University College London