(I) A compressed ASCII encoded archive (NDA-all-datasets.tar.gz) which contains 5 datasets:
The Neurofilament Degradome Atlas (NDA-dataset.txt): this is the basic dataset for the NDA. This is an ASCII, tab delimited, text file. The dataset contains the NDA IDs with their peptide sequences, protein properties calculated for ready use. All Neurofilament Degradome Atlas IDs reviewed in the validation study are tagged to the respective PRIDE repositories.
NDA_self_matches: all self-matches in the NDA by BLAST+.
NDA_non_self_matches: all non-self matches of the NDA with the SwissProt database.
The unique NDA sequences for HSP.
The unique NDA sequences for PD.
(II) A compressed archive (NDA-codes-python.tar.gz) which contains the phyton codes for creation of the NDA from the FASTA sequences for each of the 5 Nf isoforms: NfH, NfM, NfL, INA, PRP.