# Katharina J. Hoff # August 25th 2021 # Species specific AUGUSTUS parameters for Cyclotella cryptica Augustus parameter set name: Cyclotella_cryptica Lineage: Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta; Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales; Stephanodiscaceae; Cyclotella This parameter set was created with BRAKER on the basis of genome assembly https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/013/187/285/GCA_013187285.1_ASM1318728v1/GCA_013187285.1_ASM1318728v1_genomic.fna.gz and OrthoDb proteins https://v100.orthodb.org/download/odb10_plants_fasta.tar.gz Call: braker.pl --genome=genome.fa --prot_seq=proteins.fasta --softmasking --cores=8 --species=Cyclotella_cryptica --epmode Flanking region size: 988 Number of generated training genes: 2258 nonredundant genes, 200 set aside for testing, 200 set aside for optimize_augustus.pl time consuming steps Initial accuracy on test set: 0.5982 Accuracy on test set after optimize_augustus.pl: 0.6696 With this parameter set, BRAKER predicted 28883 transcripts (Augustus with hints in the target genome). To our knowledge, this might be the first annotation. Annotation is available upon request from katharina.hoff@uni-greifswald.de This parameter set was generated in a fully automated fashion for supporing the POMPU project (http://www.pompu-project.de/). If using this parameter set, please cite: Bruna, T., Hoff, K.J., Stanke, M., Lomsadze, A., & Borodovsky, M. (2021). BRAKER2: Automatic Eukaryotic Genome Annotation with GeneMark-EP+ and AUGUSTUS Supported by a Protein Database. NAR genomics and bioinformatics, lqaa108.