Abstract:
Background: CACTA transposable elements (TEs) comprise one of the most abundant superfamilies of Class 2 (cut and-paste) transposons. Over recent decades, CACTA elements were widely identifed in species from the plant, fungi,
and animal kingdoms, but sufciently studied in the genomes of only a few model species although non-model
genomes can bring additional and valuable information. It primarily concerned the genomes of species belonging to
clades in the base of large taxonomic groups whose genomes, to a certain extent, can preserve relict and/or pos sesses specifc traits. Thus, we sought to investigate the genomes of Chenopodium (Amaranthaceae, Caryophyllales)
species to unravel the structural variability of CACTA elements. Caryophyllales is a separate branch of Angiosperms
and until recently the diversity of CACTA elements in this clade was unknown.
Results: Application of the short-read genome assembly algorithm followed by analysis of detected complete
CACTA elements allowed for the determination of their structural diversity in the genomes of 22 Chenopodium
album aggregate species. This approach yielded knowledge regarding: (i) the coexistence of two CACTA transposons
subtypes in single genome; (ii) gaining of additional protein conserved domains within the coding sequence; (iii)
the presence of captured gene fragments, including key genes for fower development; and (iv)) identifcation of
captured satDNA arrays. Wide comparative database analysis revealed that identifed events are scattered through
Angiosperms in diferent proportions.
Conclusions: Our study demonstrated that while preserving the basic element structure a wide range of coding and
non-coding additions to CACTA transposons occur in the genomes of C. album aggregate species. Ability to relocate
additions inside genome in combination with the proposed novel functional features of structural-diferent CACTA
elements can impact evolutionary trajectory of the host genome.