This study provides unique insight into evolution of chimeric genes in plant mitochondria. Our observation of high divergence in nucleotide sequence of the two alleles of a complex chimeric gene,
bobt_MV and
bobt_KR, is surprising. We know of one other study, in
Raphanus raphanistrum
[21], that observed two alleles in a chimeric mt gene, but these differences were much less striking than what we found in
S. vulgaris. One characteristic of
S. vulgaris that may have contributed to high divergence between these alleles was the high mitochondrial DNA mutation rate in this species, which also influences the high diversity found in the
atp1 gene
[29],
[44],
[45]. Overall, 43 single nucleotide differences were observed between
bobt_MV and
bobt_KR, counting from the start codon through the 3′ regions homologous to
atp1; 10 of these differences were within the segments homologous to
atp1 ( and
S1).
The divergence between the two
bobt alleles and the
atp1 variants in
S. vulgaris allowed us to test whether
bobt had a single evolutionary origin or two independent origins. The phylogenetic analysis of
atp1 genic segments of
bobt and the corresponding regions of the
atp1 genes did not provide an unequivocal solution. Although it is possible that
bobt evolved twice independently, given the complex structure of this chimeric gene and the presence of a large region with unknown homology, it is more parsimonious to assume a single origin of the gene. Assuming a single origin, the two
bobt alleles have taken remarkably different evolutionary routes that include nucleotide divergence likely influenced by gene conversion and structural re-arrangement of flanking regions that have influenced expression patterns. In particular, our study contributes to the recent realization of the commonness of gene conversion in angiosperm mitochondria
[46] that is motivating reassessment of the factors contributing to this genomes' evolution
[47].
Gene conversion also likely contributed to divergence between
bobt_MV and
bobt_KR. Its affect can be most easily seen in the nearly identical sequences from sites 780–848 for
atp1.2_MV and
bobt_MV in
Figure S1. Several instances of gene conversion in plant mitochondrial genes have been described recently
[46],
[48],
[49] and patterns in
bobt indicate that gene conversion must be a persistent characteristic of its evolution in
Silene vulgaris. Moreover, independent gene conversion events appear to have occurred in each
bobt lineage, generating patterns consistent with the phylogeny in .
The overall structure of
bobt reveals clues regarding its origin. Chimeric ORFs are created from the fusion of DNA fragments via recombination across shared short repeats (<30 bp). Recombination at these short repeat sites in plant mitochondria is considered very rare and generally they are not considered to facilitate homologous recombination (Marechal and Brisson 2010), despite some recent findings that even very short repeats (6 bp) can mediate recombination
[50]. We found an 8 bp region of complete identity at the junction between the
atp1 and
cox2 segments of
bobt, which is a likely site for repeated recombination between the
atp1 and
cox2 genes. Because the origin of the internal part of the
bobt genes is unknown, we are unable to identify possible regions of similarity and potential recombination between the internal unknown region and the segments of
cox2 and
atp1. The presence of hotspots of recombination may increase the probability of independent origins of the two
bobt alleles (
bobt_KR and
bobt_MV). We note that the same
cox2 segment has been independently incorporated into two different chimeric regions in petunia and wheat, although the wheat region is likely a pseudogene
[51],
[52],
[53].
Although not detected in previous studies
[44],
[54], here we documented multiple copies of
atp1 within the MV and KR genomes. The dynamic nature of angiosperm mt genomic rearrangements has been shown to result in dramatically different copy numbers and relative positions of genes within genomes
[55],
[56],
[57]. It is likely that
Silene also harbors high variation in gene copy number among lineages and this is a likely explanation for the absence of previous detection. Nonetheless, caution is justified when assuming homology among mt coding regions in
Silene and other plant species.
Bobt has many of the hallmarks of CMS genes. One commonality among most chimeric CMS genes is the presence of portions of
ATP synthase subunits
[1]; portions of these subunits are found in both
bobt alleles. CMS genes also tend to be transcribed; they are not loss-of-function genes. Both
bobt genes are transcribed, albeit at a comparably low level in the case of
bobt_MV. Finally, nuclear restorers of fertility (Rf) often post-transcriptionally interfere with the products of CMS genes, so that hermaphrodites accumulate less transcript than females
[58],
[59],
[60]. Both
bobt alleles exhibited lower transcript levels in hermaphrodite than in female plants. The much higher transcript level for
bobt_KR than for
bobt_MV in both female and hermaphrodite plants, however, may indicate significant differences in the functionality of these alleles including the possibility that
KR_bobt is a CMS-related gene but
MV_bobt is not. The different genomic context between
bobt_KR and
bobt_MV is likely related to the difference in transcript levels.
bobt_KR is co-transcribed with
cob, which lacks its own promoter, although a start of transcription from a sequence motif located upstream of the
cob coding region, including the unknown region of
bobt, cannot be completely excluded. In contrast,
bobt_MV is not co-transcribed with
cob. Although additional
cob copies may be present in Krasnoyarsk plants, the high transcript level detected at the junction between
bobt_KR and
cob suggests that the co-transcript is a major source of
cob mRNA. We may speculate that co-transcription plays a role in the inability to independently regulate transcription of
cob and
bobt_KR, however, our studies also show that
bobt might be post-transcriptionally regulated independently from
cob. We are curious to know whether the co-transcription may limit the options for new mutations to arise that down-regulate transcription of
bobt_KR and other CMS-associated genes. Co-transcription of CMS-associated genes with essential protein coding genes is not unique to the present study. For example, o
rf456 is co-transcribed with
cox2 in chili pepper
[60],
orf256 is co-transcribed with
cox1 in wheat
[61], and a CMS-related factor is co-transcribed with
nad6 in
Mimulus
[62]. Molecular options for suppression of expression may be more limited when CMS genes are co-transcribed with housekeeping genes.
Our study indicates that highly divergent copies of complex chimeric genes with similar structures as CMS genes are present within the same species, albeit in individuals with wide geographic separation. The ecology and evolution of gynodioecious mating has been studied in
S. vulgaris for many years and multiple studies have remarked on the high level of mitochondrial gene polymorphism
[22],
[44],
[45], especially when compared to hermaphroditic congeners
[22]. Discussion of the source of mitochondrial divergence has been primarily whether it results from high mutation rates within gynodioecious species or balancing selection maintaining divergent copies over long time periods. Our study suggests that gene conversion may also be a process contributing to divergence. Combined with the increasingly convincing evidence for low frequency paternal inheritance of mitochondrial variation
[63], gene conversion between two copies of a duplicated gene provides increasingly complex possibilities for the origins of plant mitochondrial genes. We should not let this complexity divert us from conducting carefully studies to determine the tempo and mode of plant mitochondrial genome evolution.
In summary, we have discovered two divergent allelic versions of a complex mitochondrial chimeric ORFs in different S. vulgaris individuals. Comparisons between homologous regions between the chimeric genes and atp1 are consistent with a history of independent gene conversion events in each lineage. Gene expression also has diverged both in quantity and the co-transcription of an adjacent cob gene in the bobt_KR version. These observations indicate that chimeric genes may persist for sufficiently long periods within species for significant evolution in nucleotide sequence, expression and genomic environment.