Publications

Filters: First Letter Of Last Name is X  [Clear All Filters]
A B C D E F G H I J K L M N O P Q R S T U V W [X] Y Z   [Show ALL]
A
Xu, JH, Messing J.  2009.  Amplification of prolamin storage protein genes in different subfamilies of the Poaceae. Theor Appl Genet. AbstractWebsite
Prolamins are seed storage proteins in cereals and represent an important source of essential amino acids for feed and food. Genes encoding these proteins resulted from dispersed and tandem amplification. While previous studies have concentrated on protein sequences from different grass species, we now can add a new perspective to their relationships by asking how their genes are shared by ancestry and copied in different lineages of the same family of species. These differences are derived from alignment of chromosomal regions, where collinearity is used to identify prolamin genes in syntenic positions, also called orthologous gene copies. New or paralogous gene copies are inserted in tandem or new locations of the same genome. More importantly, one can detect the loss of older genes. We analyzed chromosomal intervals containing prolamin genes from rice, sorghum, wheat, barley, and Brachypodium, representing different subfamilies of the Poaceae. The Poaceae commonly known as the grasses includes three major subfamilies, the Ehrhartoideae (rice), Pooideae (wheat, barley, and Brachypodium), and Panicoideae (millets, maize, sorghum, and switchgrass). Based on chromosomal position and sequence divergence, it becomes possible to infer the order of gene amplification events. Furthermore, the loss of older genes in different subfamilies seems to permit a faster pace of divergence of paralogous genes. Change in protein structure affects their physical properties, subcellular location, and amino acid composition. On the other hand, regulatory sequence elements and corresponding transcriptional activators of new gene copies are more conserved than coding sequences, consistent with the tissue-specific expression of these genes.
Murat, F, Xu JH, Tannier E, Abrouk M, Guilhot N, Pont C, Messing J, Salse J.  2010.  Ancestral grass karyotype reconstruction unravels new mechanisms of genome shuffling as a source of plant evolution. Genome Res. 20:1545-57. AbstractWebsite
The comparison of the chromosome numbers of today's species with common reconstructed paleo-ancestors has led to intense speculation of how chromosomes have been rearranged over time in mammals. However, similar studies in plants with respect to genome evolution as well as molecular mechanisms leading to mosaic synteny blocks have been lacking due to relevant examples of evolutionary zooms from genomic sequences. Such studies require genomes of species that belong to the same family but are diverged to fall into different subfamilies. Our most important crops belong to the family of the grasses, where a number of genomes have now been sequenced. Based on detailed paleogenomics, using inference from n = 5-12 grass ancestral karyotypes (AGKs) in terms of gene content and order, we delineated sequence intervals comprising a complete set of junction break points of orthologous regions from rice, maize, sorghum, and Brachypodium genomes, representing three different subfamilies and different polyploidization events. By focusing on these sequence intervals, we could show that the chromosome number variation/reduction from the n = 12 common paleo-ancestor was driven by nonrandom centric double-strand break repair events. It appeared that the centromeric/telomeric illegitimate recombination between nonhomologous chromosomes led to nested chromosome fusions (NCFs) and synteny break points (SBPs). When intervals comprising NCFs were compared in their structure, we concluded that SBPs (1) were meiotic recombination hotspots, (2) corresponded to high sequence turnover loci through repeat invasion, and (3) might be considered as hotspots of evolutionary novelty that could act as a reservoir for producing adaptive phenotypes.
D
Miclaus, M, Xu JH, Messing J.  2011.  Differential gene expression and epiregulation of alpha zein gene copies in maize haplotypes. PLoS Genet. 7:e1002131. AbstractWebsite
Multigenic traits are very common in plants and cause diversity. Nutritional quality is such a trait, and one of its factors is the composition and relative expression of storage protein genes. In maize, they represent a medium-size gene family distributed over several chromosomes and unlinked locations. Two inbreds, B73 and BSSS53, both from the Iowa Stiff Stock Synthetic collection, have been selected to analyze allelic and non-allelic variability in these regions that span between 80-500 kb of chromosomal DNA. Genes were copied to unlinked sites before and after allotetraploidization of maize, but before transposition enlarged intergenic regions in a haplotype-specific manner. Once genes are copied, expression of donor genes is reduced relative to new copies. Epigenetic regulation seems to contribute to silencing older copies, because some of them can be reactivated when endosperm is maintained as cultured cells, indicating that copy number variation might contribute to a reserve of gene copies. Bisulfite sequencing of the promoter region also shows different methylation patterns among gene clusters as well as differences between tissues, suggesting a possible position effect on regulatory mechanisms as a result of inserting copies at unlinked locations. The observations offer a potential paradigm for how different gene families evolve and the impact this has on their expression and regulation of their members.
Xu, J-H, Messing J.  2008.  Diverged Copies of the Seed Regulatory Opaque-2 Gene by a Segmental Duplication in the Progenitor Genome of Rice, Sorghum, and Maize. Mol Plant %R 10.1093/mp/ssn038. 1:760-769. AbstractWebsite
Comparative analyses of the sequence of entire genomes have shown that gene duplications, chromosomal segmental duplications, or even whole genome duplications (WGD) have played prominent roles in the evolution of many eukaryotic species. Here, we used the ancient duplication of a well known transcription factor in maize, encoded by the Opaque-2 (O2) locus, to examine the general features of divergences of chromosomal segmental duplications in a lineage-specific manner. We took advantage of contiguous chromosomal sequence information in rice (Oryza sativa, Nipponbare), sorghum (Sorghum bicolor, Btx623), and maize (Zea mays, B73) that were aligned by conserved gene order (synteny). This analysis showed that the maize O2 locus is contained within a 1.25 million base-pair (Mb) segment on chromosome 7, which was duplicated {approx}56 million years ago (mya) before the split of rice and maize 50 mya. The duplicated region on chromosome 1 is only half the size and contains the maize OHP gene, which does not restore the o2 mutation although it encodes a protein with the same DNA and protein binding properties in endosperm. The segmental duplication is not only found in rice, but also in sorghum, which split from maize 11.9 mya. A detailed analysis of the duplicated regions provided examples for complex rearrangements including deletions, duplications, conversions, inversions, and translocations. Furthermore, the rice and sorghum genomes appeared to be more stable than the maize genome, probably because maize underwent allotetraploidization and then diploidization.
Xu, JH, Bennetzen JL, Messing J.  2012.  Dynamic gene copy number variation in collinear regions of grass genomes. Mol Biol Evol. 29:861-71. AbstractWebsite
A salient feature of genomes of higher organisms is the birth and death of gene copies. An example is the alpha prolamin genes, which encode seed storage proteins in grasses (Poaceae) and represent a medium-size gene family. To better understand the mechanism, extent, and pace of gene amplification, we compared prolamin gene copies in the genomes of two different tribes in the Panicoideae, the Paniceae and the Andropogoneae. We identified alpha prolamin (setarin) gene copies in the diploid foxtail millet (Paniceae) genome (490 Mb) and compared them with orthologous regions in diploid sorghum (730 Mb) and ancient allotetraploid maize (2,300 Mb) (Andropogoneae). Because sequenced genomes of other subfamilies of Poaceae like rice (389 Mb) (Ehrhartoideae) and Brachypodium (272 Mb) (Pooideae) do not have alpha prolamin genes, their collinear regions can serve as "empty" reference sites. A pattern emerged, where genes were copied and inserted into other chromosomal locations followed by additional tandem duplications (clusters). We observed both recent (species-specific) insertion events and older ones that are shared by these tribes. Many older copies were deleted by unequal crossing over of flanking sequences or damaged by truncations. However, some remain intact with active and inactive alleles. These results indicate that genomes reflect only a snapshot of the gene content of a species and are far less static than conventional genetics has suggested. Nucleotide substitution rates for active alpha prolamins genes were twice as high as for low copy number beta, gamma, and delta prolamin genes, suggesting that gene amplification accelerates the pace of divergence.
M
Miclaus, M, Wu Y, Xu JH, Dooner HK, Messing J.  2011.  The maize high-lysine mutant opaque7 is defective in an acyl-CoA synthetase-like protein. Genetics. 189:1271-80. AbstractWebsite
Maize (Zea mays) has a large class of seed mutants with opaque or nonvitreous endosperms that could improve the nutritional quality of our food supply. The phenotype of some of them appears to be linked to the improper formation of protein bodies (PBs) where zein storage proteins are deposited. Although a number of genes affecting endosperm vitreousness have been isolated, it has been difficult to clone opaque7 (o7), mainly because of its low penetrance in many genetic backgrounds. The o7-reference (o7-ref) mutant arose spontaneously in a W22 inbred, but is poorly expressed in other lines. We report here the isolation of o7 with a combination of map-based cloning and transposon tagging. We first identified an o7 candidate gene by map-based cloning. The putative o7-ref allele has a 12-bp in-frame deletion of codons 350-353 in a 528-codon-long acyl-CoA synthetase-like gene (ACS). We then confirmed this candidate gene by generating another mutant allele from a transposon-tagging experiment using the Activator/Dissociation (Ac/Ds) system in a W22 background. The second allele, isolated from approximately 1 million gametes, presented a 2-kb Ds insertion that resembles the single Ds component of double-Ds, McClintock's original Dissociation element, at codon 496 of the ACS gene. PBs exhibited striking membrane invaginations in the o7-ref allele and a severe number reduction in the Ds-insertion mutant, respectively. We propose a model in which the ACS enzyme plays a key role in membrane biogenesis, by taking part in protein acylation, and that altered PBs render the seed nonvitreous.
O
Xu, JH, Messing J.  2008.  Organization of the prolamin gene family provides insight into the evolution of the maize genome and gene duplications in grass species. Proc Natl Acad Sci U S A. 105:14330-5. AbstractWebsite
Zea mays, commonly known as corn, is perhaps the most greatly produced crop in terms of tonnage and a major food, feed, and biofuel resource. Here we analyzed its prolamin gene family, encoding the major seed storage proteins, as a model for gene evolution by syntenic alignments with sorghum and rice, two genomes that have been sequenced recently. Because a high-density gene map has been constructed for maize inbred B73, all prolamin gene copies can be identified in their chromosomal context. Alignment of respective chromosomal regions of these species via conserved genes allow us to identify the pedigree of prolamin gene copies in space and time. Its youngest and largest gene family, the alpha prolamins, arose about 22-26 million years ago (Mya) after the split of the Panicoideae (including maize, sorghum, and millet) from the Pooideae (including wheat, barley, and oats) and Oryzoideae (rice). The first dispersal of alpha prolamin gene copies occurred before the split of the progenitors of maize and sorghum about 11.9 Mya. One of the two progenitors of maize gained a new alpha zein locus, absent in the other lineage, to form a nonduplicated locus in maize after allotetraplodization about 4.8 Mya. But dispersed copies gave rise to tandem duplications through uneven expansion and gene silencing of this gene family in maize and sorghum, possibly because of maize's greater recombination and mutation rates resulting from its diploidization process. Interestingly, new gene loci in maize represent junctions of ancestral chromosome fragments and sites of new centromeres in sorghum and rice.