A De-Novo Genome Analysis Pipeline (DeNoGAP) for large-scale comparative prokaryotic genomics studies

Thakur S, Guttman DS

BMC Bioinformatics 2016;17(1):260

PMID: 27363390

Abstract

BACKGROUND: Comparative analysis of whole genome sequence data from closely related prokaryotic species or strains is becoming an increasingly important and accessible approach for addressing both fundamental and applied biological questions. While there are number of excellent tools developed for performing this task, most scale poorly when faced with hundreds of genome sequences, and many require extensive manual curation.

RESULTS: We have developed a de-novo genome analysis pipeline (DeNoGAP) for the automated, iterative and high-throughput analysis of data from comparative genomics projects involving hundreds of whole genome sequences. The pipeline is designed to perform reference-assisted and de novo gene prediction, homolog protein family assignment, ortholog prediction, functional annotation, and pan-genome analysis using a range of proven tools and databases. While most existing methods scale quadratically with the number of genomes since they rely on pairwise comparisons among predicted protein sequences, DeNoGAP scales linearly since the homology assignment is based on iteratively refined hidden Markov models. This iterative clustering strategy enables DeNoGAP to handle a very large number of genomes using minimal computational resources. Moreover, the modular structure of the pipeline permits easy updates as new analysis programs become available.

CONCLUSION: DeNoGAP integrates bioinformatics tools and databases for comparative analysis of a large number of genomes. The pipeline offers tools and algorithms for annotation and analysis of completed and draft genome sequences. The pipeline is developed using Perl, BioPerl and SQLite on Ubuntu Linux version 12.04 LTS. Currently, the software package accompanies script for automated installation of necessary external programs on Ubuntu Linux; however, the pipeline should be also compatible with other Linux and Unix systems after necessary external programs are installed. DeNoGAP is freely available at https://sourceforge.net/projects/denogap/ .

Genomic screens identify a new phytobacterial microbe-associated molecular pattern and the cognate Arabidopsis receptor-like kinase that mediates its immune elicitation

Mott GA, Thakur S, Smakowska E, Wang PW, Belkhadir Y, Desveaux D, Guttman DS

Genome Biol. 2016;17:98

PMID: 27160854

Abstract

BACKGROUND: The recognition of microbe-associated molecular patterns during infection is central to the mounting of an effective immune response. In spite of their importance, it remains difficult to identify these molecules and the host receptors required for their perception, ultimately limiting our understanding of the role of these molecules in the evolution of host-pathogen relationships.

RESULTS: We employ a comparative genomics screen to identify six new immune eliciting peptides from the phytopathogenic bacterium Pseudomonas syringae. We then perform a reverse genetic screen to identify Arabidopsis thaliana leucine-rich repeat receptor-like kinases required for the recognition of these elicitors. We test the six elicitors on 187 receptor-like kinase knock-down insertion lines using a high-throughput peroxidase-based immune assay and identify multiple lines that show decreased immune responses to specific peptides. From this primary screen data, we focused on the interaction between the xup25 peptide from a bacterial xanthine/uracil permease and the Arabidopsis receptor-like kinase xanthine/uracil permease sensing 1; a family XII protein closely related to two well-characterized receptor-like kinases. We show that xup25 treatment increases pathogenesis-related gene induction, callose deposition, seedling growth inhibition, and resistance to virulent bacteria, all in a xanthine/uracil permease sensing 1-dependent manner. Finally, we show that this kinase-like receptor can bind the xup25 peptide directly. These results identify xup25 as a P. syringae microbe-associated molecular pattern and xanthine/uracil permease sensing 1 as a receptor-like kinase that detects the xup25 epitope to activate immune responses.

CONCLUSIONS: The present study demonstrates an efficient method to identify immune elicitors and the plant receptors responsible for their perception. Further exploration of these molecules will increase our understanding of plant-pathogen interactions and the basis for host specificity.

The auxin response factor MONOPTEROS controls meristem function and organogenesis in both the shoot and root through the direct regulation of PIN genes

Krogan NT, Marcos D, Weiner AI, Berleth T

New Phytol. 2016 Jul;

PMID: 27441727

Abstract

The regulatory effect auxin has on its own transport is critical in numerous self-organizing plant patterning processes. However, our understanding of the molecular mechanisms linking auxin signal transduction and auxin transport is still fragmentary, and important regulatory genes remain to be identified. To track a key link between auxin signaling and auxin transport in development, we established an Arabidopsis thaliana genetic background in which fundamental patterning processes in both shoot and root were essentially abolished and the expression of PIN FORMED (PIN) auxin efflux facilitators was dramatically reduced. In this background, we demonstrate that activating a steroid-inducible variant of the auxin response factor (ARF) MONOPTEROS (MP) is sufficient to restore patterning and PIN gene expression. Further, we show that MP binds to distinct promoter elements of multiple genetically defined PIN genes. Our work identifies a direct regulatory link between central, well-characterized genes involved in auxin signal transduction and auxin transport. The steroid-inducible MP system directly demonstrates the importance of this molecular link in multiple patterning events in embryos, shoots and roots, and provides novel options for interrogating the properties of self-regulated auxin-based patterning in planta.

High fecal IgA is associated with reduced Clostridium difficile colonization in infants

Bridgman SL, Konya T, Azad MB, Guttman DS, Sears MR, Becker AB, Turvey SE, Mandhane PJ, Subbarao P, , Scott JA, Field CJ, Kozyrskyj AL

Microbes Infect. 2016 May;

PMID: 27235197

Abstract

Colonization of infants with Clostridium difficile is on the rise. Although better tolerated by infants than adults, it is a risk factor for future allergic disease. The present study describes associations between infant fecal immunoglobulin A (IgA) and colonization with C. difficile in 47 infants enrolled in the Canadian Healthy Infant Longitudinal Development (CHILD) study. C. difficile colonization was observed in over half (53%) of the infants. Median IgA was lower in infants colonized with C. difficile (10.9 μg versus 25.5 μg per g protein; p = 0.18). A smaller proportion of infants with IgA in the highest tertile were colonized with C. difficile compared to the other tertiles (31.3% versus 64.5%, p = 0.03). In unadjusted analysis, odds of colonization with C. difficile was reduced by 75% (OR 0.25 95% CI 0.07, 0.91 p = 0.04) among infants with IgA in the highest tertile compared to those in the other tertiles. Following adjustment for parity, birth mode and breastfeeding, this association was even stronger (aOR 0.17, 95% CI 0.03, 0.94, p = 0.04). Our study provides evidence that high fecal IgA, independent of breastfeeding, is associated with reduced likelihood of C. difficile colonization in infancy.

Prophages mediate defense against phage infection through diverse mechanisms

Bondy-Denomy J, Qian J, Westra ER, Buckling A, Guttman DS, Davidson AR, Maxwell KL

ISME J 2016 Jun;

PMID: 27258950

Abstract

The activity of bacteriophages poses a major threat to bacterial survival. Upon infection, a temperate phage can either kill the host cell or be maintained as a prophage. In this state, the bacteria carrying the prophage is at risk of superinfection, where another phage injects its genetic material and competes for host cell resources. To avoid this, many phages have evolved mechanisms that alter the bacteria and make it resistant to phage superinfection. The mechanisms underlying these phentoypic conversions and the fitness consequences for the host are poorly understood, and systematic studies of superinfection exclusion mechanisms are lacking. In this study, we examined a wide range of Pseudomonas aeruginosa phages and found that they mediate superinfection exclusion through a variety of mechanisms, some of which affected the type IV pilus and O-antigen, and others that functioned inside the cell. The strongest resistance mechanism was a surface modification that we showed is cost-free for the bacterial host in a natural soil environment and in a Caenorhabditis. elegans infection model. This study represents the first systematic approach to address how a population of prophages influences phage resistance and bacterial behavior in P. aeruginosa.The ISME Journal advance online publication, 3 June 2016; doi:10.1038/ismej.2016.79.

Population genomic scans suggest novel genes underlie convergent flowering time evolution in the introduced range of Arabidopsis thaliana

Gould BA, Stinchcombe JR

Mol. Ecol. 2016 Apr;

PMID: 27064998

Abstract

A long-standing question in evolutionary biology is whether the evolution of convergent phenotypes results from selection on the same heritable genetic components. Using whole genome sequencing and genome scans, we tested whether the evolution of parallel longitudinal flowering time clines in the native and introduced ranges of Arabidopsis thaliana has a similar genetic basis. We found that common variants of large effect on flowering time in the native range do not appear to have been under recent strong selection in the introduced range. We identified a set of 38 new candidate genes that are putatively linked to the evolution of flowering time. A high degree of conditional neutrality of flowering time variants between the native and introduced range may preclude parallel evolution at the level of genes. Overall, neither gene pleiotropy nor available standing genetic variation appears to have restricted the evolution of flowering time to high frequency variants from the native range or to known flowering time pathway genes. This article is protected by copyright. All rights reserved.

New BAR Tools for Mining Expression Data and Exploring Cis-Elements in Arabidopsis thaliana

Austin RS, Hiu S, Waese J, Ierullo M, Pasha A, Wang T, Fan J, Foong C, Breit R, Desveaux D, Moses A, Provart NJ

Plant J. 2016 Jul;

PMID: 27401965

Abstract

Identifying sets of genes that are specifically expressed in certain tissues or in response to an environmental stimulus is useful for designing reporter constructs, generating gene expression markers, or for understanding gene regulatory networks. We have developed an easy-to-use online tool for defining a desired expression profile (a modification of our Expression Angler program), which can then be used to identify genes exhibiting patterns of expression that match this profile as closely as possible. Further, we have developed another online tool, Cistome, for predicting or exploring cis-elements in the promoters of sets of co-expressed genes identified by such a method, or by other methods. We present two use cases for these tools, which are freely available on the Bio-Analytic Resource at http://BAR.utoronto.ca. This article is protected by copyright. All rights reserved.

The ERF11 Transcription Factor Promotes Internode Elongation by Activating Gibberellin Biosynthesis and Signaling

Zhou X, Zhang ZL, Park J, Tyler L, Yusuke J, Qiu K, Nam EA, Lumba S, Desveaux D, McCourt P, Kamiya Y, Sun TP

Plant Physiol. 2016 Aug;171(4):2760-70

PMID: 27255484

Abstract

The phytohormone gibberellin (GA) plays a key role in promoting stem elongation in plants. Previous studies show that GA activates its signaling pathway by inducing rapid degradation of DELLA proteins, GA signaling repressors. Using an activation-tagging screen in a reduced-GA mutant ga1-6 background, we identified AtERF11 to be a novel positive regulator of both GA biosynthesis and GA signaling for internode elongation. Overexpression of AtERF11 partially rescued the dwarf phenotype of ga1-6 AtERF11 is a member of the ERF (ETHYLENE RESPONSE FACTOR) subfamily VIII-B-1a of ERF/AP2 transcription factors in Arabidopsis (Arabidopsis thaliana). Overexpression of AtERF11 resulted in elevated bioactive GA levels by up-regulating expression of GA3ox1 and GA20ox genes. Hypocotyl elongation assays further showed that overexpression of AtERF11 conferred elevated GA response, whereas loss-of-function erf11 and erf11 erf4 mutants displayed reduced GA response. In addition, yeast two-hybrid, coimmunoprecipitation, and transient expression assays showed that AtERF11 enhances GA signaling by antagonizing the function of DELLA proteins via direct protein-protein interaction. Interestingly, AtERF11 overexpression also caused a reduction in the levels of another phytohormone ethylene in the growing stem, consistent with recent finding showing that AtERF11 represses transcription of ethylene biosynthesis ACS genes. The effect of AtERF11 on promoting GA biosynthesis gene expression is likely via its repressive function on ethylene biosynthesis. These results suggest that AtERF11 plays a dual role in promoting internode elongation by inhibiting ethylene biosynthesis and activating GA biosynthesis and signaling pathways.

The HopF family of Pseudomonas syringae type III secreted effectors

Lo T, Koulena N, Seto D, Guttman DS, Desveaux D

Mol. Plant Pathol. 2016 Apr;

PMID: 27061875

Abstract

Pseudomonas syringae is a bacterial phytopathogen that utilizes the type III secretion system to inject effector proteins into plant host cells. P. syringae can infect a wide range of plant hosts including agronomically-important crops such as tomatoes and beans. The ability of P. syringae to infect such numerous hosts is due, in part, to the diversity of effectors employed by this phytopathogen. Over 60 different effector families exist in P. syringae, one such family is HopF, which contains over 100 distinct alleles. Despite this diversity, research has focused on only two members of this family: HopF1 from P. syringae pathovar phaseolicola 1449B and HopF2 from P. syringae pathovar tomato DC3000. In this study, we review the research on HopF family members including their host targets and molecular mechanisms of immunity suppression, and their enzymatic function. We also provide a phylogenetic analysis of this expanding effector family which provides a basis for a proposed nomenclature to guide future research. The extensive genetic diversity that exists within the HopF family presents a great opportunity to study how functional diversification on an effector family contributes to host specialization. This article is protected by copyright. All rights reserved.

A Human “eFP” Browser for Generating Gene Expression Anatograms

Patel RV, Hamanishi ET, Provart NJ

PLoS ONE 2016;11(3):e0150982

PMID: 26954504

Abstract

Transcriptomic studies help to further our understanding of gene function. Human transcriptomic studies tend to focus on a particular subset of tissue types or a particular disease state; however, it is possible to collate into a compendium multiple studies that have been profiled using the same expression analysis platform to provide an overview of gene expression levels in many different tissues or under different conditions. In order to increase the knowledge and understanding we gain from such studies, intuitive visualization of gene expression data in such a compendium can be useful. The Human eFP (“electronic Fluorescent Pictograph”) Browser presented here is a tool for intuitive visualization of large human gene expression data sets on pictographic representations of the human body as gene expression “anatograms”. Pictographic representations for new data sets may be generated easily. The Human eFP Browser can also serve as a portal to other gene-specific information through link-outs to various online resources.