ResearchHub | Open Science Community

PopGenome: An Efficient Swiss Army Knife for Population Genomic Analyses in R

Bastian Pfeifer et al., Apr 16, 2014

Although many computer programs can perform population genetics calculations, they are typically limited in the analyses and data input formats they offer; few applications can process the large data sets produced by whole-genome resequencing projects. Furthermore, there is no coherent framework for the easy integration of new statistics into existing pipelines, hindering the development and application of new population genetics and genomics approaches. Here, we present PopGenome, a population genomics package for the R software environment (a de facto standard for statistical analyses). PopGenome can efficiently process genome-scale data as well as large sets of individual loci. It reads DNA alignments and single-nucleotide polymorphism (SNP) data sets in most common formats, including those used by the HapMap, 1000 human genomes, and 1001 Arabidopsis genomes projects. PopGenome also reads associated annotation files in GFF format, enabling users to easily define regions or classify SNPs based on their annotation; all analyses can also be applied to sliding windows. PopGenome offers a wide range of population genetics analyses, including neutrality tests as well as statistics for population differentiation, linkage disequilibrium, and recombination. PopGenome is linked to Hudson's MS and Ewing's MSMS programs to assess statistical significance based on coalescent simulations. PopGenome's integration in R facilitates effortless and reproducible downstream analyses as well as the production of publication-quality graphics. Developers can easily incorporate new analysis methods into the PopGenome framework. PopGenome and R are freely available from CRAN (http://cran.r-project.org/) for all major operating systems under the GNU General Public License.
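
The idea of applying a statistic to sliding windows can be illustrated with a minimal sketch. This is not PopGenome's API (PopGenome is an R package); the function names `nucleotide_diversity` and `sliding_windows` and the toy alignment are hypothetical, and the statistic shown is plain per-site pairwise nucleotide diversity computed on equal-length aligned sequences.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average number of pairwise differences per site (hypothetical helper)."""
    n = len(seqs)
    length = len(seqs[0]) if seqs else 0
    if n < 2 or length == 0:
        return 0.0
    pairs = list(combinations(seqs, 2))
    # Count mismatching positions across every pair of sequences.
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * length)


def sliding_windows(seqs, width, step):
    """Yield (start, diversity) for each full window along the alignment."""
    length = len(seqs[0])
    for start in range(0, length - width + 1, step):
        window = [s[start:start + width] for s in seqs]
        yield start, nucleotide_diversity(window)


# Toy alignment, made up for illustration only.
alignment = ["ACGTACGT", "ACGTACGA", "ACCTACGA"]
results = list(sliding_windows(alignment, width=4, step=4))
```

Here `results` holds one `(start, diversity)` pair per non-overlapping window; choosing a `step` smaller than `width` gives overlapping windows instead.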

Genetics

Demography

Evolview v3: a webserver for visualization, annotation, and management of phylogenetic trees

Balakrishnan Subramanian et al., May 16, 2019

Evolview is an interactive tree visualization tool designed to help researchers visualize phylogenetic trees and annotate them with additional information. It offers users a platform to upload trees in most common tree formats, such as Newick/Phylip, Nexus, Nhx and PhyloXML, and provides a range of visualization options, using fifteen types of custom annotation datasets. The new version of Evolview was designed to provide simple tree uploads, manipulation and viewing options with additional annotation types. The ‘dataset system’ used for visualizing tree information has evolved substantially from the previous version, and the user can draw on a wide range of additional example visualizations. Developments since the last public release include a complete redesign of the user interface, new annotation dataset types, additional tree visualization styles, full-text search of the documentation, and some backend updates. The project management aspect of Evolview was also updated, with a unified approach to tree and project management and sharing. Evolview is freely available at: https://www.evolgenius.info/evolview/.

Artificial Intelligence

Biochemistry

Evolview v2: an online visualization and management tool for customized and annotated phylogenetic trees

Zilong He et al., Apr 30, 2016

Evolview is an online visualization and management tool for customized and annotated phylogenetic trees. It allows users to visualize phylogenetic trees in various formats, customize the trees through built-in functions and user-supplied datasets and export the customization results to publication-ready figures. Its ‘dataset system’ contains not only the data to be visualized on the tree, but also ‘modifiers’ that control various aspects of the graphical annotation. Evolview is a single-page application (like Gmail); its carefully designed interface allows users to upload, visualize, manipulate and manage trees and datasets all in a single webpage. Developments since the last public release include a modern dataset editor with keyword highlighting functionality, seven newly added types of annotation datasets, collaboration support that allows users to share their trees and datasets and various improvements of the web interface and performance. In addition, we included eleven new ‘Demo’ trees to demonstrate the basic functionalities of Evolview, and five new ‘Showcase’ trees inspired by publications to showcase the power of Evolview in producing publication-ready figures. Evolview is freely available at: http://www.evolgenius.info/evolview/.

Genetics

Molecular Biology

Clustering of housekeeping genes provides a unified model of gene order in the human genome

Martin Lercher et al., May 6, 2002

Genetics

Artificial Intelligence

Adaptive evolution of bacterial metabolic networks by horizontal gene transfer

Csaba Pál et al., Nov 20, 2005

Genetics

Molecular Biology

Gene Transfer from Bacteria and Archaea Facilitated Evolution of an Extremophilic Eukaryote

Gerald Schönknecht et al., Mar 7, 2013

Hot, Toxic Eukaryote. Unusually, the single-celled eukaryotic red alga Galdieria sulphuraria can thrive in hot, acidic springs. This organism is endowed with extraordinary metabolic talents and can consume a variety of strange carbohydrates, as well as turn on photosynthesis when the food runs out. Schönknecht et al. (p. 1207; see the Perspective by Rocha) discerned from phylogenetic analysis of its genome that during its evolution, G. sulphuraria appears to have commandeered at least 75 bacterial and archaeal genes by horizontal gene transfer and then applied gene expansion to boost its metabolic repertoire.

Genetics

Ecology

EvolView, an online tool for visualizing, annotating and managing phylogenetic trees

Huangkai Zhang et al., Jun 12, 2012

EvolView is a web application for visualizing, annotating and managing phylogenetic trees. First, EvolView is a phylogenetic tree viewer and customization tool; it visualizes trees in various formats, customizes them through built-in functions that can link information from external datasets, and exports the customized results to publication-ready figures. Second, EvolView is a tree and dataset management tool: users can easily organize related trees into distinct projects, add new datasets to trees and edit and manage existing trees and datasets. EvolView is equipped with an intuitive user interface to make it easy to use. With a free account, users can save data and manipulations on the EvolView server. EvolView is freely available at: http://www.evolgenius.info/evolview.html.

Biochemistry

Paleontology

The ECAT HRRT: performance and first clinical application of the new high resolution research tomograph

K. Wienhard et al., Feb 1, 2002

The ECAT HRRT is a three-dimensional (3-D) only dedicated brain tomograph employing the new scintillator lutetium-oxy-orthosilicate (LSO) and using depth of interaction (DOI) information to achieve uniform isotropic resolution across a 20-cm diameter volume. With its unique technological innovations it represents the prototype of a new generation of high-resolution brain tomographs. The physical performance with respect to count rate, live time, scatter, sensitivity, and resolution was evaluated with phantom studies and measurements with a point source. The HRRT's imaging performance was tested with phantoms and fluorodeoxyglucose (FDG) scans performed in animal and human brains. We find that due to the significantly improved resolution and the large solid angle covered by the panel detectors, several issues that have been adequately solved for older generation scanners demand new attention for the HRRT, such as acquiring and handling large amounts of data effectively, strategies for optimal reconstruction, shielding, and correction of random coincidences.

Artificial Intelligence

Atomic And Molecular Physics, And Optics

An mRNA Blueprint for C4 Photosynthesis Derived from Comparative Transcriptomics of Closely Related C3 and C4 Species

Andrea Bräutigam et al., Jun 11, 2010

C4 photosynthesis involves alterations to the biochemistry, cell biology, and development of leaves. Together, these modifications increase the efficiency of photosynthesis, and despite the apparent complexity of the pathway, it has evolved at least 45 times independently within the angiosperms. To provide insight into the extent to which gene expression is altered between C3 and C4 leaves, and to identify candidates associated with the C4 pathway, we used massively parallel mRNA sequencing of closely related C3 (Cleome spinosa) and C4 (Cleome gynandra) species. Gene annotation was facilitated by the phylogenetic proximity of Cleome and Arabidopsis (Arabidopsis thaliana). Up to 603 transcripts differ in abundance between these C3 and C4 leaves. These include 17 transcription factors, putative transport proteins, as well as genes that in Arabidopsis are implicated in chloroplast movement and expansion, plasmodesmatal connectivity, and cell wall modification. These are all characteristics known to alter in a C4 leaf but that previously had remained undefined at the molecular level. We also document large shifts in overall transcription profiles for selected functional classes. Our approach defines the extent to which transcript abundance in these C3 and C4 leaves differs, provides a blueprint for the NAD-malic enzyme C4 pathway operating in a dicotyledon, and furthermore identifies potential regulators. We anticipate that comparative transcriptomics of closely related species will provide deep insight into the evolution of other complex traits.

Genetics

Molecular Biology

The substrate scopes of enzymes: a general prediction model based on machine and deep learning

Alexander Kroll et al., May 25, 2022

For a comprehensive understanding of metabolism, it is necessary to know all potential substrates for each enzyme encoded in an organism’s genome. However, for most proteins annotated as enzymes, it is unknown which primary and/or secondary reactions they catalyze [1], as experimental characterizations are time-consuming and costly. Machine learning predictions could provide an efficient alternative, but are hampered by a lack of information regarding enzyme non-substrates, as available training data comprises mainly positive examples. Here, we present ESP, a general machine learning model for the prediction of enzyme-substrate pairs, with an accuracy of over 90% on independent and diverse test data. This accuracy was achieved by representing enzymes through a modified transformer model [2] with a trained, task-specific token, and by augmenting the positive training data by randomly sampling small molecules and assigning them as non-substrates. ESP can be applied successfully across widely different enzymes and a broad range of metabolites. It outperforms recently published models designed for individual, well-studied enzyme families, which use much more detailed input data [3, 4]. We implemented a user-friendly web server to predict the substrate scope of arbitrary enzymes, which may support not only basic science, but also the development of pharmaceuticals and bioengineering processes.
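
The augmentation step mentioned in the abstract (randomly sampling small molecules and labelling them as non-substrates) can be sketched roughly as follows. This is an illustrative assumption, not the ESP implementation: the abstract does not say how candidate molecules are chosen, and the function name `augment_with_negatives`, its parameters, and the toy enzyme and molecule names are all hypothetical.

```python
import random


def augment_with_negatives(positives, molecules, per_positive=1, seed=0):
    """Add randomly sampled (enzyme, molecule, 0) pairs to known positives.

    positives: list of (enzyme, substrate) pairs known to be true.
    molecules: pool of small molecules to sample non-substrates from.
    """
    rng = random.Random(seed)
    known = set(positives)
    negatives = set()
    examples = [(enzyme, mol, 1) for enzyme, mol in positives]
    for enzyme, _ in positives:
        # Exclude known substrates and negatives already drawn for this enzyme.
        candidates = sorted(
            m for m in molecules
            if (enzyme, m) not in known and (enzyme, m) not in negatives
        )
        k = min(per_positive, len(candidates))
        for mol in rng.sample(candidates, k):
            negatives.add((enzyme, mol))
            examples.append((enzyme, mol, 0))
    return examples


# Toy data, made up for illustration only.
positives = [("E1", "glucose"), ("E2", "ATP")]
molecules = ["glucose", "ATP", "pyruvate"]
training = augment_with_negatives(positives, molecules)
```

A sampled "non-substrate" is only presumed negative, since an untested molecule could still be a true substrate; the sketch does nothing to guard against that.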

Artificial Intelligence

Biochemistry
