ResearchHub | Open Science Community

Daisuke Kihara

Author with expertise in Protein Structure Prediction and Analysis

Achievements

Cited Author

Open Access Advocate

Key Stats

Upvotes received:

Publications:

(65% Open Access)

Cited by:

1,361

h-index:

i10-index:

184

Reputation

Biology

< 1%

Chemistry

< 1%

Economics

< 1%

How is this calculated?

Publications

A large-scale evaluation of computational protein function prediction

Predrag Radivojac et al.Jan 27, 2013

Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools.

Genetics

Artificial Intelligence

Paper

Genetics

934

Save

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Yuxiang Jiang et al.Sep 7, 2016

A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2.The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.

Paper

Save

Protein Contact Map Denoising Using Generative Adversarial Networks

Sai Subramaniya et al.Jun 27, 2020

ABSTRACT Protein residue-residue contact prediction from protein sequence information has undergone substantial improvement in the past few years, which has made it a critical driving force for building correct protein tertiary structure models. Improving accuracy of contact predictions has, therefore, become the forefront of protein structure prediction. Here, we show a novel contact map denoising method, ContactGAN, which uses Generative Adversarial Networks (GAN) to refine predicted protein contact maps. ContactGAN was able to make a consistent and significant improvement over predictions made by recent contact prediction methods when tested on two datasets including protein structure modeling targets in CASP13. ContactGAN will be a valuable addition in the structure prediction pipeline to achieve an extra gain in contact prediction accuracy.

Artificial Intelligence

Biochemistry

Paper

Artificial Intelligence

Save

Outcomes of the 2019 EMDataResource model challenge: validation of cryo-EM models at near-atomic resolution

Catherine Lawson et al.Jun 15, 2020

Abstract This paper describes outcomes of the 2019 Cryo-EM Map-based Model Metrics Challenge sponsored by EMDataResource ( www.emdataresource.org ). The goals of this challenge were (1) to assess the quality of models that can be produced using current modeling software, (2) to check the reproducibility of modeling results from different software developers and users, and (3) compare the performance of current metrics used for evaluation of models. The focus was on near-atomic resolution maps with an innovative twist: three of four target maps formed a resolution series (1.8 to 3.1 Å) from the same specimen and imaging experiment. Tools developed in previous challenges were expanded for managing, visualizing and analyzing the 63 submitted coordinate models, and several novel metrics were introduced. The results permit specific recommendations to be made about validating near-atomic cryo-EM structures both in the context of individual laboratory experiments and holdings of structure data archives such as the Protein Data Bank. Our findings demonstrate the relatively high accuracy and reproducibility of cryo-EM models derived from these benchmark maps by 13 participating teams, representing both widely used and novel modeling approaches. We also evaluate the pros and cons of the commonly used metrics to assess model quality and recommend the adoption of multiple scoring parameters to provide full and objective annotation and assessment of the model, reflective of the observed density in the cryo-EM map.

Philosophy

Artificial Intelligence

Paper

Philosophy

Save

Outcomes of the EMDataResource cryo-EM Ligand Modeling Challenge

Catherine Lawson et al.Jun 25, 2024

Artificial Intelligence

Biochemistry

Paper

Artificial Intelligence

Save

Real-Time Structure Search and Structure Classification for AlphaFold Protein Models

Tunde Aderinwale et al.Oct 23, 2021

Abstract AlphaFold2 showed a substantial improvement in the accuracy of protein structure prediction. Following the release of the software, whole-proteome protein structure predictions by AlphaFold2 for 21 organisms were made publicly available. Here, we developed the infrastructure, 3D-AF-Surfer, to enable real-time structure-based search for the AlphaFold2 models by combining molecular surface representation with 3D Zernike descriptors and deep neural networks.

Artificial Intelligence

Biochemistry

Paper

Artificial Intelligence

Save

OC_Finder: A deep learning-based software for osteoclast segmentation, counting, and classification

Xiao Wang et al.Oct 26, 2021

Abstract Osteoclasts are multinucleated cells that exclusively resorb bone matrix proteins and minerals on the bone surface. They differentiate from monocyte/macrophage-lineage cells in the presence of osteoclastogenic cytokines such as the receptor activator of nuclear factor-κB ligand (RANKL) and are stained positive for tartrate-resistant acid phosphatase (TRAP). In vitro, osteoclast formation assays are commonly used to assess the capacity of osteoclast precursor cells for differentiating into osteoclasts wherein the number of TRAP-positive multinucleated cells are counted as osteoclasts. Osteoclasts are manually identified on cell culture dishes by human eyes, which is a labor-intensive process. Moreover, the manual procedure is not objective and result in lack of reproducibility. To accelerate the process and reduce the workload for counting the number of osteoclasts, we developed OC_Finder, a fully automated system for identifying osteoclasts in microscopic images. OC_Finder consists of segmentation and classification steps. OC_Finder detected osteoclasts differentiated from wild-type and Sh3bp2 KI/+ precursor cells at a 99.4% accuracy for segmentation and at a 98.1% accuracy for classification. The number of osteoclasts classified by OC_Finder was at the same accuracy level with manual counting by a human expert. Together, successful development of OC_Finder suggests that deep learning is a useful tool to perform prompt and accurate unbiased classification and detection of specific cell types in microscopic images.

Artificial Intelligence

Biochemistry

Paper

Artificial Intelligence

Save

Emap2sec+: Detecting Protein and DNA/RNA Structures in Cryo-EM Maps of Intermediate Resolution Using Deep Learning

Xiao Wang et al.Aug 23, 2020

Abstract An increasing number of density maps of macromolecular structures, including proteins and protein and DNA/RNA complexes, have been determined by cryo-electron microscopy (cryo-EM). Although lately maps at a near-atomic resolution are routinely reported, there are still substantial fractions of maps determined at intermediate or low resolutions, where extracting structure information is not trivial. Here, we report a new computational method, Emap2sec+, which identifies DNA or RNA as well as the secondary structures of proteins in cryo-EM maps of 5 to 10 Å resolution. Emap2sec+ employs the deep Residual convolutional neural network. Emap2sec+ assigns structural labels with associated probabilities at each voxel in a cryo-EM map, which will help structure modeling in an EM map. Emap2sec+ showed stable and high assignment accuracy for nucleotides in low resolution maps and improved performance for protein secondary structure assignments than its earlier version when tested on simulated and experimental maps.

Genetics

Artificial Intelligence

Paper

Genetics

Save

AttentiveDist: Protein Inter-Residue Distance Prediction Using Deep Learning with Attention on Quadruple Multiple Sequence Alignments

Aashish Jain et al.Nov 25, 2020

ABSTRACT Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To determine the importance of each MSA’s feature at the inter-residue level, we added an attention layer to the deep neural network. The model is trained in a multi-task fashion to also predict backbone and orientation angles further improving the inter-residue distance prediction. We show that AttentiveDist outperforms the top methods for contact prediction in the CASP13 structure prediction competition. To aid in structure modeling we also developed two new deep learning-based sidechain center distance and peptide-bond nitrogen-oxygen distance prediction models. Together these led to a 12% increase in TM-score from the best server method in CASP13 for structure prediction.

Artificial Intelligence

Biochemistry

Paper

Artificial Intelligence

Save

A Remarkable Genetic Shift in a Transmitted/Founder Virus Broadens Antibody Responses Against HIV-1

Swati Jain et al.Jun 16, 2021

Abstract A productive HIV-1 infection in humans is often established by transmission and propagation of a single transmitted/founder (T/F) virus, which then evolves into a complex mixture of variants during the lifetime of infection. An effective HIV-1 vaccine should elicit broad immune responses in order to block the entry of diverse T/F viruses. Currently, no such vaccine exists. An in-depth study of escape variants emerging under host immune pressure during very early stages of infection might provide insights into such a HIV-1 vaccine design. Here, in a rare longitudinal study involving HIV-1 infected individuals just days after infection in the absence of antiretroviral therapy, we discovered a remarkable genetic shift that resulted in near complete disappearance of the original T/F virus and appearance of a variant with H173Y mutation in the variable V2 domain of the HIV-1 envelope protein. This coincided with the disappearance of the first wave of strictly H173-specific antibodies and emergence of a second wave of Y173-specific antibodies with increased breadth. Structural analyses indicated conformational dynamism of the envelope protein which likely allowed selection of escape variants with a conformational switch in the V2 domain from an α-helix (H173) to a β-strand (Y173) and induction of broadly reactive antibody responses. This differential breadth due to a single mutational change was also recapitulated in a mouse model. Rationally designed combinatorial libraries containing 54 conformational variants of V2 domain around position 173 further demonstrated increased breadth of antibody responses elicited to diverse HIV-1 envelope proteins. These results offer new insights into designing broadly effective HIV-1 vaccines.

Genetics

Immunology

Paper

Genetics

Save