ResearchHub | Open Science Community

CHARACTERISTICS OF PLANETARY CANDIDATES OBSERVED BYKEPLER. II. ANALYSIS OF THE FIRST FOUR MONTHS OF DATA

Thomas Barclay et al.Jun 29, 2011

On 2011 February 1 the Kepler mission released data for 156,453 stars observed from the beginning of the science observations on 2009 May 2 through September 16. There are 1235 planetary candidates with transit-like signatures detected in this period. These are associated with 997 host stars. Distributions of the characteristics of the planetary candidates are separated into five class sizes: 68 candidates of approximately Earth-size (Rp < 1.25 R⊕), 288 super-Earth-size (1.25 R⊕ ⩽ Rp < 2 R⊕), 662 Neptune-size (2 R⊕ ⩽ Rp < 6 R⊕), 165 Jupiter-size (6 R⊕ ⩽ Rp < 15 R⊕), and 19 up to twice the size of Jupiter (15 R⊕ ⩽ Rp < 22 R⊕). In the temperature range appropriate for the habitable zone, 54 candidates are found with sizes ranging from Earth-size to larger than that of Jupiter. Six are less than twice the size of the Earth. Over 74% of the planetary candidates are smaller than Neptune. The observed number versus size distribution of planetary candidates increases to a peak at two to three times the Earth-size and then declines inversely proportional to the area of the candidate. Our current best estimates of the intrinsic frequencies of planetary candidates, after correcting for geometric and sensitivity biases, are 5% for Earth-size candidates, 8% for super-Earth-size candidates, 18% for Neptune-size candidates, 2% for Jupiter-size candidates, and 0.1% for very large candidates; a total of 0.34 candidates per star. Multi-candidate, transiting systems are frequent; 17% of the host stars have multi-candidate systems, and 34% of all the candidates are part of multi-candidate systems.

Astronomy And Astrophysics

Nuclear And High Energy Physics

0

Paper

Astronomy And Astrophysics

966

0

Save

3

ioSearch: a tool for searching disease-associated interacting omics; application on breast cancer data

Santanu Das et al.Aug 3, 2022

Abstract Biomarkers identification is difficult for cancer and other polygenic traits because such complicated diseases occur due to an intricate interplay of various genetic materials. Although high-throughput data from recent technolo-gies provide access to a tremendous amount of information still there is a huge gap in harnessing knowledge from the generated multi-omics data. It is evident from the availability of subject-specific multi-omics data from large consortiums that there is a growing need for appropriate tools to analyze such data. Traditional single-omics association tests more often identify strong signals but fail to explore the between-omics relationship and find moderately weak signals due to multiple testing burdens. Multi-omics data integration intuitively provides a clear advantage in understanding the genetic architecture of disease a little better by imparting complementary information. But the construction of such methods is challenging because of the diversity in the nature of multiple omics and the sample size which is much less than the number of omics variables. It is important to consider factors such as data diversity and prior biological knowledge to make meaningful and better predictions. Dimension reduction techniques such as feature selection are used to circumvent the sample size issue in general but treating all the omics variables similarly might be an oversimplification of the complex biological interactions. The lack of appropriate approaches for biomarker identification from complex multi-omics data led us to develop this method. ioSearch is a tool for integrating two omics assays with continuous measurements. Based on a two-step model, ioSearch explores the inter-relationship of the omics in a principal regression framework and selects features using sparse principal component analysis to provide easily interpretable inference in terms of p-values. Also, it uses prior biological information to reduce multiple testing burdens. Extensive simulation results show that our method is statistically powerful with a controlled type I error rate. Application of ioSearch to two publicly available breast cancer datasets identified relevant genes and proteins in important pathways.

Ecology

Biochemistry

3

Paper

Save

TiMEG: an integrative approach for partially missing multi-omics data with an application to tuberous sclerosis

Santanu Das et al.Dec 11, 2020

1 Abstract Multi-omics data integration is widely used to understand the genetic architecture of disease. In multi-omics association analysis, data collected on multiple omics for the same set of individuals are immensely important for biomarker identification. But when the sample size of such data is limited, the presence of partially missing individual-level observations poses a major challenge in data integration. More often, genotype data are available for all individuals under study but gene expression and/or methylation information are missing for different subsets of those individuals. Here, we develop a statistical model TiMEG, for the identification of disease-associated biomarkers in a case-control paradigm by integrating the above-mentioned data types, especially, in presence of missing omics data. Based on a likelihood approach, TiMEG exploits the inter-relationship among multiple omics data to capture weaker signals, that remain unidentified in single-omics analyses. Its application on a real tuberous sclerosis dataset identified functionally relevant genes in the disease pathway.

Genetics

Molecular Biology

0

Paper

Save

Maritime On-the-Move Troposcatter for Resilient Naval Communications

Scott Enserink et al.Oct 28, 2024

Geology

Aerospace Engineering

0

Paper

Geology

Aerospace Engineering

0

Save