ResearchHub | Open Science Community

SM

Sara Madeira

Author with expertise in Analysis of Gene Interaction Networks

Achievements

Cited Author

Open Access Advocate

Key Stats

Upvotes received:

0

Publications:

4

(75% Open Access)

Cited by:

2,585

h-index:

24

/

i10-index:

48

Reputation

Biology

< 1%

Chemistry

< 1%

Economics

< 1%

Show more

How is this calculated?

Publications

Biclustering algorithms for biological data analysis: a survey

Sara Madeira et al.Jan 1, 2004

A large number of clustering approaches have been proposed for the analysis of gene expression data obtained from microarray experiments. However, the results from the application of standard clustering methods to genes are limited. This limitation is imposed by the existence of a number of experimental conditions where the activity of genes is uncorrelated. A similar limitation exists when clustering of conditions is performed. For this reason, a number of algorithms that perform simultaneous clustering on the row and column dimensions of the data matrix has been proposed. The goal is to find submatrices, that is, subgroups of genes and subgroups of conditions, where the genes exhibit highly correlated activities for every condition. In this paper, we refer to this class of algorithms as biclustering. Biclustering is also referred in the literature as coclustering and direct clustering, among others names, and has also been used in fields such as information retrieval and data mining. In this comprehensive survey, we analyze a large number of existing approaches to biclustering, and classify them in accordance with the type of biclusters they can find, the patterns of biclusters that are discovered, the methods used to perform the search, the approaches used to evaluate the solution, and the target applications.

Artificial Intelligence

Molecular Biology

0

Paper

Artificial Intelligence

Save

Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling

Ignacio Medina et al.May 15, 2010

Babelomics is a response to the growing necessity of integrating and analyzing different types of genomic data in an environment that allows an easy functional interpretation of the results. Babelomics includes a complete suite of methods for the analysis of gene expression data that include normalization (covering most commercial platforms), pre-processing, differential gene expression (case-controls, multiclass, survival or continuous values), predictors, clustering; large-scale genotyping assays (case controls and TDTs, and allows population stratification analysis and correction). All these genomic data analysis facilities are integrated and connected to multiple options for the functional interpretation of the experiments. Different methods of functional enrichment or gene set enrichment can be used to understand the functional basis of the experiment analyzed. Many sources of biological information, which include functional (GO, KEGG, Biocarta, Reactome, etc.), regulatory (Transfac, Jaspar, ORegAnno, miRNAs, etc.), text-mining or protein-protein interaction modules can be used for this purpose. Finally a tool for the de novo functional annotation of sequences has been included in the system. This provides support for the functional analysis of non-model species. Mirrors of Babelomics or command line execution of their individual components are now possible. Babelomics is available at http://www.babelomics.org.

Artificial Intelligence

0

Paper

Save

The YEASTRACT database: an upgraded information system for the analysis of gene and genomic transcription regulation inSaccharomyces cerevisiae

Miguel Teixeira et al.Oct 28, 2013

The YEASTRACT (http://www.yeastract.com) information system is a tool for the analysis and prediction of transcription regulatory associations in Saccharomyces cerevisiae. Last updated in June 2013, this database contains over 200 000 regulatory associations between transcription factors (TFs) and target genes, including 326 DNA binding sites for 113 TFs. All regulatory associations stored in YEASTRACT were revisited and new information was added on the experimental conditions in which those associations take place and on whether the TF is acting on its target genes as activator or repressor. Based on this information, new queries were developed allowing the selection of specific environmental conditions, experimental evidence or positive/negative regulatory effect. This release further offers tools to rank the TFs controlling a gene or genome-wide response by their relative importance, based on (i) the percentage of target genes in the data set; (ii) the enrichment of the TF regulon in the data set when compared with the genome; or (iii) the score computed using the TFRank system, which selects and prioritizes the relevant TFs by walking through the yeast regulatory network. We expect that with the new data and services made available, the system will continue to be instrumental for yeast biologists and systems biology researchers.

Molecular Biology

0

Paper

Save

Temporal stratification of amyotrophic lateral sclerosis patients using disease progression patterns

Danielle Amaral et al.Jul 8, 2024

Abstract Identifying groups of patients with similar disease progression patterns is key to understand disease heterogeneity, guide clinical decisions and improve patient care. In this paper, we propose a data-driven temporal stratification approach, ClusTric, combining triclustering and hierarchical clustering. The proposed approach enables the discovery of complex disease progression patterns not found by univariate temporal analyses. As a case study, we use Amyotrophic Lateral Sclerosis (ALS), a neurodegenerative disease with a non-linear and heterogeneous disease progression. In this context, we applied ClusTric to stratify a hospital-based population (Lisbon ALS Clinic dataset) and validate it in a clinical trial population. The results unravelled four clinically relevant disease progression groups: slow progressors, moderate bulbar and spinal progressors, and fast progressors. We compared ClusTric with a state-of-the-art method, showing its effectiveness in capturing the heterogeneity of ALS disease progression in a lower number of clinically relevant progression groups.

0

Paper

Save