computational biology

Prediction of response to vemurafenib in BRAF V600E mutant cancers based on a network approach

Lung adenocarcinoma is the tumor with the highest number of switch genes (298) compared to its normal tissue, followed by thyroid (227) and colorectal (183) cancers. Switch genes codifying for kinases were 14,7 and 3 respectively.We looked for three homology sequences identified across vemurafenib targets and we found that thyroid cancer and lung adenocarcinoma have a similar number of putative targetable switch genes kinase (5-6); on the contrary, colorectal cancer has just one,with minor
homology sequence.

Gene co-expression in the interactome: moving from correlation toward causation via an integrated approach to disease module discovery

In this study, we integrate the outcomes of co-expression network analysis with the human interactome network to predict novel putative disease genes and modules. We first apply the SWItch Miner (SWIM) methodology, which predicts important (switch) genes within the co-expression network that regulate disease state transitions, then map them to the human protein–protein interaction network (PPI, or interactome) to predict novel disease–disease relationships (i.e., a SWIM-informed diseasome).

Computational identification of specific genes for glioblastoma stem-like cells identity

Glioblastoma, the most malignant brain cancer, contains self-renewing, stem-like cells that sustain tumor growth and therapeutic resistance. Identifying genes promoting stem-like cell differentiation might unveil targets for novel treatments. To detect them, here we apply SWIM - a software able to unveil genes (named switch genes) involved in drastic changes of cell phenotype - to public datasets of gene expression profiles from human glioblastoma cells.

On the optimization of embedding spaces via information granulation for pattern recognition

Embedding spaces are one of the mainstream approaches when dealing with structured data. Granular Computing, in the last decade, emerged as a powerful paradigm for the automatic synthesis of embedding spaces that, at the same time, yield an interpretable model on the top of meaningful entities known as information granules. Usually, in these contexts, one aims at finding the smallest set of information granules in order to boost the model interpretability while keeping satisfactory performances.

Granular computing techniques for bioinformatics pattern recognition problems in non-metric spaces

Computational intelligence and pattern recognition techniques are gaining more and more attention as the main computing tools in bioinformatics applications. This is due to the fact that biology by definition, deals with complex systems and that computational intelligence can be considered as an effective approach when facing the general problem of complex systems modelling. Moreover, most data available on shared databases are represented by sequences and graphs, thus demanding the definition of meaningful dissimilarity measures between patterns, which are often non-metric in nature.

(Hyper)Graph embedding and classification via simplicial complexes

This paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures (i.e., information granules) on the top of which an embedding space can be built by means of symbolic histograms. In the embedding space, any Euclidean pattern recognition system can be used, possibly equipped with feature selection capabilities in order to select the most informative symbols.

Modelling and recognition of protein contact networks by multiple kernel learning and dissimilarity representations

Multiple kernel learning is a paradigm which employs a properly constructed chain of kernel functions able to simultaneously analyse different data or different representations of the same data. In this paper, we propose an hybrid classification system based on a linear combination of multiple kernels defined over multiple dissimilarity spaces. The core of the training procedure is the joint optimisation of kernel weights and representatives selection in the dissimilarity spaces.

Metabolic networks classification and knowledge discovery by information granulation

Graphs are powerful structures able to capture topological and semantic information from data, hence suitable for modelling a plethora of real-world (complex) systems. For this reason, graph-based pattern recognition gained a lot of attention in recent years. In this paper, a general-purpose classification system in the graphs domain is presented. When most of the information of the available patterns can be encoded in edge labels, an information granulation-based approach is highly discriminant and allows for the identification of semantically meaningful edges.

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma