Options
Density-Based Clustering of Functionally Similar Genes Using Biological Knowledge
Journal
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN
03029743
Date Issued
2019-01-01
Author(s)
Pant, Namrata
Abstract
Clustering is used to identify natural groups present in the data. It has been applied widely for analyzing gene expression data to discover gene clusters that might be involved in same biological processes. This information is very important for analyzing data of fatal diseases like cancers and identifying potential diagnostic and prognostic markers. Existing clustering methods used in this regard are computationally efficient, but do not always produce biologically meaningful results. Additionally, they have one or the other shortcomings; either they are not able to deal with arbitrary-shaped clusters, require number of clusters to be specified previously or are not efficient in dealing with noise present in biological data. In this study, a new density-based clustering method specific for gene expression data is introduced that overcomes the above shortcomings and produces biologically enriched clusters of functionally similar genes by incorporating biological information from Gene Ontology (GO). The proposed method integrates the GO semantic similarity information and the correlation information between the genes for obtaining clusters. The clusters are further validated for their biological relevance using Disease Ontology, KEGG Pathway enrichment and protein-protein interaction network analysis.
Volume
11942 LNCS
Unpaywall