biclustering data mining gene expression simulated annealing
Issue Date:
2006
Citation:
Bryan K., Cunningham P., Bolshakova N. ‘Application of simulated annealing to the biclustering of gene expression data’ in IEEE Transactions on Information Technology on Biomedicine, 10, (3), 2006, pp 519-525
Series/Report no.:
IEEE Transactions on Information Technology on Biomedicine 10 3
Abstract:
In a gene expression data matrix, a bicluster is a submatrix
of genes and conditions that exhibits a high correlation of
expression activity across both rows and columns. The problem
of locating the most significant bicluster has been shown to be
NP-complete. Heuristic approaches such as Cheng and Church’s
greedy node deletion algorithm have been previously employed.
It is to be expected that stochastic search techniques such as evolutionary
algorithms or simulated annealing might improve upon
such greedy techniques. In this paper we show that an approach
based on simulated annealing is well suited to this problem, and we
present a comparative evaluation of simulated annealing and node
deletion on a variety of datasets.We show that simulated annealing
discovers more significant biclusters in many cases. Furthermore,
we also test the ability of our technique to locate biologically verifiable
biclusters within an annotated set of genes.
Please note: There is a known bug in some browsers that causes an
error when a user tries to view large pdf file within the browser window.
If you receive the message "The file is damaged and could not be
repaired", please try one of the solutions linked below based on the
browser you are using.
Items in TARA are protected by copyright, with all rights reserved, unless otherwise indicated.