Spectral clustering on protein-protein interaction networks via constructing affinity matrix using attributed graph embedding

K Berahmand, E Nasiri, Y Li - Computers in Biology and Medicine, 2021 - Elsevier
Abstract
The identification of protein complexes in protein-protein interaction networks is a fundamental problem in revealing the underlying mechanisms of biological processes. However, most existing protein complex identification methods consider only a network's topological structure and therefore miss the advantage of using the nodes' feature information. In protein-protein interaction networks, both topological structure and node features are essential ingredients for identifying protein complexes. The spectral clustering method uses the eigenvalues and eigenvectors of the affinity matrix of the data to map the data into a low-dimensional space. It has attracted much attention in recent years as one of the most efficient algorithms based on dimensionality reduction. In this paper, a new version of spectral clustering, named text-associated DeepWalk-Spectral Clustering (TADW-SC), is proposed for attributed networks, in which the identified protein complexes have both structural cohesiveness and attribute homogeneity. Since the performance of spectral clustering depends heavily on the quality of the affinity matrix, the proposed method uses text-associated DeepWalk (TADW) to calculate the embedding vectors of the proteins. The affinity matrix is then computed from the cosine similarity between pairs of these low-dimensional vectors, which considerably improves the accuracy of the affinity matrix. Experimental results show that the method performs unexpectedly well in comparison to existing state-of-the-art methods on both real protein network datasets and synthetic networks.
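The pipeline described in the abstract (embedding vectors, then a cosine-similarity affinity matrix, then spectral clustering) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The TADW embedding step is not reproduced here, so the embeddings are random toy vectors standing in for it. Clipping negative cosine similarities to zero, using a normalized (Ng-Jordan-Weiss style) spectral embedding, and the simple k-means with farthest-point initialization are all assumptions made for illustration, and every function name is hypothetical.

```python
import numpy as np

def cosine_affinity(embeddings):
    """Pairwise cosine similarity used as an affinity matrix."""
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)
    # Assumption: negative similarities are clipped to 0 so that the
    # matrix is a valid non-negative affinity; the paper may differ.
    affinity = np.clip(unit @ unit.T, 0.0, 1.0)
    np.fill_diagonal(affinity, 0.0)  # no self-loops
    return affinity

def spectral_embedding(affinity, n_clusters):
    """Top eigenvectors of D^{-1/2} A D^{-1/2}, with rows scaled to unit length."""
    degree = affinity.sum(axis=1)
    inv_sqrt = 1.0 / np.sqrt(np.clip(degree, 1e-12, None))
    normalized = affinity * inv_sqrt[:, None] * inv_sqrt[None, :]
    _, eigvecs = np.linalg.eigh(normalized)  # eigenvalues in ascending order
    top = eigvecs[:, -n_clusters:]           # largest n_clusters eigenvalues
    row_norms = np.linalg.norm(top, axis=1, keepdims=True)
    return top / np.clip(row_norms, 1e-12, None)

def kmeans(points, n_clusters, n_iter=50):
    """Plain k-means with deterministic farthest-point initialization."""
    centers = [points[0]]
    for _ in range(1, n_clusters):
        d = np.min([((points - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(points[d.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for k in range(n_clusters):
            if np.any(labels == k):
                centers[k] = points[labels == k].mean(axis=0)
    return labels

def cluster_embeddings(embeddings, n_clusters):
    """Affinity matrix -> spectral embedding -> k-means labels."""
    affinity = cosine_affinity(embeddings)
    return kmeans(spectral_embedding(affinity, n_clusters), n_clusters)

# Toy stand-in for TADW output: two groups of 4-dimensional vectors.
rng = np.random.default_rng(0)
group_a = np.array([1.0, 0.0, 0.0, 0.0]) + 0.05 * rng.standard_normal((5, 4))
group_b = np.array([0.0, 1.0, 0.0, 0.0]) + 0.05 * rng.standard_normal((5, 4))
toy_embeddings = np.vstack([group_a, group_b])
labels = cluster_embeddings(toy_embeddings, n_clusters=2)
```

In practice, `toy_embeddings` would be replaced by one embedding vector per protein, and each resulting label group would be a candidate protein complex.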