i want to cluster the protein interaction database, into sub cluster and for that I used Hierarchical Clustering in R. But i get warning message which i can't understand and cluster would not created. My code and Database are as below:
Database:
trpD trpB
serB sdaA
pabA trpA
pabB trpA
pabA pabB
serB glyA
serB trpB
trpC trpA
ilvA trpA
serB ilvA
trpB trpA
pabB trpB
trpE trpC
trpC trpB
trpE trpB
pabB trpC
sdaA trpB
pabA trpD
trpE trpD
pabA trpC
sdaA trpA
serB trpA
pabA trpE
ilvA glyA
pabB trpD
trpD trpC
ilvA trpB
glyA trpA
glyA trpB
pabA trpB
trpE trpA
glyA sdaA
trpD trpA
here traA interact with trpB, serB interact with sdaA and so on... now i want to cluster them.My code is:
rm(list=ls())
options(max.print = 10000000)
library(igraph) # load package igraph
library(combinat)
library(e1071)
library(maptree)
read_database <- read.table("C:/Users/Priyanka/Desktop/text_summary.txt", header=TRUE, comment.char = "")
read_database
data_frame <- data.frame (read_database$V1, read_database$V2)
data_frame
dim(data_frame)
d_euclidian <- dist(read_database, method = "euclidean")
d_euclidian
I got the warning: Warning message: In dist(data_frame, method = "euclidean") : NAs introduced by coercion
can any one help? And also can any one tell me about other technique for clustering PPI. And can i use K means clustering here? If yes than How??? Please help..
Thanks...