I have a file with about 17000 rows and I preformed a simple linear regression on
Gene_id expA expB
GeneA 5.462109 5.006181
GeneB 2.667692 4.208152
GeneC 4.796976 4.122660
GeneD 3.127125 3.676322
GeneE 4.500583 4.104575
GeneF 4.598430 4.853717
And I preformed the regression analysis using
plot(log2(data$expA)~log2(data$expB))
regression <- lm(log2(moved.data$expA)~log2(moved.data$expB))
abline(regression)
I am interested in what Genes which are outliers are from my regression analysis.
I tried using the identify(log2(data$expA)~log2(data$expB), row.names(data))
function but I have a lot of points in my graph so to go click over them one by one doesn't seem feasible to me.
I also looked here: Outliers with robust regression in R
but that doesn't tell me how to figure out the Gene names of the outliers