I have a dataframe like this:
df <- data.frame(
sample1 = c(0, 1, 2, 0, 2, 1),
sample2 = c(0.3, 3, 2, 0.4, 2, 3),
sample3 = c(0.2, 1, 3, 0.1, 3, 3),
sample4 = c(0.4, 2, 4, 0.3, 1, 1),
sample5 = c(0.1, 2, 4, 0.2, 5, 3),
sample6 = c(0.2, 3, 1, 0.1, 6, 3),
sample7 = c(0.2, 1, 1, 0.4, 1, 1)
)
And the groups defined in this df:
groups <- data.frame(
samples = c("sample1", "sample2", "sample3", "sample4", "sample5", "sample6", "sample7"),
groups = c("group1", "group1", "group1", "group2", "group2", "group3", "group3")
)
Using R, I want to keep only the rows where the sum in at least one group is greater than 0.5, so that the resulting df would be:
sample1 sample2 sample3 sample4 sample5 sample6 sample7
2 1 3 1 2 2 3 1
3 2 2 3 4 4 1 1
5 2 2 3 1 5 6 1
6 1 3 3 1 3 3 1