I have a sample (rows) by species (columns) dataframe. And a column in another dataframe that codes the samples into groups. I want to select all of the columns where all of the samples in any of the groups have a nonzero value.
species frame:
structure(list(Otu000132 = c(0L, 56L, 30L, 52L, 1L, 4L, 31L, 4L, 17L, 9L, 4L),
Otu000144 = c(191L, 14L, 58L, 137L, 127L, 222L, 26L, 175L, 133L, 107L, 43L),
Otu000146 = c(0L, 0L, 0L, 0L, 16L, 62L, 41L, 16L, 60L, 32L, 0L),
Otu000147 = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L),
Otu000151 = c(2L, 9L, 4L, 1L, 0L, 4L, 4L, 2L, 3L, 0L, 0L),
Otu000162 = c(2L, 1L, 0L, 0L, 1L, 1L, 0L, 2L, 1L, 0L, 0L),
Otu000164 = c(2L, 0L, 1L, 2L, 0L, 0L, 0L, 0L, 0L, 0L, 0L),
Otu000174 = c(0L, 0L, 3L, 1L, 0L, 2L, 0L, 1L, 2L, 1L, 0L),
Otu000176 = c(1L, 9L, 0L, 1L, 2L, 5L, 3L, 3L, 8L, 2L, 2L),
Otu000186 = c(1L, 0L, 0L, 0L, 0L, 0L, 0L, 1L, 0L, 0L, 0L),
Otu000190 = c(1L, 1L, 1L, 0L, 0L, 5L, 1L, 2L, 7L, 0L, 0L)),
.Names = c("Otu000132", "Otu000144", "Otu000146", "Otu000147",
"Otu000151", "Otu000162", "Otu000164", "Otu000174",
"Otu000176", "Otu000186", "Otu000190"),
row.names = 30:40, class = "data.frame")
grouping frame:
structure(c(30, 31, 32, 33, 34, 35, 36, 37, 38, 39,
40, 1, 1, 1, 2, 2, 2, 2, 3, 3, 3, 3),
.Dim = c(11L, 2L))
desired output:
structure(list(Otu000132 = c(0L, 56L, 30L, 52L, 1L, 4L, 31L, 4L, 17L, 9L, 4L),
Otu000144 = c(191L, 14L, 58L, 137L, 127L, 222L, 26L, 175L, 133L, 107L, 43L),
Otu000151 = c(2L, 9L, 4L, 1L, 0L, 4L, 4L, 2L, 3L, 0L, 0L),
Otu000176 = c(1L, 9L, 0L, 1L, 2L, 5L, 3L, 3L, 8L, 2L, 2L),
Otu000190 = c(1L, 1L, 1L, 0L, 0L, 5L, 1L, 2L, 7L, 0L, 0L)),
.Names = c("Otu000132", "Otu000144", "Otu000151",
"Otu000176", "Otu000190"),
row.names = 30:40, class = "data.frame")
I feel like this should be something that I could do with dplyr select, but I can't figure it out. Anyone have suggestions for starting me on a path?