Filling the missing values within each id in r

Question

I have a dataframe having some rows missing value. Here is a sample dataframe:

df <- data.frame(id = c(1,1,1, 2,2,2, 3,3,3),
                 item = c(11,12,13, 24,25,26, 56,45,56),
                 score = c(5,5, NA, 6,6,6, 7,NA, 7))

> df
  id item score
1  1   11     5
2  1   12     5
3  1   13    NA
4  2   24     6
5  2   25     6
6  2   26     6
7  3   56     7
8  3   45    NA
9  3   56     7

Grouping the dataset by id column, I would like to fill those NA values with the same score.

the desired output should be:

> df
  id item score
1  1   11     5
2  1   12     5
3  1   13     5
4  2   24     6
5  2   25     6
6  2   26     6
7  3   56     7
8  3   45     7
9  3   56     7

Any ideas?

Thanks!

You may use `fill` i.e. `df %>% group_by(id) %>% fill(score, .direction = "downup")` — akrun, Mar 03 '22 at 19:53

score 5 · Accepted Answer · answered Mar 03 '22 at 20:38

5

We can group by 'id' and fill

library(dplyr)
library(tidyr)
df %>%
   group_by(id) %>% 
   fill(score, .direction = "downup") %>%
   ungroup

answered Mar 03 '22 at 20:38

akrun

874,273
37
540
662

score 3 · Answer 2 · answered Mar 03 '22 at 21:18

Here is another option with base R

> transform(df, score = ave(score, id, FUN = function(x) mean(x, na.rm = TRUE)))
  id item score
1  1   11     5
2  1   12     5
3  1   13     5
4  2   24     6
5  2   25     6
6  2   26     6
7  3   56     7
8  3   45     7
9  3   56     7

BrunoPLC · Answer 3 · 2022-03-04T12:38:26.863

0

Another option is to create your own function,eg:

fill.in<-function(dataf){
  dataf2<-data.frame()
  for (i in 1:length(unique(dataf$id))){
  
    dataf1<-subset(dataf, id %in% unique(dataf$id)[i])
    
    dataf1$score<-max(dataf1$score,na.rm=TRUE)
    
    dataf2<-rbind(dataf2,dataf1)
  }
  return(dataf2)
}


fill.in(df)

edited Mar 04 '22 at 12:38

answered Mar 03 '22 at 22:41

BrunoPLC

91
2

Filling the missing values within each id in r

3 Answers3