I am working with very large data in R. My data is stored in Hive, and I connect to it using RJDBC. I am thinking of using a remote table reference in R, because it is impossible to load the table into R, even with just a 10% sample. I am using the tbl function from dplyr.
transaction <- tbl(conn,"transaction")
R gave me this error message:
the dbplyr package is required to communicate with the database backends.
I am using a remote computer, and it is impossible to install packages on this R version. Is there any other way to solve the problem?