Questions tagged [multidplyr]

multidplyr is an R package by Hadley Wickham that enables parallel processing on partitioned data.frames. This tag should not be used for dplyr-only questions.

multidplyr is an R package by Hadley Wickham that enables parallel processing on partitioned data.frames. It is a complement to his popular dplyr package and part of the extended tidyverse ecosystem of packages.

51 questions
0
votes
1 answer

How to install and call package ‘multidplyr’ using windows 10 and R 3.4.4

I have attempted to use devtools::install_github("hadley/multidplyr") however when I use it I get the error Error: Failed to install 'multidplyr' from GitHub: (converted from warning) cannot remove prior installation of package ‘rlang’
Dominic Naimool
  • 313
  • 2
  • 11
0
votes
1 answer

porcess multiple .csv files with code for one, and output as whole

I am using R to sum up column "A" in multiple .CSV files, I've prepared code for 1 csv file, is there anyway that i can process all the csv files in one go and output as one? MyFile <- read.csv(tk_choose.files(caption = "Choose CSV files from…
Bomber Gay
  • 39
  • 4
0
votes
2 answers

Collapse based on a condition

This question is similar to a question already posted few days ago, Collapse rows from 0 to 0 The new twist here which is different from the previous question is this, how do we collapse rows by Id for only those rows where the time different is…
Sundown Brownbear
  • 491
  • 1
  • 5
  • 15
0
votes
1 answer

Dplyr and RJDBC : unable to find an inherited method for function ‘dbGetQuery’ for signature ‘"JDBCConnection", "tbl_df"

I'm using RJDBC package for connecting to an Oracle DB. I need to retrieve a huge amount of data, and I would like to distribute the inquiry through different cores. I know that the table has five millions records. Thus I write a SQL Query…
0
votes
1 answer

How to pass vector of column names into multidplyr's partition function in R

I am facing an issue with multidplyr's partition function. My objective is to find the summary statistics by group of column names. ex: rcols <- c("cyl","am","vs") Now I wanted to find summary statistics by using above rcols object. I can do it in…
789372u
  • 77
  • 1
  • 8
0
votes
2 answers

R multidplyr: how to assign packages to clusters

I am using the package multidplyr where you can split the data over multiple cores within a dplyr pipe. You can assign values and functions to the cores with cluster_assign_value(), but the question is how to assign packages to the clusters? Because…
JdP
  • 714
  • 7
  • 14
1 2 3
4