Questions tagged [summarize]

A dplyr instruction ( actually named summarise( ) ) to create a new data frame by grouping data according to given grouping variables. Use this tag along with the dplyr version being used. Mind the spelling in the method name.

summarise() creates a new data frame. It will have one (or more) rows for each combination of grouping variables; if there are no grouping variables, the output will have a single row (or more, as of dplyr 1.0.0) summarising all observations in the input. It will contain one column for each grouping variable and one column for each of the summary statistics that you have specified.

836 questions

votes

1 answer

Get Proportion Graph From Summarise and Facet Wrap

I have three categorical variables and one numeric variable; I want to show proportions by segmenting the data based on my categorical variables and getting the proportions of the numeric variable. The data is as follows: ID Brand Color Gear…

r ggplot2 facet summarize

asked Aug 21 '19 at 15:24

Johal Alberto Baez

votes

2 answers

How can I optimize the dplyr code by group if all calculations are the same

I have the following data frame, which is a subset of a much larger one containing over 3 million rows. df <- data.frame(Group = c(1,1,1,2,2,3,3,3,2,2,4,4,1,4,1,3,1,3,2,4,2,1,3,2,4), SubGroup =…

r optimization dplyr summarize

asked Aug 01 '19 at 04:54

Dfeld

votes

1 answer

Recreate dplyr summarise in data.table

Just out of curiosity, is there a way of recreating the summary output using data.table instead of dplyr? dt1 <- data.table( uid=c("A00111", "A00112","A00113","A00211","A00212","A00213","A00214","A00311","A00312"), area=c("A001",…

r dplyr data.table summarize

asked Jul 09 '19 at 08:24

Chris

1,197
9
28

votes

1 answer

Creating new variable with summary values based on group

I really have two questions. I am quite certain that the second one would help me solve the first one, but I might be on the wrong track altogether and there might be simpler solutions. First question: I would like to make a stacked bar chart using…

r ggplot2 dplyr summarize

asked Jun 14 '19 at 21:53

Tea Tree

votes

0 answers

dplyr summarize using a reference column for values

I'm trying to perform a simple summarize operation using a trimmed mean by referencing a trim-value column. I keep getting length errors and I cannot understand why this is not working. Maybe I'm missing something obvious? I don't need any special…

r dplyr summarize

asked Jun 12 '19 at 23:02

qab

votes

3 answers

How can I collapse multiple columns and to generate new variables from the different levels/values that were collapsed?

I have a dataset (df) similar to this one: df <- data.frame("ID"=c(1, 1, 1, 2, 2), "Method of payment"=c("cash","liabilities", "shares", "cash", NA), "USD"=c(110, 130, 200,…

r group-by aggregate summarize

asked May 26 '19 at 19:41

Esperanta

votes

1 answer

Remove duplicates based on second column

I am trying to write a section of code that does a few things: 1) group dataset by ID 2) count the number of unique months in column data.month 3) remove all IDs that have less than 9 months 4) print distinct IDs based on the company (ie print…

r filter duplicates summarize

asked May 23 '19 at 07:12

Cae.rich

votes

1 answer

Is there a way to auto summarize bulk data?

I am want to be able to take "bulk" material lists and have them automatically summarized to "sum" like-with-like items. For example, would there be an efficient way to accomplish the following? Is there an efficient way to get from: to. . . ? Your…

excel excel-formula sum summary summarize

asked May 06 '19 at 19:45

Bryan M

votes

1 answer

Grouping by multiple factors and summarizing counts of factors

I have a bunch of categorical ship "Type" data, e.g. passenger, fishing, cargo etc. within different distances offshore (DOS, e.g 0-12 nm, 0-25 nm etc.) for different months of the year. Initially I want to get a count of the number of Type, e.g.…

r group-by dplyr factors summarize

asked May 02 '19 at 18:58

Lmm

votes

1 answer

compute the breteau index with R

Here is a data frame I have and I want to compute the breteau index ## Here is the table commune container house aegypti albopictus yde4 c1 h1 6 6 yde2 c2 h2 2 3 yde7 c3 h3 …

r indexing summarize

asked Apr 30 '19 at 04:47

Armel Tedjou

votes

2 answers

R question: shapiro.test function not working in dplyr::summarize while other summary functions do

When I try to use shapiro.test as a summary function on my R DataFrame I get the error: df %>% summarize_all(shapiro.test) Error: Column `A` must be length 1 (a summary value), not 4 Here is my setup: df = data.frame(A=sample(1:10,5),…

r dataframe dplyr summarize

asked Apr 25 '19 at 17:26

abalter

9,663
17
90
145

votes

2 answers

Calculations with dplyr based on specific factors and dates and summaries of values

I have a data frame of counts of different classifications of ship on specific dates at certain distances off shore (DOS), e.g. 0-12nm and 0-100nm - I would like to subtract the ships within the 0-12nm DOS from 0-100nm, so that I can calculate how…

r dplyr apply summarize

asked Apr 24 '19 at 19:43

Lmm

votes

0 answers

How to work with dependencies when grouping/summarising over multiple columns?

I'm trying to summarise multiple columns in a data frame using dplyr's group_by/summarise. If there is a dependency on an earlier column in one of the later columns, summarise uses the already summarised values. Is there a way to avoid this…

r dplyr dependencies multiple-columns summarize

asked Apr 24 '19 at 08:09

Tom

votes

3 answers

How to summarize leave to without public holidays?

mysql datetime summarize holidays-gem

asked Apr 19 '19 at 06:18

Prochu1991

votes

2 answers

Gensim summarization returning repeated lines as summary of text documents

I am getting repeated lines in my summarizer output. I am using genism in python for summarizing text documents. How to remove duplicate lines from the output of the summarizer. The output is coming with repeated content. How can I only keep unique…

python nlp gensim summarization summarize

asked Mar 27 '19 at 18:24

checkmate

Prev 1 2 3

…

55 56 Next