Questions tagged [frequency-distribution]

A frequency distribution is an arrangement of the values that one or more variables take in a sample

A frequency distribution is an arrangement of the values that one or more variables take in a sample. Each entry in the table contains the frequency or count of the occurrences of values within a particular group or interval, and in this way, the table summarizes the distribution of values in the sample.

220 questions
0
votes
0 answers

Frequencies of age falling into right breaks

I need to calculate frequencies, the following code I'm using is working but I need the breaks to include the ages rounded to 3 digits, so that the frequency of a given age falls into the right break/class freqNT = NT$Age range(freqNT) #[1] 1.251826…
0
votes
1 answer

Breaks with the size classes, R

I need to make histograms with the breaks with my size classes, which are: 1:1.2, 1.2:1.4, 1.4:1.6, 1.6:1.8, 1.8:2, 2:2.2, 2.2:2.4, etc and their respective frequencies are: 4, 20, 52, 142, 205, 307, 445 I'm using the following code: op <-…
0
votes
2 answers

sas relative frequencies by group

I have a categorical variable, say SALARY_GROUP, and a group variable, say COUNTRY. I would like to get the relative frequency of SALARY_GROUP within COUNTRY in SAS. Is it possible to get it by proc SUMMARY or proc means?
Giorgio Spedicato
  • 2,413
  • 3
  • 31
  • 45
0
votes
1 answer

Can't obtain histogram from pandas grouped data

I have this DF. f = { 'Router_name':['count'] } a = a.groupby(['Week_end']).agg(f) Which produces the following data .. Router_name count Week_end 29 3 30 10 31 6 32…
0
votes
1 answer

DAX - Display the top 1 count in a calculated column

I have a [Company] column, I have a [Billing day] column in a powerpivot sheet. A Company can have several different Billing days. I want to show in a Pivot table, next to the company name, the Billing day with the highest number of occurences. The…
0
votes
1 answer

Decompose a signal into n equally sized sub-bands

In my Matlab script I'm loading an audio signal which I then want to decompose into n different sub-bands (and do different stuff with each band). Let's, for example, assume we have a signal containing frequencies up to 1KHz which I want to divide…
Nyfiken Gul
  • 654
  • 4
  • 20
0
votes
1 answer

R: Creating ggplot histograms to mirror freq() function

I'm trying to use ggplot to create a bar plot (or histogram) to mirror the freq function from the descr package (where each discrete value in the variable gets its own column in the frequency plot, with the x-ais ticks centered around each value),…
RSS
  • 163
  • 1
  • 2
  • 11
0
votes
1 answer

Summarizing frequencies across two columns with Pandas

I am looking for a Pandas function, that perrforms the following elementary operation given a DataFrame consisting off two columns. I would like to obtain the conditional distribution of elements in the first column given each particular value in…
splinter
  • 3,727
  • 8
  • 37
  • 82
0
votes
1 answer

SAS countif function without using proc

I need to do a frequency distribution of this one column data without using any proc freq; proc sql. I'm only allowed to use proc sort. In excel I would use a simple countif, but I don't know how to do this in SAS given above contraint. data…
Sam
  • 3
  • 3
0
votes
1 answer

Generate values from a frequency distribution

I'm currently analyzing a 16 bit binary string - something like 0010001010110100. I have approximately 30 of these strings. I have written a simple program in Matlab that counts the numbers of 1's in each bit for all 30 strings. So, for example: 1…
Jack D.
  • 41
  • 5
0
votes
0 answers

Conditional Freq Dist uni-grams from n-grams

If I have such a list of tuples: train = [('pad thai', 'FOOD#QUALITY'), ('Ginger House', 'RESTAURANT#GENERAL'), ('fried dumplings', 'FOOD#QUALITY'), ('Chinese restaurant', 'RESTAURANT#GENERAL'), ('customer service', 'SERVICE#GENERAL'), …
user3058703
  • 571
  • 1
  • 8
  • 22
0
votes
1 answer

R or Python - simulate specific theoretical multimodal distribution

I have different distribution which belong to biological data. These distributions are expected to follow either a single modal distribution (mean = 0.5), either a bimodal distribution (0.33 and 0.66) or a trimodal distribution (0.25, 0.5,…
dilution
  • 13
  • 2
0
votes
0 answers

Python: Counting strings in a list of dictionaries

I have extracted a list of dictionaries from Stanford NER and created a list like the following : myList = [ {'A':{},'B':['C','D'], 'names': {'PERSON': [u'John Butters', u'Bill', u'Hillary Clinton'],'LOCATION': [], 'ORGANIZATION': [u'FactSet',…
mk_sch
  • 1,060
  • 4
  • 16
  • 31
0
votes
0 answers

converting frequency to percentage in hist() changes distribution of data

I am trying to plot the data below as a histogram, where the female (f) standard length data (sl) is overlaid by the male (m) standard length data (sl) and the frequency is shown as a proportion of the total number of individuals in the sample; …
Daniel Svozil
  • 85
  • 1
  • 1
  • 9
0
votes
2 answers

spark dataset API : find out distribution of device usage for each user along with other aggregation

Below is my dataset. user,device,time_spent,video_start userA,mob,5,1 userA,desk,5,2 userA,desk,5,3 userA,mob,5,2 userA,mob,5,2 userB,desk,5,2 userB,mob,5,2 userB,mob,5,2 userB,desk,5,2 I want to find out below aggregation for each user. user …