Questions tagged [binning]

binning is the process of grouping data into "bins" used in statistics and data analysis

Binning is the process of grouping data into "bins" used in statistics and data analysis. For details see also Data binning - Wikipedia, the free encyclopedia

684 questions
0
votes
2 answers

Combining multiple columns of values together in R

So I scripted a code that allows me to bin values in a single column. However, the problem I have is that my data is contained in multiple columns, all of varying length and with NAs in it. The script I have for binning goes like this:- bin <-…
user2062207
  • 955
  • 4
  • 18
  • 34
0
votes
1 answer

How collect additional row data on binned data in R

I want sort the values of one data.frame column into predetermined bins, and then sum values that are in the same rows, but a different column.What I'm trying to do is sort dataframe column items into bins based on one value and then get a sum of a…
trenlo
  • 3
  • 1
0
votes
1 answer

Binning values in R with multiple files

So I've got a slight problem with binning values contained in multiple text files into set ranges. I've had a look online for various packages and came across sm which can bin values and you can specify the bin range as well like so:- xb <-…
user2062207
  • 955
  • 4
  • 18
  • 34
0
votes
1 answer

Collapse/mean data in Matlab with respect to a different set of data

I have two sets of data, but the sets have a different sizes. Each set contains the measurements itself (MeasA and MeasB, both double) and the time point (TimeA and TimeB, datenum or julian date) when the measuring happened. Now I want to match the…
Matty
  • 63
  • 1
  • 9
0
votes
1 answer

bin data depending on values of a separate column

I have a dataset which looks somehow like this toy example: s1 = pd.Series(np.random.rand(5)) s2 = pd.Series(np.random.rand(5) * 10) cat1 = pd.Series(['s1'] * 5) cat2 = pd.Series(['s2'] * 5) s = s1.append(s2).reset_index(drop=True) c =…
NoIdeaHowToFixThis
  • 4,484
  • 2
  • 34
  • 69
0
votes
2 answers

Binning two vectors of different ranges using R

I'm trying to assess the performance of a simple prediction model using R, by discretizing the prediction results by binning them into defined intervals and then compare them with the corresponding actual values(binned). I have two vectors actual…
Sailesh
  • 115
  • 2
  • 10
0
votes
1 answer

Binning in Pandas

Given the following dataframe in Pandas: "Age","Gender","Impressions","Clicks","Signed_In" 36,0,3,0,1 73,1,3,0,1 30,0,3,0,1 49,1,3,0,1 47,1,11,0,1 I need to make a separate categorical variable (column) which holds the bin label for each row based…
Mudit Rustagi
  • 13
  • 1
  • 3
0
votes
0 answers

How to Calculate area bining in MATLAB

I have a matrix of size (Mx3).Column 1 & 2 are latitude and longitude, respectively. Column three is temperature value. I want to do area binning of the data so that in every 100 meter square bin show 1 point instead of a few points. Any one has an…
0
votes
1 answer

binning information based on criteria using C++

I have my information that looks like this No. ID DATE_EVENT TIME_EVENT EVENT CODE 102995 018159871 07/08/2014 09:01:57 9008 1111 20398 018159871 07/08/2014 09:01:58 1000 1402 105541 018159871 07/08/2014…
Taylrl
  • 3,601
  • 6
  • 33
  • 44
0
votes
1 answer

Binning ages in R

I'm trying to write a function that bins ages into different groups. Suppose my data is the following: birthyear 1987 1995 1994 1981 1994 1989 1985 1987 1996 1981 1980 1994 1996 1983 1949 1988 1998 1977 1967 1968 And my function is written to…
Archimeow
  • 240
  • 6
  • 14
0
votes
1 answer

Creating Histogram with Binning Averages

I am making two histograms by using moving averages and binning. I have gotten a moving average of 18k data points , most that are 0 values ,by using excel. This is what I was looking to get done through R "Moving Average" I would like to use R in…
Dgales4130
  • 23
  • 7
0
votes
1 answer

2D data binning with overlapping in python

I have data in XYZ type. For example: x = numpy.arange(100) y = numpy.arange(100) Z = numpy.random.random_sample((100,)) I would like to bin data, for example, with overlap lengths of dx = 2 and dy = 2. What I did is: nx = len(x) ny =…
hoang tran
  • 3,918
  • 3
  • 19
  • 21
0
votes
1 answer

Framework of choice for building smart histogram in cocoa touch

I'm familiar with the basics of objective-c and cocoa touch and I want to start now my first large iOS project. I want to build an iPad app with a histogram like you know it from several photo apps where you can see e.g. the distribution of…
Mahatma_Fatal_Error
  • 720
  • 1
  • 10
  • 26
0
votes
1 answer

Manually specifying bins with stat_summary2d

I have a large set of data that consists of coordinates (x,y) and a numeric z value that is similar to density. I'm interested in binning the data, performing summary statistics (median, length, etc.) and plotting the binned values as points with…
0
votes
2 answers

R bin data by unique ID

From a data.frame: RowColFovCellID 1Feret 001001000      1.1       001002000      0.3       001002000      0.2       001003000      1.5       001001000      3.4       001002000      2.4       003003001      0.7       001001000      3.6       I would…
William Grimes
  • 675
  • 2
  • 11
  • 29