Questions tagged [binning]

binning is the process of grouping data into "bins" used in statistics and data analysis

Binning is the process of grouping data into "bins" used in statistics and data analysis. For details see also Data binning - Wikipedia, the free encyclopedia

684 questions
0
votes
0 answers

Fitting Probability distribution function in eac histogram for each bin data using matlab.....?

I have x data and I did binning and created histogram of each bin.Now I want to fit the probability distribution function in each bin so I can see histogram and probability distribution function at the same graph. Here 'X' is in horizontal axis data…
0
votes
0 answers

Spark: grouping of data stream based on cycle time

I need your inputs regarding grouping of data stream within spark streaming on the basis of cycle time. We are receiving input data in this formats {Object_id:"vm123", time:"1469077478" , metric :"cpu.usage" , value :"50.8"}. Data frames are…
0
votes
1 answer

How to use binning method for identifying the incoming point belongs to which bin?

I have small query. I have two data sets. In one data sets for example I did binning and calculated the mean value and std value along with group binning. Now in I have second data sets of same parameters say X. I would like identify this X data…
ravi pandit
  • 117
  • 12
0
votes
0 answers

Android Camera Error: aec_calculate_sensitivity binning is less than 1

I am developing an Android app on Unity that uses the Camera (front and back) and once the app starts this error spams about 20 times a second in logcat. It stops when I close my app again. I have tried to find any information about…
0
votes
1 answer

irregular binning with regards to the sum of a column

I would like to bin a dataframe in pandas based on the sum of another column. I have the following dataframe: time variable frequency 2 7 7 3 12 2 4 13 3 6 15 4 6 …
WGP
  • 708
  • 2
  • 7
  • 18
0
votes
1 answer

Inconsistent behaviour of `cut`: Different intervals with same number and same displayed cut points

I encountered the following inconsistent behaviour of cut which gives me a headache: x <- 0.2316 cut(x, c(0, 0.2315, 10)) #gives 0.232 as cutpoint and choses second interval ## [1] (0.232,10] ## Levels: (0,0.232] (0.232,10] cut(x, c(0, 0.232, 10))…
vonjd
  • 4,202
  • 3
  • 44
  • 68
0
votes
1 answer

Irregular binning of python pandas dataframe

I am getting to grips with python pandas. The toy problem below, illustrates an issue I am having in a related exercise. I have sorted a data-frame so that it presents a column's values (in this case students' test scores) in ascending…
Sam Gregson
  • 159
  • 1
  • 14
0
votes
1 answer

Python convert Continuous data into categorial

I have a continuous floating point data, ranging from -257.2 to 154.98, I have no idea how it is distributed. But I would want it to be in the bins - say -270 to -201, -200 to -141, -140 to -71, -70 to -1, 0 to 69, 70 to 139, 140 to 209 Is there a…
Adorn
  • 1,403
  • 1
  • 23
  • 46
0
votes
0 answers

Binning a data set using Pandas

Given a csv file of... neg,,,,,,, SAMPLE 1,,SAMPLE 2,,SAMPLE 3,,SAMPLE…
KeironO
  • 179
  • 1
  • 9
0
votes
2 answers

Smooth values using bin Boundaries: Where do you set a value who sits right between the lower and upper boundary?

In response to @j.jerrod.taylor's answer, let me rephrase my question to clear any misunderstanding. I'm new to Data Mining and am learning about how to handle noisy data by smoothing my data using the Equal-width/Distance Binning method via "Bin…
user2771721
  • 502
  • 1
  • 6
  • 13
0
votes
1 answer

using Spark: binning column1 and find mean of column2 based on column1's bins

I am learning apache spark and scala language. So some help please. I get 3 columns (c1, c2 and c3) from querying cassandra and get it in a dataframe in the scala code.. I have to bin(bin size = 3) (statistics, like in histogram ) c1 and find mean…
B1K
  • 198
  • 1
  • 2
  • 9
0
votes
1 answer

Creating Hexbins with Dates in R hexbin()

I am trying to create hexbins where the x-axis is a date using the hexbin function in the hexbin package in R. When I feed in my data, it seems to convert the dates into a numeric, which gets displayed on the x-axis. I want it force the x-axis to…
Stefan P
  • 3
  • 1
0
votes
1 answer

Calculating Mutual Information between features in Matlab

I need to calculate the mutual information between various features for designing a classification model using logistic regression. I am facing following problems: I need to divide my data into n bins having approximately equal number of samples.…
Neel Shah
  • 349
  • 1
  • 4
  • 12
0
votes
0 answers

Binning variables in a dataframe with input bin data from another dataframe

Being a beginner-level user of R, despite having read (1) numerous posts about binning&grouping here at SO, and (2) documentation on data.table and dplyr packages, I still can't figure out how to apply the power of those packages for binning…
Aktan
  • 11
  • 2
  • 5
0
votes
1 answer

How can a bin width be made consistent between multiple matplotlib histograms?

I have a small function that is intended to take two lists of numbers and to compare them using their respective histograms overlaid and using a ratio plot. The bin width of the ratio plot is inconsistent with the bin width of the overlaid…
d3pd
  • 7,935
  • 24
  • 76
  • 127