Questions tagged [kurtosis]

Kurtosis is a statistical measure, which characterizes the extreme data (outlier) character of a distribution compared with the normal distribution. Positive (excess) kurtosis indicates a distribution that is more outlier-prone than a normal distribution. Negative (excess) kurtosis indicates a distribution that is less outlier-prone than a normal distribution.

Overview

From Wikipedia:

In probability theory and statistics, kurtosis (from Greek: κυρτός, kyrtos or kurtos, meaning "curved, arching") is a measure of the "tailedness" of the probability distribution of a real-valued random variable. In a similar way to the concept of skewness, kurtosis is a descriptor of the shape of a probability distribution and, just as for skewness, there are different ways of quantifying it for a theoretical distribution and corresponding ways of estimating it from a sample from a population. Depending on the particular measure of kurtosis that is used, there are various interpretations of kurtosis, and of how particular measures should be interpreted.

The standard measure of kurtosis, originating with Karl Pearson, is based on a scaled version of the fourth moment of the data or population. This number is related to the tails of the distribution, not its peak;[1] hence, the sometimes-seen characterization as "peakedness" is mistaken. For this measure, higher kurtosis is the result of infrequent extreme deviations (or outliers), as opposed to frequent modestly sized deviations.


Tag usage

Questions on tag should be about implementation and programming problems, not about the statistical or theoretical properties of the concept. Consider whether your question might be better suited to Cross Validated, the StackExchange site for statistics, machine learning and data analysis.

48 questions
2
votes
1 answer

Comparing Matlab and Apache statistics - kurtosis

Hi I am currently comparing statistics between Matlab and Apache functions. Here Apache functions are tested in Java. For the very same set of data, I get different results from a double array (double[] ) as…
kaligne
  • 3,098
  • 9
  • 34
  • 60
1
vote
0 answers

How to get skewness and kurtosis using Julia

I'm working on pointcloud lidar data. I want to calculate skewness and kurtosis to distinguish road from ground. My current intensity column looks like this in plot Is there an implementation on another language? I read First, the skewness of the…
h612
  • 544
  • 2
  • 11
1
vote
1 answer

Why do we have different values for skewness and kurtosis in MATLAB and Python?

Following are the codes for skewness and kurtosis in MATLAB: clc; clear all % Generate "N" data points N = 1:1:2000; % Set sampling frequency Fs = 1000; % Set time step value dt = 1/Fs; % Frequency of the signal f…
LearnerABC
  • 51
  • 1
  • 1
  • 7
1
vote
1 answer

Does the kurtosis calculation in the e1071 package in R need more than 3 values?

When calculating kurtosis using the e1071 package like the following: x_test <- c(100,98, 4) kurtosis(x_test) I always get -2.33 no matter what 3 numbers I pick. With 2 numbers, it's always -2.75. However when using other methods (such as pandas in…
1
vote
0 answers

Measuring "single strongest peak" in a distribution

I'd like to automatically detect whether data have a very strongly discernable peak, with any particular distribution. The data can otherwise be quite noisy, or there might be several 'false' peaks. Here are a few examples of the performance I'd…
1
vote
1 answer

Is kurtosis in excess when using the function from pyspark module?

When using the kurtosis function from the pyspark module pyspark.sql.functions.kurtosis(col), is the result in excess of the Normal distribution? ie. is 3 already subtracted from the kurtosis to yield k-3? Or would we have to calculate the excess…
thentangler
  • 1,048
  • 2
  • 12
  • 38
1
vote
2 answers

R - adding values for one column based on a function using another column

I have a dataset that looks like this head(dataset) Distance Lag time Kurtosis 7.406100 10 144.1700 1 77.31800 1 81.15400 1 4.249167 6 I want to add values to the kurtosis column. To calculate kurtosis I need to group the Distances…
N. J.
  • 105
  • 6
1
vote
0 answers

How can I generate a distribution with given Skeweness, kurtosis, mean and standard deviation in Matlab?

I need to create some distributions with given Skeweness, Kurtosis, Mean and Standard Deviation. I have generated a normal distribution by doing this: targetMean = [0.5, 0.75, 0.95, 1.2] for n = 1:size(targetMean,2) partnerLag =…
Glu
  • 327
  • 3
  • 14
1
vote
1 answer

Different moments given by R using the same library

I'm using R along with library moments to generate a small dataset and compute the four initial moments of my data: Mean Variation Skewness Kurstosis The code is shown below. I set a random seed for my PRNG and generates 1000 data points using a…
Iago Carvalho
  • 410
  • 1
  • 5
  • 15
1
vote
1 answer

How can I calculate the kurtosis of already binned data?

Does anyone know how to calculate the kurtosis of a distribution from binned data alone using Python? I have a histogram of a distribution, but not the raw data. There are two columns; one with the bin number and one with the count number. I need…
KWx
  • 310
  • 1
  • 10
1
vote
1 answer

How do I simulate returns from an empirically derived distribution in MATLAB (Or Python)?

I also have to keep in mind the skewness and the kurtosis of the distribution and these have to be reflected in the simulated values. My empirical values are past stock returns (non-standard normal distribution). Is there an existing package that…
Akshay Sakariya
  • 107
  • 1
  • 9
1
vote
1 answer

What is the difference between this code and this formula?

I am writing an algorithm and calculating the kurtosis of the distribution of daily returns. I am trying to get my calculation of kurtosis to match that of Excel's. Excel's calculation supposedly uses the formula at the top of this webpage:…
skibeats
  • 23
  • 3
1
vote
1 answer

Rapidminer for finding statistics

I am a beginner in Rapidminer . While working on Rapidminer i didn't find how to calculate or get skewness and kurtosis of a attributes in an exampleset. I would like to know is there a way to get best fit line for a scatter plot graph
1
vote
0 answers

Higher order moments and shape parameters

I have used 5th moment of my data as a feature for classification and it gives good results, but i don't know what it measures? is it a shape parameter like kurtosis and skewness? I'm using matlab's m=moment(X,order); which returns the central…
Rashid
  • 4,326
  • 2
  • 29
  • 54
1
vote
1 answer

How to calculate Skewness and Kurtosis in a similar manner as a weighted average in Excel?

I would like to compute some descriptive statistics about the following table of information. The main thing that I like to describe is the Price component. Price Qty 493 5 4500 8 2107 14 269 1 The weighted average is…
Jochem
  • 3,295
  • 4
  • 30
  • 55