I'm working with this WNBA dataset here. I'm analyzing the Height
variable, and below is a table showing frequency, cumulative percentage, and cumulative frequency for each height value recorded:
From the table I can easily conclude that the first quartile (the 25th percentile) cannot be larger than 175.
However, when I use Series.describe()
, I'm told that the 25th percentile is 176.5. Why is that so?
wnba.Height.describe()
count 143.000000
mean 184.566434
std 8.685068
min 165.000000
25% 176.500000
50% 185.000000
75% 191.000000
max 206.000000
Name: Height, dtype: float64