Questions tagged [statistics]

Consider whether your question would be better asked at https://stats.stackexchange.com. Statistics is the mathematical study of using probability to infer characteristics of a population from a limited number of samples or observations.

Statistics is the scientific study of the collection, analysis, interpretation, presentation, and organization of data. Numerous programming languages provide support for implementing statistical techniques.

Consider whether your question would be better asked at CrossValidated, a Stack Exchange site for probability, statistics, data analysis, data mining, experimental design, and machine learning. StackOverflow questions on statistics should be about implementation and programming problems, not about theoretical discussions of statistics or research design. Therefore, this tag should never be used alone but always in combination with a specific programming language (like for example , , , , ).

16319 questions
4
votes
3 answers

numpy polyfit with data that has varying levels of statistical significance

Polyfit is a great tool to fit a line to a set of points. However my data has varying levels of statistical significance. For example, for one point (x1,y2) I might only have 10 observations, while for another point (x2,y2) I might have 10,000…
Sam Odio
  • 2,717
  • 5
  • 22
  • 25
4
votes
1 answer

Python: PyPy: SciPy/Chi-square distribution?

I have a Python project where performance is fairly important. Honestly, I know Python (at least pure Python) isn't the best tool for the job, but I'm using it anyhow because I want to distribute a very simple pure python version of some research…
dsimcha
  • 67,514
  • 53
  • 213
  • 334
4
votes
2 answers

Predicting a Poisson process

I want to predict the inter-arrival times of road traffic with Poisson distribution. At the moment, I produce the (synthetic) arrival times with Poisson process so that the inter-arrival times have exponential distribution. Observing the past data,…
4
votes
1 answer

How to calculate statistical significance of conversion rate of several goals?

I am making a python/django split testing or a/b testing library for my own use. However I don't know how to calculate if my test is statisically significant. I have no knowledge of statistics, so I cannot understand most wikipedia articles, or web…
Amandasaurus
  • 58,203
  • 71
  • 188
  • 248
4
votes
3 answers

How do I calculate a p-value if I have the t-statistic and d.f. (in Perl)?

I have written a Perl script that performs many one-sample t-tests. I get thousands of t-statistics with their degrees of freedom (df). I need to upgrade the script to also return their p-values (there are too many to look them up manually in a…
Abdel
  • 5,826
  • 12
  • 56
  • 77
4
votes
2 answers

What percent of tablet users browse in portrait vs. landscape orientation?

iPad and various Android tablets have been out for awhile now, but I haven't been able to find the answer to this question: When using tablets, which orientation is used most often, especially for Web browsing? My specific need is to optimize the…
Lax
  • 41
  • 2
4
votes
4 answers

What is the most efficient way to log user action and page viewes?

I've created a website where users will be able to add entries, associate rows in different tables and so on. I need to track what actions users are doing for a score table. I also need to keep track on page views. I'm trying to figure out what is…
Steven
  • 19,224
  • 47
  • 152
  • 257
4
votes
3 answers

Two Factor ANOVA Errorbar plot in R

We're teaching a stats class for biology students and trying to use R as the computing and data visualization platform. As much as possible, we'd like to avoid using extra packages and doing anything terribly "fancy" in R; the focus of the course…
James Waters
  • 323
  • 1
  • 3
  • 9
4
votes
3 answers

Chi-Squared Probability Function in C++

The following code of mine computes the confidence interval using Chi-square's 'quantile' and probability function from Boost. I am trying to implement this function as to avoid dependency to Boost. Is there any resource where can I find such…
neversaint
  • 60,904
  • 137
  • 310
  • 477
4
votes
3 answers

k-means with ellipsoids

I have n points in R^3 that I want to cover with k ellipsoids or cylinders (I don't really care; whichever is easier). I want to approximately minimize the union of the volumes. Let's say n is tens of thousands and k is a handful. Development…
4
votes
2 answers

How is uniformity expressed?

I don't know anything about statistics and it was difficult for me to find A way to describe my question that was clear. I am doing some initial research on a system that will measure the uniformity of electricity across a conductor. Basically we…
scubasteve
  • 2,718
  • 4
  • 38
  • 49
4
votes
4 answers

How to specify FUN used in by( ) or related apply( ) functions

In a by() function, I will use cor (correlation) to be the FUN there. However, I'd like to setup use="complete.obs" too. I don't know how to pass this argument in the FUN = cor part. For example, by(data, INDICES=list(data$Age), FUN=cor)
Leo5188
  • 1,967
  • 2
  • 17
  • 21
4
votes
1 answer

Tests for Variance Components - mixed model

See the following situation: Ok, based on this, I have fitted the following models above in R (however, I am not sure if these models are right): library(nlme) model1 <- lm(Y ~ Treatm * VarT, data = datarats) model2 <- lme(Y ~ Treatm * VarT, data =…
user55546
  • 37
  • 1
  • 15
4
votes
1 answer

Getting "set statistics io on" results in t-sql for tuning

I want to add monitoring capabilities to a complex process involving many stored procedures. In some cases I want to capture the number of logical reads produced by a single statement. In other words, I would like to turn on the set statistics io…
user975383
  • 51
  • 4
4
votes
1 answer

when plotting autocorrelation function plot getting UFuncTypeError

when plotting autocorrelation function plot gettingUFuncTypeError: Cannot cast ufunc 'divide' output from dtype('float64') to dtype('int32') the error happens after the comment # Plot the autocorrelation function for X and Y. do I need to convert…