Questions tagged [p-value]

In statistical significance testing, the p-value is the probability of obtaining a test statistic at least as extreme as the one that was actually observed, assuming that the null hypothesis is true.

The p-value is a key concept in Ronald Fisher's approach, where it measures the weight of the data against a specified hypothesis and serves as a guideline for ignoring data that do not reach a specified significance level. Fisher's approach does not involve an alternative hypothesis; that belongs to the Neyman–Pearson approach instead. The p-value should not be confused with the Type I error rate (false positive rate) α of the Neyman–Pearson approach. Although α is also called a "significance level" and is often set to 0.05, the two terms have different meanings, the approaches are incompatible, and the numbers p and α cannot meaningfully be compared.
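As a rough illustration of the definition above, the following sketch computes a p-value for a one-sample z-test using only the Python standard library. The function name `z_test_p_value` and the numbers in the example are assumptions made for illustration; they do not come from any of the questions listed here.

```python
from statistics import NormalDist


def z_test_p_value(sample_mean, null_mean, sigma, n, two_sided=True):
    """Probability, under the null hypothesis, of a z statistic at least
    as extreme as the observed one (hypothetical helper)."""
    # Standardize the observed mean under the null hypothesis.
    z = (sample_mean - null_mean) / (sigma / n ** 0.5)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    if two_sided:
        # Both tails: extreme in either direction counts.
        return 2.0 * (1.0 - std_normal.cdf(abs(z)))
    # One-sided (upper tail) alternative.
    return 1.0 - std_normal.cdf(z)


# Assumed example: sample mean 103 from n = 100 draws, known sigma = 15,
# null hypothesis mean = 100, so z = 2.0.
p = z_test_p_value(sample_mean=103.0, null_mean=100.0, sigma=15.0, n=100)
print(round(p, 4))  # 0.0455
```

Under these assumptions the result is just a tail area of the null distribution; it is not the probability that the null hypothesis is true.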

791 questions
4
votes
3 answers

SAS Enterprise p-value and percentile

I'm considering teaching my introductory statistics course in SAS Enterprise Guide. I want my students to be able to calculate p-values and percentiles for various distributions (binomial, normal, t, chi-square) with the drop-down menus if at all…
Meg
4
votes
1 answer

Non-numeric argument to mathematical function

I want to get p-values from a data set. I have not had any problems using pnorm before, but I do now. data(iris) iris[,-5]<- scale(as.matrix(iris[,-5])) # K-Means Cluster Analysis fit <- kmeans(iris[,-5], 5) # 5 cluster solution # get cluster means…
user3027252
4
votes
1 answer

Calculate p values based on differences between bootstrapped 95% confidence intervals

I have bootstrapped model fits to some data from three different groups, using the 2.5 and 97.5 percentiles to generate a 95% confidence interval. I know that if 95% confidence intervals do not overlap then there is a significant difference…
X__
4
votes
2 answers

Error in boot() related to replacement length and data or data types? - R

boot() is failing with one dataset and succeeding with another ... must be a data issue? I just can't figure out the difference. But at least now I think I've got it reproducible. In both cases, an interaction between an integer and factor…
user2561533
4
votes
1 answer

Calculate venn diagram hypergeometric p value using R

Hi, I saw someone calculate a Venn diagram overlap p-value as in the following example. They use the hypergeometric distribution and R. When I apply their function in R, I just cannot get the same results. Can anyone help me with this? The sample I saw in…
user2700418
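
For readers unfamiliar with the idea behind the question above, here is a minimal sketch of a hypergeometric overlap p-value using only the Python standard library. It is not the function the asker refers to, which is not shown in the excerpt. The name `overlap_p_value`, its parameters, and the example counts are all hypothetical. It assumes the two sets are drawn from a common population of known size and that the question of interest is the chance of an overlap at least as large as the observed one.

```python
from math import comb


def overlap_p_value(population, set_a, set_b, overlap):
    """Upper-tail hypergeometric probability P(X >= overlap), where X is
    the overlap of two random subsets of sizes set_a and set_b drawn from
    `population` items (hypothetical helper)."""
    largest_possible = min(set_a, set_b)
    # Number of ways to choose set_b items from the whole population.
    total = comb(population, set_b)
    # Count the draws whose overlap with set_a is k, for each k >= overlap.
    tail = sum(
        comb(set_a, k) * comb(population - set_a, set_b - k)
        for k in range(overlap, largest_possible + 1)
    )
    return tail / total


# Assumed example: 20000 genes in total, lists of 300 and 500 genes,
# 30 genes shared between the two lists.
print(overlap_p_value(population=20000, set_a=300, set_b=500, overlap=30))
```

In R, the same upper tail is usually written as `phyper(overlap - 1, set_a, population - set_a, set_b, lower.tail = FALSE)`. Forgetting the `- 1` is a common reason for results that do not match, though the excerpt does not say whether that is the issue here.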
3
votes
1 answer

How to add p-values (at a y position less than 1) to a ggplot with a logarithmic axis

I am having trouble adding p-values to a ggplot when the axis is logarithmic and the values to be plotted are all well below 1. It seems that no matter where I tell the function to put the p-value, it always puts it at or above 1, which often ruins…
tlbello
3
votes
1 answer

difference of summary between sklearn and statsmodels OLS

The goal is to detect and fix why the report from my sklearn "summary" implementation does not match the results of statsmodels OLS. The only thing that matches is the beta coefficients. import pandas as pd import numpy as np from…
PeCaDe
3
votes
1 answer

Generate correlation matrix with specific columns and only with significant values in corrplot

I have a data.frame database with 14 columns. I split these columns into two groups: [,1:6] and [,7:14]. df<-read.csv("http://renatabrandt.github.io/EBC2015/data/varechem.csv", row.names=1) df I would like to calculate the correlation between…
wesleysc352
3
votes
3 answers

Finding p-value and z statistics along with the OLS Linear regression

I could find the coefficients and intercepts from linear regression but am unable to find a suitable method to get the p-value and z value for the respective variable trend. Additionally, I am not able to find a method to save the output results in Excel format.…
Lalantra
3
votes
2 answers

Plotly box p-value significant annotation

I have started to use and love plotly boxplots to represent my data. However, I struggle to find a way to contrast the two groups. Is there a way to introduce a statistical significance comparison between the data when using Plotly? I would…
3
votes
1 answer

Different adjusted p values from the same LMM after adjusting via sjPlots tab_model and emmeans contrast function?

I guess I'm missing some basic knowledge and I may be overlooking something important here... Background: I have a dataset in which animals from 4 different groups (1 control and 3 treatment groups) underwent grip strength testing. Each trial…
jan
3
votes
1 answer

How to perform two-sample unequal sized t-test for two np.array in python?

I have two numpy arrays with different sizes that I would like to run a t-test on directly in python to see the p-value.
Stack_Protégé
3
votes
2 answers

Error in .subset2(x, i, exact = exact) : subscript out of bounds in R

I am trying to loop over specific numeric columns from a dataframe; the goal is to extract correlations and p-values using the "cor.test" function. The correlation consists of calculating the linear relationship of one categorical variable composed of 0 and…
AlSub
3
votes
3 answers

How to plot a Miami plot (GWAS) in R?

I'm looking for a function to run a Miami Plot (GWAS) which looks like this: I will have data for 2 different GWASes. To simulate the plot, you can use this dataset…
Gianmarco M
3
votes
1 answer

How to eliminate variables with p value > 0.7 before computing stepwise polynomial regression?

I am trying to run a stepwise regression using AIC (through step) with 1,400 variables, but my computer just freezes. It works if I include <300 variables (after 13 hrs of running). Is there a way to eliminate some of the variables (if p-value >.7)…