Questions tagged [hypothesis-test]

Functions used to choose between competing hypotheses about one or more probability distributions. For statistical questions, please use stats.stackexchange.com.

Common hypothesis tests include the one-sample and paired t-test for means, the z-test, which approximates the t-test for large samples, F-test for differences in variance, and Chi-square test for independence, and Fisher's exact test for differences in proportion.

Please note that this tag is totally different from testing, which refers to software testing.

349 questions

votes

0 answers

Can you use hypothesis testing for feature selection?

How do you apply hypothesis testing to your features in a ML model? Let say for example that I am doing a regression task and I want to cut some features (once I have trained my model) to increase performance. How do I apply hypothesis testing to…

asked Jul 04 '19 at 17:54

Felipe Araya

votes

0 answers

Hypothesis test for pixel matrix equivalence

I'm going through this tutorial and I'm reworking it in R and I was able to do so. I tried to extend this tutorial by thinking of ways to objectively detect differences in images; e.g., can I run a hypothesis test that takes the pixel matrices of…

r matrix spatial vision hypothesis-test

asked Jul 01 '19 at 20:37

user1329307

votes

1 answer

linearHypothesis equivalent for ols command (rms package) in R

I am trying to use "linearHypothesis" function from "car" package to test coefficients of a model estimated with "ols" from "rms" package. The function works with "lrm" objects but not with "ols" objects. Have you got any alternatives? I know that…

r linear-regression hypothesis-test

asked Jun 13 '19 at 11:28

Giovanni

votes

1 answer

Tableau error "All Fields must be aggregate or constant" when invoking TabPy SCRIPT_REAL

I am calling a TabPy server via a calculated field in a Tableau worksheet to run a hypothesis test: does the rate of Bookings vary significantly by Group? I have a table such as: Group Bookings 0 A 1 1 A 0 3998 …

python python-2.7 tableau-api hypothesis-test tabpy

asked May 31 '19 at 19:20

C8H10N4O2

18,312
8
98
134

votes

1 answer

Chi-Squared test for independence with one data column as integer and other as object?

I'm trying to perform hypothesis test for Independence in python but my one data column(finances) has float data type and other column(gender) has object data type. I have created the following hypothesis: Ho : finances are gender independent Ha :…

python pandas chi-squared hypothesis-test statistical-test

asked May 17 '19 at 03:25

Bhavishya

votes

0 answers

I need to create hypothesis test on series of question

I need to perform hypothesis test on the following questions: Is saving money (finances) gender dependant? Is there any differences in money saving (finances) between people from city or village? Do Women have a Healthier Lifestyle than Men? Are…

python pandas statistics hypothesis-test

asked May 16 '19 at 21:26

Yogesh

votes

1 answer

How to write a loop to simulate sampling distribution of t-statistic under null using a true model?

What I currently have a problem with this problem is understanding how to fimulate 10,000 draws and fix the covariates. Y X1 X2 X3 1 4264 305.657 7.17 0 2 4496 328.476 6.20 0 3 4317 317.164 4.61 0 4 …

r loops linear-regression t-test hypothesis-test

asked May 06 '19 at 18:30

cookiemonster3009

votes

0 answers

How to verify that model output & observed data distribution are similar?

Looking for advice on how to determine wether my model output data distribution is similar (and if so, then how similar) to the observed datasets distribution. Basically I have a GBM model with mean reversion that provides seemingly good results,…

matlab model verification hypothesis-test

asked Mar 28 '19 at 22:44

MRK_S

votes

0 answers

How to calculate p-value comparing bootstrap-based predicted probabilities and observed probabilities

Given the sample data dat below, I would appreciate any help to: (1) check whether my approach below following boot's vignette to calculate bootstrap-based predictions from logistic regression is correct and help correct if there is any mistake in…

r permutation p-value statistics-bootstrap hypothesis-test

asked Mar 17 '19 at 14:28

Krantz

1,424
1
12
31

votes

0 answers

What do you do if the sample size for an A/B test is larger than the population?

I have a list of 7337 customers (selected because they only had one booking from March-August 2018). We are going to contact them and are trying to test the impact of these activities on their sales. The idea is that contacting them will cause them…

hypothesis-test statistical-test statistical-sampling

asked Feb 02 '19 at 21:38

datababie

votes

3 answers

Hypothesis test - Multiple Regression

If β1= β2=0 and β3= 0 is null hypothesis. If β1= β2=0 holds and β3= 0 fails to hold, then what can you conclude? Does it mean there is a strong correlation between x1 and x2? or There is a linear relationship between the outcome variable(Y) and x3?

regression hypothesis-test

asked Feb 02 '19 at 06:32

Bhuneshwer

votes

1 answer

Beginner Question: Effect of transforming the targets in regression model

I am new to datascience, and I was working with Keras LSTM but with no success. And calculating the r2 score gets 0.0 every time. So after some googling I found the below scikit-learn example [1] and I have some as a novice in datascience I am…

scikit-learn statistics lstm data-science hypothesis-test

asked Dec 23 '18 at 07:55

Ramzy

votes

0 answers

optimal sample size for control/test group for T-test

Recently we launched a feature on one of our website pages.I have all the historical data worth 6 months about the page including impressions and CTR .It has been 20 days since we launched the feature and now we want to know if there is any…

statistics sampling digital t-test hypothesis-test

asked Nov 28 '18 at 22:09

M_Nayak

votes

0 answers

why does z-test indicate significantly different for 2 distributions that looks similar (hypothesis testing)

I have two distributions below (Kaggle dataset: Rossman sales) that look similar visually: Sales on normal days & sales on school holiday. However, they seems to fail z-test (hypothesis testing) in Python - why is that so? How should I perform the…

python statistics data-science hypothesis-test

asked Nov 28 '18 at 10:29

jasonlcy91

votes

1 answer

ptest on groupby() python

I've a dataframe as below: Region Votes A 23 B 26 A 32 B 46 A 32 B 24 I calculated mean of votes for region A and B by following code…

python group-by hypothesis-test

asked Nov 16 '18 at 22:55

Vro

Prev 1 2 3

…

23 24 Next