Questions tagged [hypothesis-test]

Functions used to choose between competing hypotheses about one or more probability distributions. For statistical questions, please use stats.stackexchange.com.

Common hypothesis tests include the one-sample and paired t-test for means, the z-test, which approximates the t-test for large samples, F-test for differences in variance, and Chi-square test for independence, and Fisher's exact test for differences in proportion.

Please note that this tag is totally different from , which refers to software testing.

349 questions
0
votes
0 answers

Can you use hypothesis testing for feature selection?

How do you apply hypothesis testing to your features in a ML model? Let say for example that I am doing a regression task and I want to cut some features (once I have trained my model) to increase performance. How do I apply hypothesis testing to…
0
votes
0 answers

Hypothesis test for pixel matrix equivalence

I'm going through this tutorial and I'm reworking it in R and I was able to do so. I tried to extend this tutorial by thinking of ways to objectively detect differences in images; e.g., can I run a hypothesis test that takes the pixel matrices of…
user1329307
  • 117
  • 2
  • 16
0
votes
1 answer

linearHypothesis equivalent for ols command (rms package) in R

I am trying to use "linearHypothesis" function from "car" package to test coefficients of a model estimated with "ols" from "rms" package. The function works with "lrm" objects but not with "ols" objects. Have you got any alternatives? I know that…
Giovanni
  • 1
  • 2
0
votes
1 answer

Tableau error "All Fields must be aggregate or constant" when invoking TabPy SCRIPT_REAL

I am calling a TabPy server via a calculated field in a Tableau worksheet to run a hypothesis test: does the rate of Bookings vary significantly by Group? I have a table such as: Group Bookings 0 A 1 1 A 0 3998 …
C8H10N4O2
  • 18,312
  • 8
  • 98
  • 134
0
votes
1 answer

Chi-Squared test for independence with one data column as integer and other as object?

I'm trying to perform hypothesis test for Independence in python but my one data column(finances) has float data type and other column(gender) has object data type. I have created the following hypothesis: Ho : finances are gender independent Ha :…
0
votes
0 answers

I need to create hypothesis test on series of question

I need to perform hypothesis test on the following questions: Is saving money (finances) gender dependant? Is there any differences in money saving (finances) between people from city or village? Do Women have a Healthier Lifestyle than Men? Are…
Yogesh
  • 67
  • 1
  • 11
0
votes
1 answer

How to write a loop to simulate sampling distribution of t-statistic under null using a true model?

What I currently have a problem with this problem is understanding how to fimulate 10,000 draws and fix the covariates. Y X1 X2 X3 1 4264 305.657 7.17 0 2 4496 328.476 6.20 0 3 4317 317.164 4.61 0 4 …
0
votes
0 answers

How to verify that model output & observed data distribution are similar?

Looking for advice on how to determine wether my model output data distribution is similar (and if so, then how similar) to the observed datasets distribution. Basically I have a GBM model with mean reversion that provides seemingly good results,…
MRK_S
  • 9
  • 1
  • 3
0
votes
0 answers

How to calculate p-value comparing bootstrap-based predicted probabilities and observed probabilities

Given the sample data dat below, I would appreciate any help to: (1) check whether my approach below following boot's vignette to calculate bootstrap-based predictions from logistic regression is correct and help correct if there is any mistake in…
Krantz
  • 1,424
  • 1
  • 12
  • 31
0
votes
0 answers

What do you do if the sample size for an A/B test is larger than the population?

I have a list of 7337 customers (selected because they only had one booking from March-August 2018). We are going to contact them and are trying to test the impact of these activities on their sales. The idea is that contacting them will cause them…
0
votes
3 answers

Hypothesis test - Multiple Regression

If β1= β2=0 and β3= 0 is null hypothesis. If β1= β2=0 holds and β3= 0 fails to hold, then what can you conclude? Does it mean there is a strong correlation between x1 and x2? or There is a linear relationship between the outcome variable(Y) and x3?
Bhuneshwer
  • 577
  • 1
  • 9
  • 26
0
votes
1 answer

Beginner Question: Effect of transforming the targets in regression model

I am new to datascience, and I was working with Keras LSTM but with no success. And calculating the r2 score gets 0.0 every time. So after some googling I found the below scikit-learn example [1] and I have some as a novice in datascience I am…
Ramzy
  • 171
  • 2
  • 14
0
votes
0 answers

optimal sample size for control/test group for T-test

Recently we launched a feature on one of our website pages.I have all the historical data worth 6 months about the page including impressions and CTR .It has been 20 days since we launched the feature and now we want to know if there is any…
M_Nayak
  • 1
  • 3
0
votes
0 answers

why does z-test indicate significantly different for 2 distributions that looks similar (hypothesis testing)

I have two distributions below (Kaggle dataset: Rossman sales) that look similar visually: Sales on normal days & sales on school holiday. However, they seems to fail z-test (hypothesis testing) in Python - why is that so? How should I perform the…
jasonlcy91
  • 454
  • 6
  • 14
0
votes
1 answer

ptest on groupby() python

I've a dataframe as below: Region Votes A 23 B 26 A 32 B 46 A 32 B 24 I calculated mean of votes for region A and B by following code…
Vro
  • 69
  • 1
  • 9