Questions tagged [regression]

Regression analysis is a collection of statistical techniques for modeling and predicting one or multiple variables based on other data.

Wiki

Regression is a common applied statistical technique and a cornerstone of machine learning. Various algorithms and software packages can be used to fit and use regression models.

In other words, regression is a statistical measure that attempts to determine the strength of the relationship between one dependent variable (usually denoted by Y) and a series of other changing variables (known as independent variables). Typically the dependent variables are modeled with probability distributions whose parameters are assumed to vary (deterministically) with the independent variables.

Tag usage

Questions on regression should be about implementation and programming problems, not about the statistical or theoretical properties of the technique. Consider whether your question might be better suited to Cross Validated, the StackExchange site for statistics and machine learning.

Read more:

9532 questions

votes

9 answers

Best approach to what I think is a machine learning problem

I am wanting some expert guidance here on what the best approach is for me to solve a problem. I have investigated some machine learning, neural networks, and stuff like that. I've investigated weka, some sort of baesian solution.. R.. several…

machine-learning modeling neural-network classification regression

asked Feb 07 '09 at 00:45

Kirby

votes

2 answers

XGBoost Best Iteration

I am running a regression using the XGBoost Algorithm as, clf = XGBRegressor(eval_set = [(X_train, y_train), (X_val, y_val)], early_stopping_rounds = 10, n_estimators = 10, …

python-3.x machine-learning regression xgboost

asked Aug 21 '18 at 19:12

Alessandro Ceccarelli

1,775
5
21
41

votes

2 answers

How to restrict output of a neural net to a specific range?

I'm using Keras for a regression task and want to restrict my output to a range (say between 1 and 10) Is there a way to ensure this?

python neural-network keras regression

asked Apr 19 '18 at 01:13

megan adams

votes

1 answer

Do dynlm and dlm have same mathematical expressions?

I am currently using dynamic linear regression (dynlm) for my analysis. However, I do also find another model called dynamic linear model (dlm). I find that dlm has an official mathematical expression by West and Harrison (1989) and everywhere.…

r dynamic regression mathematical-expressions

asked Nov 01 '17 at 10:53

Eric

votes

1 answer

What standard errors are returned with predict.glm(..., type = "response", se.fit = TRUE)?

I am going to fit the model on the data provided in this excellent example on how to compute the 95% confidence interval for the response, after performing a logistic regression: foo <- mtcars[,c("mpg","vs")]; names(foo) <- c("x","y") mod <- glm(y ~…

r statistics regression glm confidence-interval

asked Jun 14 '17 at 03:23

Alex

15,186
15
73
127

votes

2 answers

R - Plm and lm - Fixed effects

I have a balanced panel data set, df, that essentially consists in three variables, A, B and Y, that vary over time for a bunch of uniquely identified regions. I would like to run a regression that includes both regional (region in the equation…

r regression plm

asked Apr 26 '17 at 14:13

Jasper

votes

1 answer

How does R handle ordinal predictors in lm()?

As I understand it, when you fit a linear model in R using a nominal predictor, R essentially uses dummy 1/0 variables for each level (except the reference level), and then giving a regular old coefficient for each of these variables. What does it…

r statistics regression linear-regression

asked Jan 30 '17 at 19:18

MissMonicaE

votes

5 answers

Solve best fit polynomial and plot drop-down lines

I'm using R 3.3.1 (64-bit) on Windows 10. I have an x-y dataset that I've fit with a 2nd order polynomial. I'd like to solve that best-fit polynomial for x at y=4, and plot drop-down lines from y=4 to the x-axis. This will generate the data in a…

r plot regression solver

asked Jan 17 '17 at 00:49

jeffgoblue

votes

1 answer

Scikit-Learn SVR Prediction Always Gives the Same Value

I'm about to predict IMDB score (film rate) using Support Vector Regression in Scikit-Learn. The problem is it always gives the same prediction result for every input. When i predict using data training, it gives various result. But when using data…

python machine-learning scikit-learn regression svm

asked Dec 10 '16 at 01:38

Kadek Dwi Budi Utama

votes

1 answer

How to compute standard error from ODR results?

I use scipy.odr in order to make a fit with uncertainties on both x and y following this question Correct fitting with scipy curve_fit including errors in x? After the fit I would like to compute the uncertainties on the parameters. Thus I look at…

python numpy scipy regression curve-fitting

asked Dec 07 '16 at 22:54

Ger

9,076
10
37
48

votes

1 answer

Why is bam from mgcv slow for some data?

I am fitting the same Generalized Additive Model on multiple data sets using the bam function from mgcv. While for most of my data sets the fit completes within a reasonable time between 10 and 20 minutes. For a few data sets the run take more than…

r performance regression gam mgcv

asked Nov 24 '16 at 13:10

unique2

2,162
2
18
23

votes

2 answers

Polynomial regression in spark/ or external packages for spark

After investing good amount of searching on net for this topic, I am ending up here if I can get some pointer . please read further After analyzing Spark 2.0 I concluded polynomial regression is not possible with spark (spark alone), so is there…

machine-learning regression apache-spark-mllib

asked Aug 10 '16 at 13:58

sourabh

votes

1 answer

ggplot2: add regression equations and R2 and adjust their positions on plot

Using df and the code below library(dplyr) library(ggplot2) library(devtools) df <- diamonds %>% dplyr::filter(cut%in%c("Fair","Ideal")) %>% dplyr::filter(clarity%in%c("I1" , "SI2" , "SI1" , "VS2" , "VS1", "VVS2")) %>% …

r ggplot2 regression facet

asked May 28 '16 at 03:42

shiny

3,380
9
42
79

votes

2 answers

Naming explanatory variables in regression output

Each one of my variables is a list on its own. I am using a method found on another thread here. import numpy as np import statsmodels.api as sm y = [1,2,3,4,3,4,5,4,5,5,4,5,4,5,4,5,6,5,4,5,4,3,4] x = [ …

python regression statsmodels

asked Apr 12 '16 at 01:08

aspiringcoderzzz

votes

1 answer

Compute a kernel ridge regression in R for model selection

I have a dataframe df df<-structure(list(P = c(794.102395099402, 1299.01021921817, 1219.80731174175, 1403.00786976395, 742.749487463385, 340.246973543409, 90.3220586792255, 195.85557320714, 199.390867672674, 191.4970921278, 334.452413539092,…

r regression model-comparison

asked Oct 29 '15 at 14:21

SimonB

Prev 1 2 3

…

99 100 Next