Questions tagged [linear-regression]

for issues related to linear regression modelling approach

Linear Regression is a formalization of relationships between variables in the form of mathematical equations. It describes how one or more random variables are related to one or more other variables. Here the variables are not deterministically but stochastically related.

Example

Height and age are probabilistically distributed over humans. They are stochastically related; when you know that a person is of age 30, this influences the chance of this person being 4 feet tall. When you know that a person is of age 13, this influences the chance of this person being 6 feet tall.

Model 1

heighti = b0 + b1agei + εi, where b0 is the intercept, b1 is a parameter that age is multiplied by to get a prediction of height, ε is the error term, and i is the subject

Model 2

heighti = b0 + b1agei + b2sexi + εi, where the variable sex is dichotomous

In linear regression, user data X is modelled using linear functions Y, and unknown model parameters W are estimated or learned from the data. E.g., a linear regression model for a k-dimensional user data can be represented as :

Y = w1 x1 + w2 x2 + ... + wk xk

Reading Statistical Modeling: The Two Cultures http://projecteuclid.org/download/pdf_1/euclid.ss/1009213726

In scientific software r for statistical computing and graphics, function lm (see lm) implements linear regression.

6517 questions

votes

2 answers

How can I ignore the NA data when I do the lm function?

My question is rather simple, but I could not get it resolved after trying a lot of things. I have two data frames. >a col1 col2 col3 col4 1 1 2 1 4 2 2 NA 2 3 3 3 2 3 2 4 4 3 4 1 > b col1…

r linear-regression missing-data

asked Nov 23 '10 at 17:54

didimichael

votes

1 answer

Linear regression with only previous values in moving window

I have a huge dataset and would like to perform a rolling linear regression over a window of 60. However, I want that only the 60 previous values are considered for the linear regression. My Dataframe DF consists of following Columns: Date …

r linear-regression

asked Mar 03 '17 at 17:08

Henky

votes

1 answer

Precision_score and accuracy_score showing value error

I'm new to this machine learning and using this boston dataset for predictions. Everything except the result for precision_score and accuracy_score is working fine . This is what i have done : import pandas as pd import sklearn from…

python machine-learning scikit-learn linear-regression

asked Feb 25 '17 at 08:46

harshi

votes

1 answer

Plotting both a GLM and LM of same data

I would like to plot both a linear model (LM) and non-linear (GLM) model of the same data. The range between 16% - 84% should line up between a LM and GLM, Citation: section 3.5 I have included a more complete chunk of the code because I am not…

r ggplot2 linear-regression logistic-regression drc

asked Feb 24 '17 at 19:04

Arch

votes

0 answers

Linear regression accuracy 95%, but predicts past data

Having a pandas dataframe of 4 rows of features, I create labels for them from "forecast_col" and shift them back to the past to make prediction later: pandasdf['label'] = pandasdf[forecast_col].shift(-forecast_out) Taking all the rows except the…

python machine-learning linear-regression prediction

asked Feb 20 '17 at 09:48

Александр Нагорный

votes

2 answers

Least Squares Fit on Cubic Bezier Curve

I would like fit a cubic bezier curve on a set of 500 random points. Here's the code I have for the bezier curve: import numpy as np from scipy.misc import comb def bernstein_poly(i, n, t): """ The Bernstein polynomial of n, i as a…

python linear-regression curve-fitting bezier least-squares

asked Feb 17 '17 at 12:25

stepp0

votes

2 answers

By two combinations of predictors in linear regression in R

Suppose that I have X1,...,X14 potential predictors. Now for a given Y i want to make the OLS scheme: Y~X1+X2 Y~X1+X3 .... Y~X1+X14 .... Y~X14+X13 which is basically all the by two combinations of all the predictors. After all those regressions…

r linear-regression lm

asked Feb 12 '17 at 19:40

Hercules Apergis

votes

1 answer

How to handle missing data in machine learning?

I have a dataframe which always has missed information between 9pm of Fridays and 0am on Mondays. I'm using this data to make prediction trough linear regression algorithm, so this jump gumps up my predictions: date timestamp …

python pandas machine-learning scikit-learn linear-regression

asked Feb 06 '17 at 21:36

mllamazares

7,876
17
61
89

votes

1 answer

Gradient Descent For Mutivariate Linear Regression

Ok, so what does this algorithm exactly mean? What I know : i) alpha : how big the step for gradient descent will be. ii) Now , ∑{ hTheta[x(i)] - y(i) } : refers to Total Error with given values of Theta. The error refers to the difference…

linear-regression gradient-descent

asked Feb 02 '17 at 20:32

Rishabh Chopra

votes

1 answer

How to obtain coefficient values from Spark-MLlib Linear Regression model (Scala)?

I'd like to obtain coefficient values of Linear Regression(LR) model in Spark-MLlib. Here I use the 'LinearRegressionWithSGD' to build the model and you can find the sample from the following…

scala apache-spark linear-regression apache-spark-mllib

asked Jan 25 '17 at 07:13

Ramkumar

votes

1 answer

How to plot confidence bands for my weighted log-log linear regression?

I need to plot an exponential species-area relationship using the exponential form of a weighted log-log linear model, where mean species number per location/Bank (sb$NoSpec.mean) is weighted by the variance in species number per year…

r plot regression linear-regression lm

asked Jan 21 '17 at 02:23

Christine S

votes

3 answers

Spark ML Linear Regression - What Hyper-parameters to Tune

I'm using the LinearRegression model in the Spark ML for predicting price. It is a single variate regression (x=time, y=price). Assume my data is clean, what are the usual steps to take to improve this model? So far, I tried tuning regularization…

linear-regression apache-spark-ml hyperparameters

asked Jan 21 '17 at 01:08

gyoho

votes

0 answers

Is there a way to intersect real-valued column with a sparse column?

crossed_column is able to intersect a few sparse (categorical) columns. Is there a way to intersect a real-valued column with a sparse column in a LinearRegressor ? The mathematical meaning of this seems clear: I need different weights at continuous…

tensorflow linear-regression

asked Jan 20 '17 at 07:59

noname7619

3,370
3
21
26

votes

1 answer

Cost Function, what's the difference between sum(x) and ones(1,length(x)) *x?

I'm doing Professor Andrew Ng's Machine Learning course on Coursera. I'm trying to code the cost function. This was my first solution: J= (1/(2*m))* (ones(1,97) * (((X*theta)-y).^2 )); But it wasn't accepted, so I tried it with sum: J = 1 / (2 * m)…

matlab machine-learning octave linear-regression

asked Jan 10 '17 at 08:48

GniruT

votes

1 answer

How do I interpret the TukeyHSD output in R? (in relation to the underlying regression model)

I built a simple linear regression model with 'Score' as the dependent variable, and 'Activity' as the independent one. 'Activity' has 5 levels: 'listen' (reference level), 'read1', 'read2', 'watch1', 'watch2'. Call: lm(formula = Score ~…

r linear-regression tukey

asked Jan 07 '17 at 23:09

fannilegoza

Prev 1 2 3

…

99 100 Next