I'm generally quite new to Python, and I'm having trouble making a linear regression model. I need to make it from a training and test set from a large excel dataset (.csv).
I've split the dataset already:
import pandas as pd
import numpy as np
df = pd.read_csv('C:/Dataset.csv')
df['split'] = np.random.randn(df.shape[0], 1)
split = np.random.rand(len(df)) <= 0.75
training_set = df[split]
testing_set = df[~split]
How can I use this split data to make a linear regression model using the Mean Average Error?
Thanks.