Data here, with trendline - https://i.stack.imgur.com/iljp6.jpg
Hey everyone,
I'm tinkering around with admissions stats and I wanted to construct a model that can be used off only one variable. IE, what are your chances of getting admitted with a GPA of 3.5, or a GPA of 3.7, etc. The issue is that as the GMAT increases (x axis), the GPA increases as well. So the two variables are not independent of each other.
What I was thinking of doing was to simply multiply each data point by a factor, scaling for the difference. So at 680, I wouldn't multiply by anything, but at 730, I would reduce the GPA by 0.08.. etc. Is this the correct process, or is there a more formal approach to properly run through this data?