I am working with R
. I need to identify the predictors of higher Active trial start percentage over time (StartDateMonthsYrs
). I will do linear regression with Percent.Active
as the dependent variable.
My original dataframe is attached and my obtained Active trial start percentage over time (named Percent.Active
is presented here.
So, I need to assess whether federal sponsored trials, industry sponsored trials or Other sponsored trials were associated with higher active trial start percentage over time. I have many other variables that I wneed to assess but this is the sample of my data.
I am thinking to do many crosstabs for each variable (eg Fedral & Active then Industry & Active..etc.) in each month (may be with help of lapply
then accumulate the obtained percentages data in the second sheet then run the analysis based on that.
My code for linear regression is as follow:
q.lm0 <- lm(Percent.Active ~ Time.point+ xyz, data.percentage);summary(q.lm0)