Being a R user, I am learning Stata now using this resource, and am puzzled about the merge
command.
In R, I don't have to worry about merging data wrongly, because it merges everything anyway. I don't need to worry if the common columns contain any duplicates, because the Y
dataframe will merge to each of the duplicated row in X
dataframe. (using all=FALSE
in merge
)
But for Stata, I need to remove the duplicate rows from X
before proceeding to merge.
Is it being assumed in Stata that, in order for merge
to proceed, the common column in the master table must be unique?