I would like to calculate distance between two groups. I am very confused.
I have a two data sets. One is about a company and one is about employees.
I would like to find out how their age( a company in which an employee is hired and an employee) are similar or not. I think I need to standarize also..
- calcuate euclidean distance between each person and a company. (4-5 people in a company)
- calculate euclidean distance between each person and a company in industry level.
My dataset is like this:
person person_age company company_age industry
1 50 1 5 1
2 40 1 5 1
3 30 2 1 1
4 20 2 1 1
5 25 3 8 2
The following code will reproduce my data.frame:
person <- 1:5
person_age <- c(50,40,30,20,25)
company <- c(1,1,2,2,3)
company_age <- c(5,5,1,1,8)
industry <- c(1,1,1,1,2)
myData <- data.frame(person, person_age, company, company_age, industry)
Please help me. I don't mind using SAS or R. I am very confused.