I have the following data in an R data frame:
DF <- structure(list(ID = c("VVC-110", "VVC-111", "VVC-111", "VVC-112",
"VVC-113"), Add = c("255 3RD FLOOR A SQUARE PLOT NO 10 POCKET 4 SECTOR 11 ",
"7045 Liberty Ave. Gastonia, Rose Street ", "22 S. Holly St. \nWinter Garden,.",
"9416 Washington St. \nStafford, Leatherwood Circle", "466 Pawnee Street \nSicklerville,Ridgeview Court \nMundelein,.."
), State = c("Alabama", "Alaska", "Arizona ", "California ",
"Colorado"), City = c("Birmingham", "Anchorage", "Phoenix", "Los Angeles",
"Denver"), Zipcode = c(58765L, 75974L, 98052L, 89406L, 12421L
), Add_1 = c("255, 3rd FLOOR A SQUARE PLOT NO.10 POCKET 4 SECTOR 11, ",
"7045 Liberty Ave. Gastonia, Rose Street View, New", "22 S. Holly St. \nWinter Garden,.",
"9416, Washington St., \nStafford, Leather Wood", "466 Pawnee Street \nSicklerville"
), State_1 = c("Alabama", "Alaskaa", "Arizona", "California",
"Colorado"), City_1 = c("Birmingham", "Anchorage", "Phoenix",
"LosAngeles", "Den ver"), Zipcode_1 = c(58765L, 75974L, 98052L,
89406L, 12421L)), class = "data.frame", row.names = c(NA, -5L
))
Using the above data frame, I want to compute the % match between pairs of string columns, so that I can tell for how many rows the two fields/columns are likely the same.
- % of string match between Add and Add_1.
- % of string match between State and State_1.
Disclaimer: All the % values shown in the Required Output Dataframe are random and may vary based on the logic and approach.
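One possible way to get such a percentage, sketched here under assumptions rather than as the required logic: take the Levenshtein edit distance between the two strings, divide it by the length of the longer string, and subtract from 1. The helper name pct_match is hypothetical, and the normalization it applies (trimming, collapsing whitespace and newlines, ignoring case) is an assumption about what should count as "the same". It uses only adist() from base R's utils package, and the small data frame repeats the State and State_1 values from DF so the snippet runs on its own.

```r
# Sketch: percentage match via normalized Levenshtein distance (base R only).
# pct_match is a hypothetical helper name, not part of any package.
pct_match <- function(a, b) {
  # Assumed normalization: trim, collapse whitespace/newlines, ignore case
  a <- toupper(gsub("\\s+", " ", trimws(a)))
  b <- toupper(gsub("\\s+", " ", trimws(b)))
  # Edit distance between corresponding entries (row by row, not all pairs)
  d <- mapply(function(x, y) as.numeric(utils::adist(x, y)), a, b,
              USE.NAMES = FALSE)
  longest <- pmax(nchar(a), nchar(b))
  # Two empty strings are treated as a full match
  ifelse(longest == 0, 100, round(100 * (1 - d / longest), 2))
}

# State columns copied from DF above
states <- data.frame(
  State   = c("Alabama", "Alaska", "Arizona ", "California ", "Colorado"),
  State_1 = c("Alabama", "Alaskaa", "Arizona", "California", "Colorado"),
  stringsAsFactors = FALSE
)
states$State_Match <- pct_match(states$State, states$State_1)
print(states)
```

The same call can be applied to the other pairs, for example DF$Add_Match <- pct_match(DF$Add, DF$Add_1). A row could then be counted as "likely the same" when its score exceeds a threshold of your choosing; the right cutoff depends on the data and is not something this sketch decides.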