1

I am trying to parse a prose paragraph for anything that might resemble an address. I have a database of addresses I am matching against and these are the only addresses I am interested in. I'm using a lamp server but technology specific answers aren't what I require right now. More of a question of how.

Can anyone provide ideas? Perhaps Regex? or perhaps I should use a database of cities/states etc?

Thanks.

Matt
  • 22,721
  • 17
  • 71
  • 112
Jason
  • 15,064
  • 15
  • 65
  • 105

1 Answers1

0

It looks like this question hasn't gotten answered because it's entirely unclear what the problem parameters are. If you want a more specific answer to a problem, please describe your problem more fully.

In general I would suggest aproaching a problem like this using some piece of known data ... small collection of words or formats that delieniate and address, then match on the context of those words to see if they really flesh out to a full address.

Caleb
  • 5,084
  • 1
  • 46
  • 65