I'm a beginner in the field of artificial intelligence... I can use GATE or any other Natural Language Processing but I don't have an answer for this :
Do you know how to evaluate how 2 sentences can be close? even with a large data set?
Do you have any recommendations? I can use the number of permutation, the lengh, the number of tokens, metaphone them, etc... but I don't know what test I should use.
My goal is : - "Hello Jarvis" - "Hello Romain, how are you"
- "Hello arvis"
- "Hello Romain, how are you"
- "Hello mister Swift"
- I don't know what you are expecting, is this like "Hello Jarvis" ?
- Yes
- Ok, Hello Romain, How are you?
- "Hello mister swift, how are you?"
- I don't know what are you expecting.
Exemple
By 1, 2, 3 or n is just an example of similarity scale.
Basic
- "Hello IA" is closed to
- "Hello IA" by 0
- "Hello AI" by 1
- "Hello Jarvis" is closed to
- "Hello AI" by 2
- "Hello IA" by 2
- "Hello! mister Swift" is closed to
- "Hello AI" by 3
- "Hello IA" by 3
- "Hello Jarvis" by 2
Less Basic
- "Hello IA" is (token length, token word, grammatically, syntactically) closed to
- "Hello IA" by (0,0,0,0)
- "Hello AI" by (0,1,0,0)
- "Hello Jarvis" is closed to
- "Hello AI" by (0,2,1,1)
- "Hello IA" by (0,2,1,1)
- "Hello! mister Swift" is closed to
- "Hello AI" by (1,2,2,2)
- "Hello IA" by (1,2,2,2)
- "Hello Jarvis" by (1,2,2,2)