Regular expressions (regex) are one of the most useful techniques in text matching.
Questions tagged [textmatching]
78 questions
0
votes
2 answers
approx text matching and updation at same time
I have a data frame, df1, which contains a column of university names called University_name and has 500000 rows. I have another data frame, df2, which contains 2 columns, university_name and university_aliases, and has 150…

girijesh96
0
votes
2 answers
Exact Match Words 'Keys' in a Dictionary to Pandas DataFrame Column & return appropriate value
Just want to preface that this question evolved from a previous question of mine, which can be found here. I had a few follow-ups that ultimately changed the original question, so here we are.
Assume we have the following dataframe:
d = {'keywords'…

J_Win
0
votes
1 answer
Check if all letters in Small-String are contained in Large-String
I am trying to write some code to determine if the letters in a small string are contained in a larger string in R. The accuracy would then be returned as a percentage.
I found the following on StackOverflow (check if all characters of one string…

PMH123
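
A minimal sketch of the letter-containment check this question describes, written in Python rather than the R the asker uses. It assumes that repeated letters must be matched once each and that case is ignored; the function name `letter_coverage` is hypothetical and not taken from the question.

```python
from collections import Counter


def letter_coverage(small: str, large: str) -> float:
    """Return the percentage of characters in `small` that `large` can supply."""
    small_counts = Counter(small.lower())
    large_counts = Counter(large.lower())
    total = sum(small_counts.values())
    if total == 0:
        # Assumption: an empty small string counts as fully contained.
        return 100.0
    # Each character in `small` is matched at most as often as it occurs in `large`.
    matched = sum(min(n, large_counts[ch]) for ch, n in small_counts.items())
    return 100.0 * matched / total


if __name__ == "__main__":
    print(letter_coverage("abcd", "xxabcxx"))
```

If repeated letters should count only once, comparing `set(small)` against `set(large)` would be the simpler variant.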
0
votes
1 answer
Comparing text: what is the best approach
I have a database which stores sequential versions of a text file. What I would like to do is allow the user to call up any two historical files (e.g. version 1 and version 4) and compare them for differences:
Spelling changes
New words / new…

DrBorrow
0
votes
1 answer
Is it possible to use Knuth-Morris-Pratt Algorithm for string matching on text to text?
I have KMP code in PHP which can do string matching between a word and a text. I wonder if I can use the KMP algorithm for string matching between one text and another. Is it possible or not? And how can I use it for finding the matching of the string between 2…

Hikaru18
0
votes
1 answer
Extracting specific patient IDs from email subject line
I'm looking to extract patient IDs from an email subject line. I'm working with two data frames: One has output from an SQL database (contains email subject line) and the other has patient information (hospital name and patient ID).
I want to use…

j.reyes
0
votes
3 answers
mySQL problems with text matches using IN statement
I have a largish table (2 million rows), with a column containing text identifiers (these are Latin names of species, Homo_sapiens, Tyranosaurus_rex, etc.)
I have another table containing Latin names and "common" names of species, and I can query…

user2667066
0
votes
5 answers
How to use LIKE in a query to find multiple words?
I have a cust table
id  name           class  mark
1   John Deo Matt  Four   75
2   Max Ruin       Three  85
3   Arnold         Three  55
4   Krish Star HN  Four   60
5   John Mike      Four   60
6   Alex John      Four   55
I would…

user41048
0
votes
2 answers
Javascript/Jquery: How do i alert labels text that contains my search?
Hi, I've searched and found different things based loosely around this, and some more closely tied, but I haven't been able to devise a workable solution for the problem I'm having:
Summary:
I am looking to create a view in a specific way…

harrison
0
votes
1 answer
Best way to match 2 text documents
I'm trying to build software that matches 2 text documents intelligently, checking how much the text matches, not like DIFF.
I have searched quite a bit on Google, and I found 2 things: Graph and TFIDF.
But I'm confused between…

Akshay Chordiya
0
votes
2 answers
Perl module or code for finding overlapping region of two strings
I have two strings.
They are not substrings of each other but there is an overlapping region between them.
my $str1 = "AAAAAAAAAABBBBBBBBCC";
my $str2 = "BBBBBBBBCCZZZZZZZZZZ";
I want to find this overlapping region.
"AAAAAAAAAABBBBBBBBCC"
…

Morteza
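
A minimal sketch of one way to find such an overlapping region, written in Python rather than the Perl the asker uses. It assumes the overlap is a suffix of the first string that is also a prefix of the second, as in the two sample strings; the function name `suffix_prefix_overlap` is hypothetical.

```python
def suffix_prefix_overlap(left: str, right: str) -> str:
    """Return the longest suffix of `left` that is also a prefix of `right`."""
    # Try the longest candidate first so the first hit is the maximal overlap.
    for size in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:size]):
            return right[:size]
    return ""


if __name__ == "__main__":
    str1 = "AAAAAAAAAABBBBBBBBCC"
    str2 = "BBBBBBBBCCZZZZZZZZZZ"
    print(suffix_prefix_overlap(str1, str2))
```

This brute-force scan is quadratic in the worst case, which is fine for short strings. An overlap in the middle of both strings would need a longest-common-substring search instead.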
0
votes
1 answer
Update information to my column in awk
I have two files, each with two columns and many rows. I need to take the updated information from the second file and overwrite it in the first file.
For example:
File1:  File2:
A 1     B 7
B 2     C 8
C 3     D 9
D 4     E 10
E 5     H 1
F 6 …

Geroge
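
A rough sketch of the kind of keyed update this question describes, in Python rather than awk. The excerpt is truncated, so the intended behavior is an assumption: rows in the first file take the value from the second file when the key matches, and keys that appear only in the second file are ignored. The function name `update_rows` and the sample rows are hypothetical.

```python
def update_rows(rows1, rows2):
    """Replace values in rows1 with values from rows2 where the keys match."""
    updates = dict(rows2)  # key -> new value from the second file
    # Keep the order of the first file; fall back to the old value if no update exists.
    return [(key, updates.get(key, value)) for key, value in rows1]


if __name__ == "__main__":
    # Hypothetical sample rows, each a (key, value) pair as read from a two-column file.
    file1 = [("A", "1"), ("B", "2"), ("C", "3")]
    file2 = [("B", "7"), ("C", "8"), ("H", "1")]
    for key, value in update_rows(file1, file2):
        print(key, value)
```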
0
votes
1 answer
How can i match the text file with some keywords in other file
I have one text file with some data. Now I have a list of keywords in another text file.
I want to do the matching so that I can find out how many keywords from the second text file are in the first file.
I am not a big programmer, so is…

Mirage
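
A minimal sketch of the keyword count this question asks about. It assumes single-word keywords, case-insensitive matching on whole words, and that both files have already been read into memory; the function name `count_keywords_present` is hypothetical.

```python
import re


def count_keywords_present(text, keywords):
    """Return how many keywords occur in `text`, plus the keywords that were found."""
    # Split the text into lowercase words once, so each keyword lookup is cheap.
    words = set(re.findall(r"\w+", text.lower()))
    found = [kw for kw in keywords if kw.lower() in words]
    return len(found), found


if __name__ == "__main__":
    sample_text = "The quick brown fox jumps over the lazy dog"
    sample_keywords = ["fox", "cat", "Quick"]
    print(count_keywords_present(sample_text, sample_keywords))
```

Multi-word keywords would need a substring or phrase search instead of the word-set lookup used here.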
-1
votes
1 answer
Match all kind of price in a gives text using regex
I am using the following regex for the kind of text below
Regex: [\s](rs|price)?([\s.]*\d+[\s\d.]*)(pkg|k|(?:la(?:c|kh|k)|crore|cr)s?|l)[\s.]
Text:
65 lac this is 55 lac. and more price 100 lac. randome text to test price45 crs. and 65 cr and to…

iamabhaykmr
-1
votes
1 answer
Unstructured Text/Number merge
I am trying to match fields in 2 separate datasets. They are both address fields. One data set may contain something like "532 Sheffield Dr" and the other may contain only "Sheffield Dr". Another example is "US21 Ramp and Hays RD" with "US 21",…

Cyclops