Questions tagged [textmatching]


78 questions
1
vote
1 answer

Matching words with more than two number of words in different data frame in r

I have two data frames DF1 and DF2 like this. ID = c(1, 2, 3, 4) Issues = c('Issue1, Issue4', 'Issue2, Issue5, Issue6', 'Issue3, Issue4', 'Issue1, Issue5') Location = c('x', 'y', 'z', 'w') Customer = c('a', 'b', 'c', 'd') DF1 = data.frame(ID,…
1
vote
0 answers

Efficiently deduplicate text matching

struct Text { words: Vec, ... } struct Input { words: Vec, ... } I have a text processing application with multiple steps. During one of the steps, I run JaroWinkler between each word of text and input, pick the…
Gurwinder Singh
1
vote
2 answers

Python Text similarity and matching - increase weighting when terms are together

I have two columns in pandas which contain a sequence of terms, and my objective is to find the entry from column B which is the closest match for each entry in column A. I have used TF-IDF to find the similarity between the two columns,…
user1058210
1
vote
0 answers

Text matching order by relevance

In MySQL, is there a way to order the results by the same set of rules that is used for filtering them? E.g. if we have a name field and input from the user, then I would like to filter and sort the records as follows: if we match the input…
HBalyan
1
vote
1 answer

Can I set array in class from inside action listener? JAVA

Basically, all I need to do is get array1 and array2 from each of my button listeners, and I have no clue how to do this as it is. Right now the TextMatch button doesn't work because array1 and array2 are empty. Is there a way to set them from the…
1
vote
1 answer

Checking if the value in various columns somehow match the value in one specific column

I have a huge database and I want to check whether the values (different words) in the range H:V match what's in column D; only a single word would be matched in this case. Here is what it would look like: Columns A B C D E F G H I J K L M N O ... Rows 1 …
Beavis855
1
vote
3 answers

text matching - unstructured data to structured data - in SAS or R

I need to know how to map unstructured data to structured data. I have a variable that holds customers' addresses, which include their cities. The name of a city, for example DELHI, can appear as "DELHI", "DEHLI", "DILLI", "DELI" and I need to…
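One way to approach this kind of spelling-variant cleanup is fuzzy lookup against a list of canonical names. The sketch below is only an illustration, not an answer from the thread: it uses Python's standard `difflib` rather than SAS or R, and the reference list `CANONICAL_CITIES`, the helper `normalize_city`, and the `cutoff=0.5` threshold are all assumptions made for the example.

```python
from difflib import get_close_matches

# Hypothetical reference list of correctly spelled city names.
CANONICAL_CITIES = ["DELHI", "MUMBAI", "CHENNAI"]

def normalize_city(raw, cutoff=0.5):
    """Map a raw spelling to the closest canonical city, or None if nothing is close.

    cutoff is an assumed similarity threshold (0..1); tune it on real data.
    """
    cleaned = raw.strip().upper()
    matches = get_close_matches(cleaned, CANONICAL_CITIES, n=1, cutoff=cutoff)
    return matches[0] if matches else None

# The four spellings quoted in the question.
variants = ["DELHI", "DEHLI", "DILLI", "DELI"]
normalized = {v: normalize_city(v) for v in variants}
print(normalized)
```

A lower cutoff catches more misspellings but also raises the chance of mapping an unrelated string to a city, so it is worth checking the unmatched and borderline cases by hand.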
0
votes
2 answers

Matching partial file names from an excel table and copying all matching files to a subfolder inside the source folder

I'm trying to match using partial file names, then copy all matching files to a subfolder within the source folder. This is another user's pic, but I'm trying to use "TPS" from column D to copy the "TPS report" file into a new folder called "found…
0
votes
1 answer

How to return the best-matching value via SequenceMatcher

I have to match a product's category name returned from an API response against the product category names in a database. For example: api_category = "packing tape", category names from DB = ["packing material", "packaging equipment"] from difflib import…
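For reference, a minimal sketch of picking the highest-scoring candidate with `difflib.SequenceMatcher`, using the two example values quoted in the question. The helper name `best_match` and the variable `db_categories` are made up for illustration; the asker's own code is truncated above and may differ.

```python
from difflib import SequenceMatcher

def best_match(query, candidates):
    """Return (candidate, ratio) for the candidate most similar to query."""
    scored = [(c, SequenceMatcher(None, query, c).ratio()) for c in candidates]
    # max() keeps the first candidate on ties, so list order acts as the tie-break.
    return max(scored, key=lambda pair: pair[1])

api_category = "packing tape"
db_categories = ["packing material", "packaging equipment"]

match, score = best_match(api_category, db_categories)
print(match, round(score, 2))
```

`ratio()` is a character-level similarity between 0 and 1, so it rewards shared substrings rather than shared meaning; whether that is good enough depends on how the category names are worded.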
0
votes
0 answers

text matching in quora dataset

I designed a Transformer-based text matching model on the Quora dataset, but why are the F1 values of my Transformer-based models very low, around 70%? For the reproduced ESIM model, the F1 value is only about 70% when the encoder…
alanlee
0
votes
1 answer

keywords matching between dictionary values as list and pandas column

Let's say I have a dataframe df with a column named news_text: news_text lebron james is the great basketball player. leonardo di caprio has won the oscar for best actor avatar was directed by steven speilberg. ronaldo has resigned from manchester…
Bits
0
votes
0 answers

exact keywords matching of dictionaries values with pandas column

I have a pandas dataframe with a column named TEXT that consists of strings: TEXT tom hardy played as bane in movie called dark knight rises. will smith created the controversy in oscars 2023 famous movie actress emily blunt plays opposite football star…
Learner
0
votes
2 answers

Extract values from raster matching csv and raster filenames

I have a folder with many csv files. Each file has several columns as well as lat and long columns. Another folder has many rasters in tif format. The .csv files are named based on Julian date (e.g. 251.csv), and so are the rasters (e.g. 251.tif). I…
Ilaria
0
votes
4 answers

Match if variable has WORD repeated more than once

I have this variable: >echo $br_name srxa wan-a1 br-wan3-xa1 0A:AA:DD:C1:F1:A3 ge-0.0.3 srxa wan-a2 br-wan3-xa2 0A:AA:DD:C1:F2:A3 ge-0.0.3 I am trying to create a conditional that detects whether ge-0.0.3 appears more than once in my…
nickcrv06
0
votes
1 answer

If text match in range conditional formula Google Sheets error

I'm trying to match text in a range in Google Sheets. Basically, I'm using this formula: =IF(REGEXMATCH(H2:M2, "Hi"), "Yes", "No") But I'm getting this error:
user16239103