Questions tagged [phrase]
210 questions
4
votes
1 answer
How to validate a chapter heading for text using fuzzy logic in Java
I need a solution for identifying incorrect chapter headings in a book.
We are developing an ingestion system for books that does all sorts of validation, like spell-checking and offensive-language-filtering. Now we'd like to flag chapter…

Jesse Harris
- 73
- 5
4
votes
3 answers
How to split a string into an array of suffixes?
What is the most efficient way to split a string into an array of its suffixes?
Say you have the string "The weather is nice", I want to generate an array of its suffixes as such:
[0] = "nice"
[1] = "is nice"
[2] = "weather is nice"
[3] = "the…

ponycat
- 189
- 1
- 2
- 12
4
votes
1 answer
Is it possible to find exact matches only when searching for a phrase in Lucene.net?
I know similar questions have already been asked, but I cannot find any answers that suit what I am looking for.
Basically, I want to search for phrases and only return matches which have that exact phrase only and not partial matches.
e.g. A…

user1662409
- 157
- 3
- 12
3
votes
1 answer
How lucene phrases works without position and offset index?
Lucene allow you to index terms with position and offsets, but even without it is possible to use phrase search. So how lucene can calculate word order in index without this information?

yura
- 14,489
- 21
- 77
- 126
3
votes
1 answer
PowerQuery - Find a phrase and return a word containing that phrase
I have a column in which the phrase "23PL..........") occurs in various random places (sometimes at the beginning, sometimes at the end, sometimes in the middle.) The phrase 23PL is followed by a random string of 14 characters.
I would like to…

DamianD
- 43
- 5
3
votes
2 answers
Phrase extraction with Spacy
Does spacy have some APIs to do phrase* extraction as one would do when using word2phrase or the Phrases class from gensim? Thank you.
PS. Phrases meant as collocations in Linguistics.

user_1177868
- 414
- 4
- 18
3
votes
3 answers
print bigrams learned with gensim
I want to learn bigrams from a corpus using gensim, and then just print the bigrams learned. i've not seen an example that does this.
help appreciated
from gensim.models import Phrases
documents = ["the mayor of new york was there", "human computer…

Aviad Rozenhek
- 2,259
- 3
- 21
- 42
3
votes
1 answer
regex for phrase searching
I have to search phrase in big string may be length of 500 or 600 or greater now I have to check whether phrase exist or not
phrase = "Lucky Draw"
big string1 = "I'm looking for Lucky Draw a way to loop through the sentences and check"
big…

Usman Asif
- 320
- 2
- 12
3
votes
1 answer
c# iTextSharp Combine two 'Chunk'
I need to know if its possible to combine two Chunk from iTextSharp
Phrase phrase = new Phrase();
var text1 = new Chunk("hello");
var text2 = new Chunk("world");
phrase.add();
I want to combine text1 and text2, and then add them to a phrase.

Raul Pechero
- 101
- 1
- 6
3
votes
1 answer
how to find documents that only contain searched words in `solr`
For example, I have a solr collection that contains documents with a field called "key_phrase".
I know it is easy to find all documents that contain all the searched words in a search query. (i.e. using mm=100% in edismax)
However, what I am asking…

Gloria Chen
- 31
- 2
3
votes
0 answers
Elasticsearch - Fuzzy, phrase, completion suggestor and dashes
So I have been asking separate questions trying to achieve the search functionality I would like to achieve but still falling short so thought I would just ask people what they suggest for the optimal Elasticsearch settings, mappings, indexing and…

Robert Garcia
- 416
- 3
- 16
3
votes
1 answer
Match lucene entire field exact value
I'm creating a Lucene 4.10.3 index.
I am using he StandardAnalyzer.
String indexpath="C:\\TEMP";
IndexWriterConfig iwc=newIndexWriterConfig(Version.LUCENE_4_10_3,new StandardAnalyzer(CharArraySet.EMPTY_SET));
Directory dir =…

Rameshwar Nagpure
- 174
- 1
- 12
3
votes
1 answer
Searching phrases in Lucene
Could somebody point me to an example how to search for phrases with Lucene.net?
Let's say I have in my index a document with field "name", value "Jon Skeet". Now I want to be able to find that document when searching for "jon skeet".

mmiika
- 9,970
- 5
- 28
- 34
3
votes
0 answers
Is elasticsearch phrase aggregation exist?
I know that with ES you can find the top most terms in a document. But how about phrase count? I checked the internet and couldn't find any answer. Is there a way to find top most phrases in a document with ES?
for example; top terms aggregation…

Fatih Aktepe
- 571
- 2
- 10
- 19
3
votes
1 answer
Lucene phrase query with wildcards
I come up with solution to programmaticlly create query to search for phrase with wildcards using this code:
public static Query createPhraseQuery(String[] phraseWords, String field) {
SpanQuery[] queryParts = new SpanQuery[phraseWords.length];
…

Antonio Tomac
- 438
- 5
- 12