Questions tagged [wordbreaker]
17 questions
18
votes
2 answers
Word break in languages without spaces between words (e.g., Asian)?
I'd like to make MySQL full text search work with Japanese and Chinese text, as well as any other language. The problem is that these languages and probably others do not normally have white space between words. Search is not useful when you must…

Joe Langeway
- 300
- 2
- 8
6
votes
1 answer
Full text search - Contains plus wildcard and single quote
I have a table with a name field with this
Test O'neill 123
If I use
SELECT *
FROM table F
WHERE CONTAINS ( F.*, '"Test O''neill 123"' )
it works fine but if I use a wildcard * I get no results.
SELECT *
FROM table f
…

Gonzalo.-
- 12,512
- 5
- 50
- 82
4
votes
1 answer
Determining Word boundaries using DirectWrite
DirectWrite provides line-breaking information via IDWriteTextAnalysisSink::SetLineBreakpoints(). However I need to determine word boundaries as well, in accordance with Unicode Annex #29: Unicode Text Segmentation.
Is there a way to do this using…

bright
- 4,700
- 1
- 34
- 59
3
votes
0 answers
Full-Text Seach and stemming on multilanguage column
I have a table with a column that contains data in different languages, like that:
Id Text Language
1 name en
2 names en
3 имя ru
4 nom fr
I need Full-text search for this multilingual column, but FTS is…

AtlasPromotion
- 101
- 6
3
votes
2 answers
Can I define which characters are allowed to 'break' a word?
I'm showing up veeeery long URLs in my Safari extension. Obviously, they can't fit on a single line. Currently, word breaking rules make it so most URLs are on two lines: the first one is rather short and ends with the ? symbol, and the other is…

zneak
- 134,922
- 42
- 253
- 328
3
votes
3 answers
sql server full text search: what are the default word breakers in English?
Where can I find the list of default word breakers for English in sql server full text search?

trs
- 2,454
- 13
- 42
- 61
3
votes
1 answer
SQL2008 fulltext index search without word breakers
I are trying to search an FTI using CONTAINS for Twitter-style usernames, e.g. @username, but word breakers will ignore the @ symbol. Is there any way to disable word breakers? From research, there is a way to create a custom word breaker DLL and…

Andrew Tibbetts
- 2,874
- 3
- 23
- 28
2
votes
1 answer
Is there such a thing as third party SQL Server word breaker for Hungarian language?
I want to use CONTAINS on a fulltext index and use FORMSOF(...) on Hungarian data.
Is it possible? I KNOW it is not supported by default in SQL Server.

TDaver
- 7,164
- 5
- 47
- 94
2
votes
2 answers
Does anyone know if there is a list of German words used in the Fulltext word breaker?
SQL Server Full Text Search uses language specific Word Breakers.
For the German language this is used to break/split words including compound words. However, it appears not all known compound words are included in the Word Breaker. I would like to…

Coolcoder
- 4,036
- 6
- 28
- 35
2
votes
0 answers
How do I use word breaking in ReportLab?
I use MWLIB and ReportLab to convert MediaWiki articles to PDF.
I got this really long link that for whatever reason causes the sentence above to have really long spaces between the words. I think the link makes such a long word that it just draws…

user1531921
- 1,372
- 4
- 18
- 36
1
vote
2 answers
Full text search and domain names
I stumbled accross an issue with full text search in columns which may contains domain names on a Microsoft SQL Server 2012.
A table cell containing example.com is added to the full text catalog by the term example and com. The latter makes it…

sk904861
- 1,247
- 1
- 14
- 31
1
vote
1 answer
How can I use the word breaker from Microsoft SQL Server 2005 in my stand-alone C# program
I am writing a program which will query a full text search index on Microsoft SQL Server 2005. The data being indexed are large blobs of text (multiple lines).
I need to extract only those lines of text which contain the search words for display…

Nikki Locke
- 2,759
- 6
- 29
- 53
0
votes
1 answer
SQL Server Word Breaker differences between versions (full text search)
I have two different environments, a LIVE and a STAGING environment.
These are running SQL Server 2016 web edition but there are slight differences in the version of both the SQL host itself, and some full text components, namely the word breaker…

Matthew Abbott
- 60,571
- 9
- 104
- 129
0
votes
1 answer
Help Executing this Java Application
I received this source code: http://www.sbbic.org/split.zip under the Apache 2.0 License: http://www.apache.org/licenses/LICENSE-2.0
It came to me from the author without documentation or support (the author doesn't have the time right now, he is…

Nathan
- 1,483
- 3
- 18
- 41
0
votes
1 answer
ignore "." as word breaker in Sql Server 2008
Is there a way to ignore . as a word breaker for full text in SQL Server 2008.
The main purpose of this is I want to be able to seach ip addresses in fulltext search.

Fulya
- 1
- 1