Questions tagged [full-text-indexing]

indexing for full-text search

569 questions
2
votes
0 answers

Server-less search engine using Firebase and Lunr.js or other helping libraries

Firebase does not really have the capabilities in full-text searching yet so I was wondering if Lunr.js can help (or other libraries). I'd like to know how or to create a good scheme in indexing data directly in Firebase and if Lunr.js can help me…
2
votes
1 answer

How can I set up Solr to tokenize on whitespace and punctuation?

I have been trying to get my Solr schema (using Solr 1.3.0) to create terms that are tokenized by whitespace and punctuation. Here are some examples on what I would like to see happen: terms given -> terms tokenized foo-bar -> foo,bar one2three4 ->…
claytron
  • 1,575
  • 13
  • 20
2
votes
1 answer

Indexing pdf documents

What the best way to index pdf documents? Should I index them by converting pdf documents to txt or there is a better way to index pdf files?
Ahsan Iqbal
  • 1,422
  • 5
  • 20
  • 39
2
votes
0 answers

Understanding full text index of Neo4j

On the latest version of Neo4j, I'm trying to understand how a text search can be done. I've read the manual on creating full text indexes but please help me with the following questions: When I want to do partial text matches, I would create a…
kami
  • 361
  • 3
  • 15
2
votes
1 answer

How long MYSQL takes to build FULLTEXT index text column per GB?

I have a MYISAM table in MySQL 5.6 with a text column with millions of rows. The total space taken by that table is about 40 GB most of which is in the text column. I started the process 70 plus hours ago with this command: CREATE FULLTEXT INDEX…
ace
  • 11,526
  • 39
  • 113
  • 193
2
votes
3 answers

How to limit ElasticSearch results by a field value?

We've got a system that indexes resume documents in ElasticSearch using the mapper attachment plugin. Alongside the indexed document, I store some basic info, like if it's tied to an applicant or employee, their name, and the ID they're assigned in…
ckasek
  • 23
  • 1
  • 6
2
votes
4 answers

TIFF IFilter not properly reading text in VARBINARY column

link textI want to use the TIFF IFilter built in to Windows 2008 Server R2 with Full-Text search in SQL Server 2008... also R2. I have installed the filter through server manager and updated the "Force TIFF IFilter to perform OCR for every page in a…
2
votes
0 answers

COUNT(*) on MyIsam Table with FT INDEX

I have some doubts when using FULL TEXT INDEX on myISAM tables in mySQL 5.5. I have table like this: CREATE TABLE `search_descr` ( `vid` int(11) NOT NULL, `v_name_text` text COLLATE utf8_polish_ci, `d_title` text COLLATE…
M. Klecha
  • 21
  • 3
2
votes
2 answers

How can I force postgres 9.4 to hit a gin full text index a little more predictably? See my query plan bug

POSTGRES 9.4 has been generating a pretty poor query plan for a full text query with LIMIT 10 at the end: SELECT * FROM Tbl WHERE to_tsvector('english'::regconfig, ginIndexedColumn) @@ to_tsquery('rareword') LIMIT 10 this generates: "Limit …
Isaac Bolinger
  • 7,328
  • 11
  • 52
  • 90
2
votes
1 answer

Why is my SQL QUERY using CONTAINS numbers taking up to 2 minutes

I have a table named Locations with a FullText Index on all columns. There's one PK Column (INT) and the rest are TEXT/VARCHAR. This table has 300,000 records. The following query is taking 2 minutes to return one record. SELECT TOP 1 * FROM…
2
votes
0 answers

Full Text indexing Multiple languages

My DB stores content in three languages (English,French & Arabic) I have full text indexing enabled for few tables and would like to know few best practices: 1. When show I us language neutral indexing? 2. Can I Index Arabic? I don't see Arabic in…
Abdel Raoof Olakara
  • 19,223
  • 11
  • 88
  • 133
2
votes
1 answer

What is the difference between EdgeNGramTokenizerFactory EdgeNGramFilterFactory in SOLR?

What is the difference between these two filters? They seem to have the same effect? Can anyone supply an example of how they are applied to some text?
Pinguthepenguin
  • 175
  • 3
  • 12
2
votes
1 answer

How to create a fulltext index on array attributes?

I'd like to create a fulltext index for an array attribute without needs to redundantly copy all its strings. I tried by defining an index for "attrib[*].string" but this does not work. Am I using the wrong syntax? If not, is it hard to support such…
augustin-s
  • 683
  • 4
  • 14
2
votes
0 answers

javahelp indexer for full text search

I have a problem with the full text search database generation in javaHelp. In order to generate the db I have to execute a command from a batch file: java -cp jhall.jar com.sun.java.help.search.Indexer -db .\JavaHelpSearch .\html This only works…
Neo
  • 1,337
  • 4
  • 21
  • 50
2
votes
1 answer

How to change engine of table without making any changes to table

My table is using 'InnoDB' storage engine when i try to do 'full text search' on it gives me an error 'The used table doesn't support full text indexes' That's why i tried to change my table engine from 'InnoDB' to 'MyIsam' but it again given me…
deepak
  • 23
  • 2