Questions tagged [analyzer]

For questions related to a person or device that analyses (or analyzes) given data

An analyser, also spelt analyzer, is a person or device that analyses given data. It examines in detail the structure of the given data and tries to find patterns and relationships between parts of the data. An analyser can be a piece of hardware or a software program running on a computer.

Information is taken from Wikipedia. The following article contains more information about the topic and links to other topics about commonly used analyzer types.

642 questions
5
votes
1 answer

Creating a Lucene.net Custom Analyzer

I am trying to create a custom analyzer in Lucene.net 4.8 - however I am running into an error I can't fathom. My analyzer code : public class SynonymAnalyzer : Analyzer { protected override TokenStreamComponents CreateComponents(string…
chilluk
  • 217
  • 2
  • 17
5
votes
1 answer

How to practially use a keywordanalyzer in azure-search?

a little relating and continuing to this question: Azure Search Analyzer I want to use a keywordanalyzer for word collections. We have documents (products) with different fields like product_name, brand, categorie and so on. To implement a keyword…
ldenkewi
  • 53
  • 1
  • 3
5
votes
2 answers

Elasticsearch aggregation turns results to lowercase

I've been playing with ElasticSearch a little and found an issue when doing aggregations. I have two endpoints, /A and /B. In the first one I have parents for the second one. So, one or many objects in B must belong to one object in A. Therefore,…
5
votes
1 answer

GNU Makefile "preprocessor"?

Is there an option to output the "preprocessed" makefile, something equivalent to the GCC's -E option? I have a project comprised of an hierarchy of dozens of modules, each with its makefile. The build is invoked from a master makefile. That master…
ysap
  • 7,723
  • 7
  • 59
  • 122
5
votes
1 answer

How to configure correctly FOS Elastica analyzers and filters?

In my symfony2 application, I am using FOS Elastica bundle to perform searches. I have tried to set up analyzers and filters but it seems they just have no effect. For instance, if I search for the word 'cake', the objects containing the sentence…
Sébastien
  • 5,263
  • 11
  • 55
  • 116
5
votes
1 answer

NHibernate Query Analyzer

Could anyone please describe how to use NHibernate Query Analyzer? I downloaded it and trying to build it but it fails, I am not quite sure how to use it. I am using nHibernate with Sybase and WCF. Any help is greatly appreciated. Thanks
VoodooChild
  • 9,776
  • 8
  • 66
  • 99
5
votes
2 answers

elasticsearch custom tokenizer - split token by length

I am using elasticsearch version 1.2.1. I have a use case in which I would like to create a custom tokenizer that will break the tokens by their length up to a certain minimum length. For example, assuming minimum length is 4, the token "abcdefghij"…
5
votes
2 answers

What analyzer should I use for a URL in lucene.net?

I'm having problems getting a simple URL to tokenize properly so that you can search it as expected. I'm indexing "http://news.bbc.co.uk/sport1/hi/football/internationals/8196322.stm" with the StandardAnalyzer and it is tokenizing the string as the…
John_
  • 2,931
  • 3
  • 32
  • 49
5
votes
2 answers

Eclipse Memory Analyzer: Unable to attach to 32-bit process

when trying to acquire a Heapdump of my eclispe rcp application with the Java Memory Analayzer I get the following error message: Error creating heap dump. jmap exit code = 1 4120: Unable to attach to 32-bit process running under WOW64 The -F option…
Markus
  • 1,452
  • 2
  • 21
  • 47
5
votes
1 answer

Multiple analyzers on 1 field

Got 2 fields (name & description), using a Danish analyzer. Now I need to support it with an English analyser too. What is my solutions? Add more analysers on the fields? Set the language when looking up? (don't think this works, because the data…
5
votes
2 answers

How to use static code analyzer introduced in android ndk r8e?

http://developer.android.com/tools/sdk/ndk/index.html As the above link suggests, android ndk r8e : Added static code analyzer for Linux/MacOSX hosts. For information on using the analyzer, see CHANGES.HTML. In CHANGES.HTML: Added static code…
xgdgsc
  • 1,367
  • 13
  • 38
5
votes
2 answers

KeywordAnalyzer and LowerCaseFilter/LowerCaseTokenizer

I want to build my own analyzer that uses both filters/tokenizers. I mean, the same field is Keyword (entire stream as a single token) and lowercase If KeywordAnalyzer use only, the value of field keeps the case-insensitive. If I use…
jotapdiez
  • 1,456
  • 13
  • 28
5
votes
1 answer

Lucene Highlighter with stemming analyzer

I am using Lucene's Highlighter class to highlight fragments of matched search results and it works well. I would like to switch from searching with the StandardAnalyzer to the EnglishAnalyzer, which will perform stemming of terms. The search…
user605331
  • 3,718
  • 4
  • 33
  • 60
4
votes
1 answer

Lucene mutli-language analyzer/index approach

I have a working Lucene index supporting a suggestion service. When a user types into a search box it queries the index by the SUGGESTION_FIELD. Each entry in SUGGESTION_FIELD can be one of many supported languages and each is stored using an…
Rob McFeely
  • 2,823
  • 8
  • 33
  • 50
4
votes
1 answer

Customizing Analyzers in Solr

In Solr I have a custom Analyzer that has two parameters. I know how to specify this Analyzer in the schema.xml but I'm wondering how I can pass the two arguments either in the schema.xml or runtime in the code.
Fredrik L
  • 1,790
  • 4
  • 21
  • 28