Questions tagged [analyzer]

For questions related to a person or device that analyses (or analyzes) given data

An analyser, also spelt analyzer, is a person or device that analyses given data. It examines in detail the structure of the given data and tries to find patterns and relationships between parts of the data. An analyser can be a piece of hardware or a software program running on a computer.

Information is taken from Wikipedia. The following article contains more information about the topic and links to other topics about commonly used analyzer types.

642 questions
10
votes
8 answers

Dependency map for Java classes and methods

I have a Java project that I've been working on for a while. The design started out pretty good but slowly degraded as changes were made. I'm looking for some sort of tool that will analyze the project. It'd be really nice to have a map of the…
Ryan
  • 4,517
  • 7
  • 30
  • 34
9
votes
2 answers

Roslyn analyzer missing assembly warning

After creating a Roslyn analyzer package targeting .Net Standard 2.0, when I reference the package in another project, I receive the following error: 'C:\Users\username.nuget\packages\analyzer4\1.0.0.1\analyzers\dotnet\cs\Analyzer4.dll' depends on…
mbabramo
  • 2,573
  • 2
  • 20
  • 24
9
votes
1 answer

line breaks or punctuation marks as position gaps in elasticsearch

In elasticsearch, is there a way to set up an analyzer that would produce position gaps between tokens when line breaks or punctuation marks are encountered? Let's say I index an object with the following nonsensical string (with line break) as one…
Shadocko
  • 1,186
  • 9
  • 27
9
votes
2 answers

Create Custom Analyzer after index has been created

I am trying to add a custom analyzer. curl -XPUT 'http://localhost:9200/my_index' -d '{ "settings" : { "analysis" : { "filter" : { "my_filter" : { "type" : "word_delimiter", …
Cris Pinto
  • 293
  • 3
  • 18
9
votes
2 answers

Normalize British and American English for Elasticsearch

Is there a best practice for normalizing British and American English in Elasticsearch? Using a Synonym Token Filter requires an incredibly long configuration file. There are actually several thousand differently spelled words in UK and US English…
Simon Steinberger
  • 6,605
  • 5
  • 55
  • 97
8
votes
2 answers

How to analyse Websphere core*.dmp file and Snap*.trc files?

All, I have my application running on websphere app server 7.0. I get some of these core dumps and trace files like core.20110909.164930.3828.0001.dmp and Snap.20110909.164930.3828.0003.trc. My question is, just like the thread dumps generated by…
Ayusman
  • 8,509
  • 21
  • 79
  • 132
8
votes
1 answer

Elasticsearch : search results on clicking on Hashtag

I have a hashtag with tags in camel Case like #teamIndia. Now when this hashtag is clicked, it should fetch all results which have "#teamIndia" in it, It should first show results with "#teamIndia", then results with "teamIndia" and then "team…
Kunal Dethe
  • 1,254
  • 1
  • 18
  • 38
8
votes
1 answer

What does Field.Index.NOT_ANALYZED_NO_NORMS mean

I know what does not_analyzed mean. In short the field will not be tokenized by specified Analyzer. However, what does a NO_NORMS means? I see the documentation, but please explain me in plain English. what is index-time field and document boosting…
ankitjaininfo
  • 11,961
  • 7
  • 52
  • 75
8
votes
2 answers

Elasticsearch count terms ignoring spaces

Using ES 1.2.1 My aggregation { "size": 0, "aggs": { "cities": { "terms": { "field": "city","size": 300000 } } } } The issue is that some city names have spaces in them and aggregate…
user432024
  • 4,392
  • 8
  • 49
  • 85
8
votes
2 answers

What lucene analyzer can be used to handle Japanese text?

Which lucene analyzer can be used to handle Japanese text properly? It should be able to handle Kanji, Hiragana, Katakana, Romaji, and any of their combination.
Franz See
  • 3,282
  • 5
  • 41
  • 48
8
votes
1 answer

What kind of memory leaks XCode Analyzer may not notice?

I'm afraid that asking this question may result in some downvotes, but after making some not satisfying research I decided to take a risk and ask more experienced people... There are many questions here referring to some specific problems connected…
radekEm
  • 4,617
  • 6
  • 32
  • 45
8
votes
1 answer

Why does Lucene QueryParser needs an Analyzer

I'm new to Lucene and trying to parse a raw string into a Query using the QueryParser. I was wondering, why is the QueryParser.Parse() method needs an Analyzer parameter at all? If analyzing is something that has to do with querying, then an…
haim770
  • 48,394
  • 7
  • 105
  • 133
8
votes
3 answers

How to properly escape OR and AND in lucene query?

When I passed in a query "state:OR" lucene gave an error because it considers "OR" as a keyword for boolean clause, but here I actually man the abbreviation of Oregon, the state. I have seen that quoting OR so the query becomes 'state:"OR"' makes it…
teddy teddy
  • 3,025
  • 6
  • 31
  • 48
7
votes
1 answer

Strange Xcode Analyze results

When I Analyze my project in Xcode, I get a few strange errors. All of this code is part of a single method which creates arrays that can be used to make MKAnnotations. Sorry if this is an inundation of code—I did my best to comment out the…
eric.mitchell
  • 8,817
  • 12
  • 54
  • 92
7
votes
2 answers

Cannot run Code Analysis Analyzer in Visual Studio 2015

I installed Microsoft.CodeAnalysis.CSharp.FxCopAnalyzers from nuget into a project. The project it is installed into is an existing project, building against Framework 4.5.1. Code Analysis is Enabled on the project... ... and I see CA warnings in…
NikolaiDante
  • 18,469
  • 14
  • 77
  • 117
1 2
3
42 43