Questions tagged [elasticsearch]

Elasticsearch is a free and open, distributed, RESTful search engine based on Lucene.

Elasticsearch is a free and open, distributed, RESTful search engine based on and developed in .

Together, , , and are commonly known as the . With the addition of Beats, a free and open platform for single-purpose data shippers, the ELK Stack is now known as the Elastic Stack.

Elasticsearch is a distributed, scalable, and multitenant-capable full-text search engine with a RESTful web interface and schema-free documents. It is also document-oriented and includes various APIs, such as native Java and HTTP RESTful. Elasticsearch supports both schema on read and schema on write on the same data, providing speed an

Elasticsearch allows you to choose to store data locally for fast search or store remotely on object-based storage such as for lower-cost options.

Elasticsearch comes with various text preprocessing algorithms, allowing users to set up custom or predefined multilingual analyzers to improve search relevance.

Official clients are available in , , , , , , , and many other languages.

Latest stable releases:

Elastic is the parent company of the Elasticsearch product.

58010 questions
9
votes
1 answer

How to return actual value (not lowercase) when performing search with terms aggregation?

I am working on an ElasticSearch (6.2) project where the index has many keyword fields and they are normalized with lowercase filter for performing case-insensitive searches. The search working great and returning actual values (not lowercase) of…
Anam
  • 11,999
  • 9
  • 49
  • 63
9
votes
1 answer

How to append to an array in Elasticsearch using elasticsearch-py

Using the Official ElasticSearch Python library (Docs) I create an index: doc = { "something": "123a", "somethingelse": "456b", "timestamp": datetime.now(), "history": [] } es.index(index="someindex", doc_type="somedoctype",…
AO_
  • 2,573
  • 3
  • 30
  • 31
9
votes
3 answers

Content-type header not supported

I am following this link for elasticsearch. https://www.elastic.co/blog/a-practical-introduction-to-elasticsearch I am trying following curl to post the json data. curl -XPOST "http://localhost:9200/shakespeare/_bulk?pretty" --data-binary…
nas
  • 2,289
  • 5
  • 32
  • 67
9
votes
4 answers

org.apache.http.ContentTooLongException: entity content is too long [105539255] for the configured buffer limit [104857600]

Am trying to fetch the indexed PDF documents from my index (ElasticSearch). I have indexed my pdf documents using ingest-attachment processor plugin. Totally its 2500 documents has been indexed along with PDF attachment. Now am fetching those PDF by…
Karthikeyan
  • 1,927
  • 6
  • 44
  • 109
9
votes
2 answers

ElasticSearch n-gram tokenfilter not finding partial words

I have been playing around with ElasticSearch for a new project of mine. I have set the default analyzers to use the ngram tokenfilter. This is my elasticsearch.yml file: index: analysis: analyzer: default_index: tokenizer:…
asleepysamurai
  • 1,362
  • 2
  • 14
  • 23
9
votes
1 answer

Rails4 production elasticsearch error - failed to find geo_point, but works in Development

i have a fine running Rails 4 app using elastic search and searchkick.I have a geosearch by setting up geo_point and it works great in dev, but after deploying the same code and verifying the indices in production, on Digital Ocean with 3GB RAM and…
Milind
  • 4,535
  • 2
  • 26
  • 58
9
votes
1 answer

Airflow - how to send task logs to elastic search?

How should I configure or send task logs from airflow 1.9 to Elastic search? I found the config templates in the current git repository but not sure if that can be done in v 1.9
Amit Kumar
  • 465
  • 1
  • 6
  • 19
9
votes
3 answers

How to get all the values from an search result

I am new to Elastic Search. Is there any way to get all the search results for a search keyword? Elastic Search is limited to 10 or else we can set the size but we need to get the size??
raagavan
  • 951
  • 3
  • 12
  • 16
9
votes
1 answer

Can't pickle _thread.lock objects Pyspark send request to elasticseach

I am using pyspark streaming to collect data from tweepy. After all the set up, I send the dict(json) to elasticsearch via elasticsearch.index(). But I get "can't pickle_thread.lock objects" error and other 63 errors. The track back log is too long…
Zhongwei WANG
  • 93
  • 1
  • 1
  • 5
9
votes
1 answer

How to run two spark jobs in parallel in standalone mode

I have spark job in which I process a file and then do following steps. 1. Load the file into DataFrame 2. Push the DataFrame to elasticsearch 3. Run some aggregations on dataframe and save to cassandra I have written a spark job for this in which…
hard coder
  • 5,449
  • 6
  • 36
  • 61
9
votes
1 answer

Spring-data-elasticsearch search for specific fields in multiple indices

I try to search specific fields in multiple indices. I have two indices country and region. Both of the indices have a Field called name. I am able to specify my field name and my indices in my query using elasticsaerchTemplate: @Override public…
Patrick
  • 12,336
  • 15
  • 73
  • 115
9
votes
2 answers

Search implementation: ElasticSearch vs MongoDB vs Relational Database

I am designing a big travel market agency, where I have 170000 hotels and 3000 room types. A simple representation of my entities is: Hotel: destination: Paris rooms: room_a: type: single room_b: type:…
iiirxs
  • 4,493
  • 2
  • 20
  • 35
9
votes
2 answers

elasticsearch add field to all documents

I'm new here. I have about 200 thousand documents in one index, all have same type. I want to add one more field "category" (which is a keyword string) to every single document. Is there a convenient way to achieve this? I know normally one query…
Nico
  • 733
  • 1
  • 6
  • 7
9
votes
3 answers

Ways to only process new(index after last run) data in Elasticsearch?

Is there a way to get the date and time that an elastic search document was written? I am running es queries via spark and would prefer NOT to look through all documents that I have already processed. Instead I would like read the only documents…
SparkleGoat
  • 503
  • 1
  • 9
  • 22
9
votes
2 answers

ElasticSearch use "best match" of ngram terms instead of "synonym"?

Is it possible to tell ElasticSearch to use "best match" of all grams instead of using grams as synonyms? By default ElasticSearch uses grams as synonyms and returns poorly matching documents. It's better to showcase with example, let's say we have…
Alex Craft
  • 13,598
  • 11
  • 69
  • 133