In the es cluster, it has a large scale data, we used spark to compute data but in the way of elasticsearch-hadoop
, followed by https://www.elastic.co/guide/en/elasticsearch/hadoop/current/spark.html
We have to read full columns of an index. Is there anything that help?