Spark Thrift maintain caches among sessions

Question

In Spark Thrift, when using the beeline client, is the following workflow possible?

1. A user connects to the server using beeline and creates a cached table.
2. The user connects to the server again using beeline (a different session) and can use the cached table created in the previous session.

I tested this workflow. In step 1, I can see in the Spark UI that the cached table is there. When I end the session from step 1, the table is still there. But when I reconnect as the same user, I can't use it.

score 1 · Answer 1 · answered Nov 21 '17 at 13:36

As far as I know, you cannot do this. Sharing RDDs, DataFrames, and Datasets (including Spark SQL tables in the Spark catalog) across applications (Spark contexts or Spark sessions) is where Alluxio comes in: https://www.alluxio.org/. However, you can always write the data to a Hive table.
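
A rough sketch of the Hive table route, as it might be typed into beeline. The names `my_cached_table` and `shared_results` are hypothetical, and this assumes the Thrift server is backed by a Hive metastore that both sessions can reach:

```sql
-- Session 1: persist the data as a regular metastore-backed table.
-- my_cached_table and shared_results are hypothetical names.
CREATE TABLE shared_results AS
SELECT * FROM my_cached_table;

-- Session 2 (a new beeline connection): the table is found through the metastore.
SELECT COUNT(*) FROM shared_results;

-- Optionally cache it again in this session.
CACHE TABLE shared_results;
```

The second session reads the table from storage rather than reusing the first session's in-memory cache, so the first query against it pays the read cost again.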
