How can I guarantee that Hadoop FileSystem connections are managed properly when I use a pool?

Asked Mar 11 '13 at 12:08

Active Sep 17 '19 at 12:10

Viewed 801 times

Since FileSystem.get is not thread-safe, I use FileSystem.newInstance instead. However, calling newInstance every time I need a connection to HDFS may not be a good idea, so I built a FileSystem connection pool.
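For illustration, a pool like the one described above could look roughly like the following sketch. It is not taken from Hadoop or Hive: SimplePool, Factory, borrow, giveBack and idleCount are hypothetical names, and the pool is written against java.io.Closeable so that it compiles without Hadoop on the classpath. With Hadoop, T would be FileSystem and the factory would call FileSystem.newInstance(conf).

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

/**
 * Hypothetical fixed-size pool of closeable resources.
 * In the Hadoop case, T would be org.apache.hadoop.fs.FileSystem.
 */
final class SimplePool<T extends Closeable> implements Closeable {

    /** Factory that may fail with IOException, as FileSystem.newInstance(conf) can. */
    interface Factory<T> {
        T create() throws IOException;
    }

    // Instances that are currently not borrowed by any thread.
    private final BlockingQueue<T> idle;

    SimplePool(int size, Factory<T> factory) throws IOException {
        idle = new ArrayBlockingQueue<>(size);
        for (int i = 0; i < size; i++) {
            idle.add(factory.create()); // create all instances up front
        }
    }

    /** Blocks until an instance is free, then hands it to the caller. */
    T borrow() throws InterruptedException {
        return idle.take();
    }

    /** Returns a borrowed instance to the pool; call this in a finally block. */
    void giveBack(T resource) {
        idle.offer(resource);
    }

    /** Number of instances currently available for borrowing. */
    int idleCount() {
        return idle.size();
    }

    /** Closes every idle instance; instances still borrowed are not closed. */
    @Override
    public void close() throws IOException {
        T resource;
        while ((resource = idle.poll()) != null) {
            resource.close();
        }
    }
}
```

With Hadoop on the classpath, this would be constructed as new SimplePool<FileSystem>(4, () -> FileSystem.newInstance(conf)), with each borrow() paired with giveBack() in a finally block and close() called once at shutdown. Instances that are still borrowed when close() runs are not closed by this sketch, which is exactly the kind of gap the question is about.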

This is my first question.

Is this a good approach?

I checked the Hive source, and it does not use this approach. It just uses the HDFS API directly and never even calls newInstance. Why? How does it create new FileSystem connections?

It does not call FileSystem.close() either.

How does it guarantee that the FileSystem will be closed?

edited Sep 17 '19 at 12:10

Naman


asked Mar 11 '13 at 12:08

Byung Min Baek


0 Answers