Just wondering can sqoop run without a hadoop cluster? sort of in a standalone mode? Has anyone tried to run sqoop on spark, please share some experiences on it.
Asked
Active
Viewed 4,241 times
5
-
Given that it's a tool for transferring data to/from Hadoop, what good would it do on its own? StackOverflow isn't a polling site - "share some experiences" isn't a valid on-topic question around here. – ceejayoz Apr 24 '17 at 14:16
-
2sqoop2 is not a tool just transferring data to/from hadoop. There are cases, sqoop move things around outside hadoop. – Shengjie Apr 24 '17 at 14:24
1 Answers
10
To run Sqoop commands (both sqoop1
and sqoop2
), Hadoop is a mandatory prerequisite. You cannot run sqoop commands without the Hadoop libraries.
Sqoop works in local mode too, so it is not a requirement that the Hadoop daemons must be running. To run sqoop in local mode,
sqoop [tool-name] -fs local -jt local [tool-arguments]
Sqoop on Spark is still In-Progress. See SQOOP-1532

franklinsijo
- 17,784
- 4
- 45
- 63
-
Thanks for the reply. I guess, performance wise, running in local mode is pretty much like running a data loading job in a single process. – Shengjie Apr 30 '17 at 04:59