I have set up Hadoop 1.2.1 on Windows with Cygwin installed.
I have started the sshd service.
I have also started the NameNode, DataNode, and MapReduce (JobTracker, TaskTracker). I am able to see the running status of the NameNode, DataNode, and MapReduce through the…
I have discovered some significant performance differences (in terms of both real-time runtime and CPU time) between Pig and Hive, and I am looking for ways to get to the bottom of these differences. I have used both languages' explain feature…
I have a web application and one or more remote clusters. These clusters can be on different machines.
I want to perform the following operations from my web application:
1. HDFS actions:
Create a new directory
Remove files from HDFS (Hadoop…
I am using SingleColumnValueFilter, scan, and getScanner to retrieve values from an HBase table. The results come back sorted by row ID, but I want them sorted by value.
Example:
HBase table:
RowId | col1:amount…
There are notes about how Cascading/Scalding optimizes map-side evaluation.
They use so-called partial aggregation.
Is it actually a better approach than Combiners? Are there any performance comparisons on some common Hadoop tasks (word count for…
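To make the question concrete, here is a minimal sketch of the general idea behind map-side partial aggregation, assuming it works by keeping a small bounded cache of running totals in the mapper and emitting an entry whenever it is evicted. This is not Cascading's or Scalding's actual API: the names partial_aggregate and reduce_sum and the capacity parameter are made up for illustration, and the LRU eviction policy is an assumption.

```python
from collections import OrderedDict


def partial_aggregate(pairs, capacity=2):
    """Sum values per key on the map side using a bounded LRU cache.

    Hypothetical helper for illustration only. A key can be emitted more
    than once (if it is evicted and then seen again), so the reducer
    still has to sum the partial results.
    """
    cache = OrderedDict()
    for key, value in pairs:
        if key in cache:
            cache[key] += value
            cache.move_to_end(key)  # mark as most recently used
        else:
            cache[key] = value
            if len(cache) > capacity:
                # Evict the least recently used entry and emit its partial sum.
                yield cache.popitem(last=False)
    # Flush whatever is still cached at the end of the map input.
    yield from cache.items()


def reduce_sum(pairs):
    """Reduce side: combine the partial sums into final totals."""
    totals = {}
    for key, value in pairs:
        totals[key] = totals.get(key, 0) + value
    return totals


words = "a b a c a b".split()
mapped = [(w, 1) for w in words]
partial = list(partial_aggregate(mapped, capacity=2))
totals = reduce_sum(partial)
```

With this input the mapper emits fewer pairs than it reads, and the totals are the same as a plain word count. Unlike a Combiner, the aggregation here happens in memory before anything is serialized or sorted, which is the difference the question is asking about.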
My problem is registering a Python UDF in Hive.
I wrote Python code for encryption and decryption to use in Hive queries. Both work as expected.
However, I do not want to add the files every time I use them; instead I would like to make a permanent…
I'm new to Apache Hadoop. I installed the prerequisite software, configured everything, and set up the Eclipse plugin, but when I click "New Hadoop location" nothing happens. Can anyone help me?
I am considering various technologies for data warehousing and business intelligence, and have come upon this radical tool called Hadoop. Hadoop doesn't seem to be built exactly for BI purposes, but there are references to it having potential in…
(I'm sure a similar question exists, but I haven't found the answer I'm looking for yet.)
I'm using Hadoop and Hive (for our developers with SQL familiarity) to batch process multiple terabytes of data nightly. From an input of a few hundred…
Problem:
Sqoop import is failing due to a wrong combination of the Sqoop plugin, Hadoop, and the Couchbase connector.
Details:
I am using Sqoop 1.4.5-cdh5.2.0 with Hadoop 2.5.0-cdh5.2.0 on a Linux machine with Java 6 to import data from Couchbase using…
I want to build the Hadoop Eclipse plugin for Hadoop 2.4.0. How do I build this plugin?
I've tried several options using Apache Ant, but none of them solved my problem.
Using this link, I tried installing Hadoop 2.2.0 (single-node cluster) on Ubuntu 12.04 (64-bit machine):
http://bigdatahandler.com/hadoop-hdfs/installing-single-node-hadoop-2-2-0-on-ubuntu/
While formatting the HDFS file system via the namenode…
I am trying to install Hadoop on Windows, following all the steps provided at http://v-lad.org/Tutorials/Hadoop/13.5%20-%20copy%20hadoop%20plugin.html
I am facing a problem installing the Hadoop plugin. The Hadoop version downloaded from…
In Eclipse I am trying to add a new Hadoop location, but when I click to add it, Eclipse does not respond. I built my own plugin for Hadoop 1.0.4.
My manifest looks like this:
Bundle-ClassPath:…