Questions tagged [hadoop-partitioning]

The hadoop-partitioning tag covers questions about how Hadoop decides which key/value pairs are sent to which reducer (partition).
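As a rough illustration of the description above: Hadoop's default HashPartitioner picks a reducer by hashing the key and taking the remainder by the number of reduce tasks. The sketch below mirrors that arithmetic in plain Java, without any Hadoop dependency. The class name PartitionSketch, the method name partitionFor, the sample keys, and the reducer count of 4 are assumptions made for this example only.

```java
// Illustrative only: PartitionSketch and partitionFor are hypothetical names,
// not part of the Hadoop API. They mirror the arithmetic of hash partitioning.
class PartitionSketch {

    // Mask off the sign bit so a negative hashCode() still yields a
    // non-negative value, then take the remainder by the reducer count.
    static int partitionFor(Object key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    public static void main(String[] args) {
        int numReduceTasks = 4; // assumed reducer count for this demo
        String[] keys = {"apple", "banana", "cherry", "apple"}; // sample keys
        for (String key : keys) {
            // Equal keys always map to the same reducer index.
            System.out.println(key + " -> reducer " + partitionFor(key, numReduceTasks));
        }
    }
}
```

Because the result depends only on the key's hash and the reducer count, all values for a given key reach the same reducer. A custom partitioner would replace this arithmetic with its own rule.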

339 questions
-1
votes
1 answer

Can we use log4j in MapReduce?

Can we use log4j for logging in MapReduce? If so, please provide the steps to use log4j in MapReduce to log information. I have written the log4.properties below, but nothing was logged.
-1
votes
1 answer

Hadoop MapReduce using 2 mappers and 1 reducer in C++

Following the instructions on this link, I implemented a word count program in C++ using a single mapper and a single reducer. Now I need to use two mappers and one reducer for the same problem. Can someone please help me with this?
-1
votes
1 answer

How do I set up Hadoop so it can process call detail records?

I have configured HDFS, the DataNode and NameNode, and also HBase. I have stored a CDR CSV file in HDFS. How can I map it to HBase and make it ready for processing?
-1
votes
2 answers

Hadoop Map Task: Read the content of a specified input file

I'm pretty new to the Hadoop environment. Recently, I ran a basic MapReduce program. It was easy to run. Now, I have an input file with the following contents inside the input path directory: fileName1 fileName2 fileName3 ... I need to read the lines of this file…
-1
votes
4 answers

New user SSH hadoop

When installing Hadoop on a single-node cluster, any idea why we need to create the following? Why do we need SSH access for a new user? Why should it be able to connect to its own user account? Why should I specify a password less for a new…
Surya
-2
votes
1 answer

Spark dataset withColumn add partition id

I am trying to write a helper function that takes a dataset of any type Dataset[_] and returns it with one new column "partitionId", which is the id of the partition that a single data unit belongs to. For example, if I have a dataset below and by default…
HayreddinLuo
-2
votes
1 answer

For some Hive queries I am unable to see the output

My query is SELECT txnno, product FROM txnrecsbycat TABLESAMPLE(BUCKET 2 OUT OF 10) ORDER BY txnno; The query succeeds, but I am unable to view my output. My output is: Total jobs = 1 Launching Job 1 out of 1 Number of reduce tasks determined at compile…
-2
votes
1 answer

I am researching HDFS failures and need HDFS logs. Where can I download the logs?

I am researching HDFS failures. For this I need HDFS logs. Where can I download the logs?
-3
votes
1 answer

How exactly to process data on Hadoop, Hive, Pig

I have learned the basics of Apache Hadoop Hive and know the majority of the commands. Now, how exactly do I work on the data? I have a huge amount of data available (I got it from a person), but I don't know what exactly to do with it. The data (.xlsx) is weekly sales,…
Sanjeev