Highest Voted 'hadoop-partitioning' Questions

-1

votes

1 answer

can we use log4j in mapreduce?

Can we use log4j to log in mapreduce? If so, provide the steps to use log4j in map-reduce to log the information. I have written the below log4.properties but, nothing was logged.

asked Sep 07 '16 at 03:15

Mr Shetty

19
1
4

-1

votes

1 answer

Hadoop mapreduce using 2 mapper and 1 reducer using c++

Following the instructions on this link, I implemented a wordcount program in c++ using single mapper and single reducer. Now I need to use two mappers and one reducer for the same problem. Can someone help me please in this regard?

c++ hadoop hadoop-streaming hadoop-partitioning

asked Sep 08 '14 at 14:00

user3532122

15
4

-1

votes

1 answer

I have to implement hadoop, so it can process the data of call detail records?

I have configured HDFS, Datanode and namenode and also hbase. I have stored a CDR csv file in HDFS. So how can I map it with Hbase and make ready to process it?

hadoop hadoop-streaming hadoop2 hadoop-plugins hadoop-partitioning

asked Jul 23 '14 at 15:22

user3869412

1

-1

votes

2 answers

Hadoop Map Task : Read the content of a specified input file

I'm pretty new to Hadoop environment. Recently, I run a basic mapreduce program. It was easy to run. Now, I've a input file with following contents inside input path directory fileName1 fileName2 fileName3 ... I need to read the lines of this file…

java hadoop mapreduce cloudera hadoop-partitioning

asked Oct 15 '13 at 10:36

hadoopDev

1

-1

votes

4 answers

New user SSH hadoop

Installation of hadoop on single node cluster , any idea why do we need to create the following Why do we need SSH access for a new user ..? Why should it be able to connect to its own user account? Why should i specify a password less for a new…

hadoop hadoop-streaming hadoop-plugins hadoop-partitioning

asked Jul 23 '13 at 08:45

Surya

3,408
5
27
35

-2

votes

1 answer

Spark dataset withColumn add partition id

I am trying to write a helper function that takes a dataset of any typeDataset[_], and returns with one new column "partitionId" which is the id of the partition that single data unit belongs to. For example, if I have a dataset below and by default…

scala apache-spark dataset hadoop-partitioning

asked Jun 04 '19 at 21:25

HayreddinLuo

91
1
6

-2

votes

1 answer

For some of the hive queries I wasn't able to see the o/p?

My query is SELECT txnno, product FROM txnrecsbycat TABLESAMPLE(BUCKET 2 OUT OF 10) ORDER BY txnno; I am getting success but unable to view my O/p My o/p is: Total jobs = 1 Launching Job 1 out of 1 Number of reduce tasks determined at compile…

hadoop hive mapreduce hiveql hadoop-partitioning

asked Jul 25 '17 at 16:00

user3676429

1

-2

votes

1 answer

I resarch about HDFS failures. For this I need to HDFS logs . Where can I download the logs?

I resarch about HDFS failures. For this I need to HDFS logs . Where can I download the logs ?

hadoop hadoop2 hadoop-streaming hadoop-partitioning webhdfs

asked May 05 '16 at 08:07

Mehdi Medadian

7
1
3

-3

votes

1 answer

How exactly to process data on Hadoop,Hive,Pig

I have learnt the basics of Apache Hadoop Hive. And know majority of commands. Now, how to exactly work on the data. I have huge amt of data available with me(got it from a person). But dont know what exactly to do. The data(.xlsx) is weekly sales,…

hadoop hive hbase apache-pig hadoop-partitioning

asked May 28 '15 at 05:19

Sanjeev

17
1
6

Questions tagged [hadoop-partitioning]