I currently have a partitioned ORC "Managed" Hive table (wrongly created as internal first) in Prod with at least 100 days' worth of data, partitioned by year, month, day (~16 GB of data).
This table has roughly 160 columns. Now my requirement is to add a…
I tried various commands and still have issues viewing the YARN logs. Here are the steps I followed to view the logs:
[root@iop-rm1 /]# yarn application -list
17/04/03 13:15:21 INFO client.RMProxy: Connecting to ResourceManager at…
I'm looking for the Analytics for Apache Hadoop service in Bluemix to try out SystemT Text Analytics for a project I'm working on. It seems like it's not available any more. Is this the case? Is there any replacement or is the service just…
I'm trying to import data into my Analytics for Apache Hadoop instance using Hadoop shell commands.
The Analytics for Apache Hadoop Bluemix documentation provides a link to the BigInsights documentation in the related links section. The link is:…
I have created the Node-RED boilerplate and bound the Analytics for Apache Hadoop service to it.
It clearly appears as a bound service in the dashboard.
But when I launch the Node-RED app and add an HDFS node, I get the following…
I am trying to run IBM BigInsights (Hadoop), but it does not appear in IBM Bluemix.
Details: I have registered (free trial) and logged into IBM Bluemix, but unfortunately, in the Services category, Analytics for Hadoop does not appear in the Big Data section. Please find…
We have an HDP cluster with 528 DataNode machines.
In the Ambari HDFS configs, we configured 3 config groups because of the following:
212 DataNode machines have 32G
119 DataNode machines have 64G
197 DataNode machines have 128G
So in…
I am not able to run a Spark job in YARN (cluster or client) mode through Livy. I can run Spark jobs using Livy, but they run in local mode and are not visible in the ResourceManager. I want to run these jobs in yarn-cluster mode.
I am using…
I have written the following Scala program in Eclipse to read a CSV file from a location in HDFS and then save that data into a Hive table [I am using the HDP 2.4 sandbox running in VMware on my local machine]:
import…
I am getting the following error when installing Hive with Apache Ambari.
resource_management.core.exceptions.Fail: Execution of '/usr/bin/yum -d 0 -e 0 -y install hive2_2_5_3_0_37' returned 1. Error: Package:…
I am working with the Hive benchmark https://github.com/hortonworks/hive-testbench
I have a problem loading data into the tables. In fact, the TPC-DS data generator generates the data and then tries to load it into the tables, but at table 17 out of 24…
I initially installed a cluster with 2 slave nodes (node01, node02) successfully using the Ambari Cluster Install wizard.
My Ambari Server is running on another node (node 00).
The problem is that at the time of the install wizard I gave only 2 slave nodes (because it…
I am using Big SQL from Analytics for Apache Hadoop in Bluemix and would like to look into the logs in order to debug (e.g. the MapReduce job log, usually available under http://my-mapreduce-server.com:19888/jobhistory, bigsql.log from the Big SQL worker…
I've been trying to run a Shell Action on Analytics for Apache Hadoop, but I have run into some issues. For more information, see "How to debug failed Oozie workflows on Analytics for Apache Hadoop?".
In response to the issues, I have asked…
I'm trying to run an Oozie workflow on Bluemix Analytics for Apache Hadoop, but it is failing. The output from calling Workflow status is as follows:
...
{
"errorMessage": "Main class [org.apache.oozie.action.hadoop.ShellMain], exit…