Questions tagged [biginsights]

IBM InfoSphere BigInsights brings the power of Hadoop to the enterprise. Apache Hadoop is the open source software framework, used to reliably manage large volumes of structured and unstructured data.

IBM InfoSphere BigInsights

103 questions
2
votes
2 answers

Copy files from Remote Unix and Windows servers into HDFS without intermediate staging

How can I copy files from remote Unix and Windows servers into HDFS without intermediate staging from the command line?
2
votes
1 answer

Inconsistent runtime kernel parameters in DOCKER container and on host

my host is on Ubuntu 14.04.2 LTS and I'm using the latest centos base image in order to create a DOCKER image of IBM InfoSphere BigInsights in order to push it to the Bluemix Container Cloud. I've solved nearly everything but I'm stuck with setting…
Romeo Kienzler
  • 3,373
  • 3
  • 36
  • 58
2
votes
1 answer

gpfs file access through Java API/Python

I am new to IBM-GPFS. As we can access HDFS via Java API, python scripting. Is there any similar way to access GPFS files? Please suggest me the solution.
1
vote
1 answer

HDFS-GPFS connector for using in Apache Spark

Is there a possibility to read data from IBM GPFS (Global Parallel Filesystem) in Apache Spark ? My intention is to use something like this sc.textFile("gfps://...") instead of sc.textFile("hdfs://...") The environment that is intended to be used…
dumitru
  • 2,068
  • 14
  • 23
1
vote
1 answer

Data Science Experience responds with an empty Hive table

From my Data Science Experience, I am able to make a connection to the Hive database in BigInsights and read the table schema. But Data Science Experience does not seem to be able to read the table contents as I get a count of zero! Here are some of…
1
vote
0 answers

Maven repo for BigInsights / IOP

I have a IBM BigInsights 4.2 cluster installed and I am trying to create native yarn jobs. To compile the java code on my laptop (not on the cluster), I need the hadoop client libraries. How can I get the hadoop java libraries for IBM BigInsigths…
1
vote
1 answer

Spark Hive reporting ClassNotFoundException: com.ibm.biginsights.bigsql.sync.BIEventListener

I'm attempting to run a pyspark script on BigInsights on Cloud 4.2 Enterprise that accesses a Hive table. First I create the hive table: [biadmin@bi4c-xxxxx-mastermanager ~]$ hive hive> CREATE TABLE pokes (foo INT, bar STRING); OK Time taken: 2.147…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
3 answers

Spark Hive reporting pyspark.sql.utils.AnalysisException: u'Table not found: XXX' when run on yarn cluster

I'm attempting to run a pyspark script on BigInsights on Cloud 4.2 Enterprise that accesses a Hive table. First I create the hive table: [biadmin@bi4c-xxxxx-mastermanager ~]$ hive hive> CREATE TABLE pokes (foo INT, bar STRING); OK Time taken: 2.147…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

hive spark yarn-cluster job fails with: "ClassNotFoundException: org.datanucleus.api.jdo.JDOPersistenceManagerFactory"

I'm attempting to run a pyspark script on BigInsights on Cloud 4.2 Enterprise that accesses a Hive table. First I create the hive table: [biadmin@bi4c-xxxxx-mastermanager ~]$ hive hive> CREATE TABLE pokes (foo INT, bar STRING); OK Time taken: 2.147…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

How can I use python > 2.6.6 with spark on BigInsights on cloud Enterprise clusters?

The version of python with BigInsights is currently 2.6.6. How can I use a different version of Python with my spark jobs running on yarn? Note that users of BigInsights on cloud do not have root access.
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

BigInsights Example KnoxShell on Windows

Trying to use the KnoxShell example from Windows (using Cmder as shell emulator). When executing the following command, the shell locks, no prompt available and nothing happend. I can only close Shell. Is that specific to Windows? λ gradlew.bat -q…
Victor
  • 243
  • 1
  • 3
  • 9
1
vote
1 answer

BigInsights on cloud - Class org.apache.oozie.action.hadoop.SparkMain not found

I'm trying to execute the spark oozie example on the oozie_spark branch against a BigInsights for Apache Hadoop basic cluster. The workflow.xml looks like this:
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

SparkR job(R script) submit using spark-submit fails in BigInsights Hadoop cluster

I have created IBM BigInsights service with hadoop cluster of 5 nodes(including Apache Spark with SparkR). I trying to use SparkR to connect cloudant db and get some data and do some processing. SparkR job(R script) submit using spark-submit fails…
Pari Margu
  • 209
  • 3
  • 10
1
vote
2 answers

Message: RSA premaster secret error. ERRORCODE=-4499, SQLSTATE=08001

I'm trying to connect to BigSQL on a BigInsights on Cloud cluster from an iPython notebook on Bluemix Spark as a service. My notebook looks like this: import os cwd = os.getcwd() with open('certificate', 'w') as f: f.write('''-----BEGIN…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
3 answers

BigSheets - java.io.IOException: java.lang.reflect.UndeclaredThrowableException

I'm unable to browse data with BigSheets on BigInsights on Cloud. When I select a file and change the Reader type, I receive the exception (see screenshot, below). I was previously able to browse data ok.
Chris Snow
  • 23,813
  • 35
  • 144
  • 309