Questions tagged [bluedata]

16 questions
1
vote
1 answer

How can I read data from DataTap using cpython?

I would like to read data from DataTap using cpython. In spark, I can do something like: df = spark.read.csv("dtap://MaprClus2/tmp/airline-safety.csv") How can I do the same if I am using cpython, for example when I don't have a pyspark Jupyter…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
vote
1 answer

bluedata pyspark hdfs write acces problem: hdfs_access_control_exception: permission denied

We have BlueData 3.7 running and I started the Cloudera 5.14 cluster with Spark and YARN. I get a csv file from Qumulo per NFS per DTAP into the Spark container and just do a small filter and save the outcome as parquet file per DTAP into our…
1
vote
2 answers

bdcsv.py returns "The datetime zone id '00:00' is not recognised"

I'm trying to run bdcsv.py: $ sudo python /opt/bluedata/bundles/bluedata-epic-entdoc-minimal-release-3.7-2207/scripts/monitoring/bdcsv.py \ -c localhost \ -f cred.json \ -s 2018/02/07-00:00:00 \ -e 2018/02/07-23:59:59 I received…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

BlueData - Kibana server is not ready yet

I've followed the instructions to add Kibana to my BlueData 4.0 controller: Login to the Controller. Execute the command docker exec -it monitoring- bash, where is the IP address of the Controller. Change the permissions of…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

catalog entry json has api specification error

I have created an application image for BlueData and deployed it to the app store. I noticed in tooltips on the app icon (App Store) the following error: catalog entry json has api specification error What can be causing this issue?
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

BlueData MLOPS - PermissionError: [Errno 13] Permission denied

I've created a MLOPS project on BlueData 4.0 and mounted the Project Repo (NFS) folder. I created the NFS service on Centos 7x as below: sudo yum -y install nfs-utils sudo mkdir /nfsroot echo '/nfsroot *(rw,no_root_squash,no_subtree_check)' |…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

bluedata mlops prediction - 'NoneType' object does not support item assignment

I'm calling the BlueData MLOPS prediction API to get a score for my model, but I'm getting the following response: ============================response begin========================================== Status code : {}200 Status text : {}OK Headers …
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How to read and write from datatap using Tensorflow on BlueData?

I want to be able to use BlueData's datatap directly from TensorFlow. With pyspark, I can do something like this: df.write.parquet('dtap://OtherDataTap/airline-safety_zero_incidents.parquet') Note that I don't need to set up any libraries - it's…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How to debug "ERROR: Could not reach the worker node."?

I am trying to set up a gateway and a worker node on a BlueData 3.7 controller using SSH credentials. I believe that I have met all the pre-requisites in the docs, however, I get an error trying to install the gateway and the worker in the…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How to add the lustre file system client to a BlueData container?

I'm trying to set up a lustre client (docs) inside a docker container running on BlueData. As per this post, I've modified the BlueData config on each worker and the controller node: $ vi…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How to work with sessions with the BlueData REST API?

I would like to use the BlueData API to programmatically manage by BlueData installation. I have seen some documentation here - it appears that I need to start off with Fetching a session but I would expect to create a session before I can fetch…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How to read data from DataTap using pyspark on Cloudera 5.x?

I have created a Cloudera 5.x cluster with the Spark option set: I would like to run a simple test using PySpark to read data from one Datatap and write it to another Datatap. What are the steps for doing this with PySpark?
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

Starting Kibana - "No such container: monitoring-x.x.x.x"

I'm working through the BlueData docs to enable Kibana. To start Kibana: Login to the Controller. Execute the command docker exec -it monitoring- bash, where is the IP address of the Controller. ... I've ssh'd into…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How do I pass docker parameters such as `--cap-add=XXX` to my docker instances running in BlueData?

I would like to run a container with --cap-add=IPC_LOCK. According to the BlueData 3.7 release notes, IPC_LOCK is supported: HAATHI-13547: Docker configuration now includes default IPC_LOCK capability for all deployed containers. IPC_LOCK is the…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
0
votes
1 answer

How to create credentials for Jupyterhub?

I have provisioned a Spark 2.3.1 notebooks & Jupyterhub with SSL & centos7x cluster on BlueData. When I access the Jupyerhub service, I am prompted for a username and password: My BlueData Tenant: is NOT configured for LDAP/AD has Cluster…
Chris Snow
  • 23,813
  • 35
  • 144
  • 309
1
2