Questions tagged [qubole]

Qubole Data Service (QDS) is a cloud Big Data service running on an elastic Hadoop-based cluster.

Source: Creators of Facebook’s Big Data infrastructure and Apache Hive have leveraged their experience to deliver Qubole Data Service (QDS) – a cloud Big Data service offering the same advanced capabilities used by Big Data-savvy organizations.

Minimize operational interaction and provide your data analysts with an easy-to-use graphical interface, built-in connectors, and seamless, elastic cloud infrastructure.

Your Hadoop cluster is ready within minutes of signup, letting you focus on building sophisticated data pipelines, running queries, scheduling jobs, and monetizing your big data.

With an auto-scaling cluster, improved I/O optimization, faster queries, and support for hybrid pricing, you can realize cost savings of as much as 50%-60% in total while accomplishing tasks faster.

87 questions
0
votes
0 answers

Container Packing in YARN

Qubole has implemented Container Packing in YARN for cloud deployments to reduce infrastructure cost. Is there a similar implementation available in the open source world?
banjara
  • 3,800
  • 3
  • 38
  • 61
0
votes
1 answer

Qubole: How can I download scheduler result in python?

As the title says, I managed to download the Qubole result using the query ID in Python. However, is there a way to download the result using the scheduler job ID instead of the query ID? Thanks.
atsang01
  • 207
  • 3
  • 12
0
votes
1 answer

unable to connect ms sql server from Presto in Qubole

I am using Qubole Data Service on Microsoft Azure. I have created a Presto cluster in Qubole. I want to connect to MS SQL Server from Presto to read data from it. I have created a sqlserver directory on…
Heta Desai
  • 57
  • 1
  • 11
0
votes
1 answer

Comparing one day worth of data from S3 buckets faster

Consider the 2 data flows below:
1. Front End Box ----> S3 Bucket-1
2. Front End Box ----> Kafka --> Storm ---> S3 Bucket-2
The logs from the boxes are being transferred to S3 buckets. The requirement is to replace flow 1 by flow 2. Now the data…
Albatross
  • 669
  • 7
  • 24
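The excerpt is cut off, so the exact comparison the asker needs is unknown. As a rough sketch only, one common way to check that two flows delivered the same log lines is to count lines by digest on each side and diff the counts. The function names below are hypothetical, and the inputs are plain iterables of lines (how they are read from the two S3 buckets is left out on purpose).

```python
from collections import Counter
import hashlib


def line_digests(lines):
    """Count occurrences of each log line, keyed by its SHA-256 digest."""
    return Counter(
        hashlib.sha256(line.rstrip("\n").encode("utf-8")).hexdigest()
        for line in lines
    )


def compare(lines_a, lines_b):
    """Return how many lines appear only on one side (duplicates counted)."""
    a, b = line_digests(lines_a), line_digests(lines_b)
    return {
        "only_in_a": sum((a - b).values()),  # Counter subtraction drops matches
        "only_in_b": sum((b - a).values()),
    }
```

Because the counts are order-independent, this tolerates the two flows writing lines in a different order or into differently split files. Both digest tables are held in memory, so a full day of logs may need to be processed in partitions.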
0
votes
1 answer

How to query data from gz file of Amazon S3 using Qubole Hive query?

I need to get specific data from a gz file. How do I write the SQL? Can I just query it like a database table? For example: Select * from gz_File_Name where key = 'keyname' limit 10. But it always comes back with an error.
daxue
  • 259
  • 1
  • 2
  • 11
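A hedged sketch of one usual approach: Hive reads gzip-compressed text files transparently when they sit under a table's location, so the common pattern is to declare an external table over the S3 directory and query that table, not the file name. The helper below only builds the HiveQL strings. The table name `gz_logs`, the columns, the comma delimiter, and the `s3://example-bucket/logs/` path are all made-up placeholders, not details from the question.

```python
def external_table_ddl(table, columns, location, delimiter=","):
    """Build a CREATE EXTERNAL TABLE statement for delimited text files.

    `location` should be the directory that holds the .gz files, not a
    single file. All names passed in here are illustrative assumptions.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} ({cols}) "
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}' "
        f"STORED AS TEXTFILE LOCATION '{location}'"
    )


# Hypothetical schema and path, for illustration only.
ddl = external_table_ddl(
    "gz_logs",
    [("key", "STRING"), ("value", "STRING")],
    "s3://example-bucket/logs/",
)
query = "SELECT * FROM gz_logs WHERE key = 'keyname' LIMIT 10"
```

Running `ddl` once and then `query` in the Hive query editor would be the intended usage. Whether the real file is comma-delimited is unknown, so the row format would need to match the actual data.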
0
votes
1 answer

Adding results of Hadoop job to Hive Table

I have a Map-only job that processes a large text file. Each line is analyzed and categorized. MultipleOutputs are used to output each category into separate files. Eventually all the data gets added to a Hive table dedicated to each category. …
FrankGT
  • 117
  • 7
0
votes
2 answers

Inserting local csv to a Hive table from Qubole

I have a CSV on my local machine, and I access Hive through the Qubole web console. I am trying to upload the CSV as a new table, but couldn't figure out how. I have tried the following: LOAD DATA LOCAL INPATH INTO TABLE ; I get the error…
0
votes
1 answer

HDFS Data Export error: could only be replicated to 0 nodes, instead of 1

After successfully running a Hive query, when the Data Export to MySQL starts in Qubole, I get the following error (in my log file): org.apache.hadoop.ipc.RemoteException: java.io.IOException: File …
0
votes
2 answers

Is there an api to send notifications based on job outputs?

I know there are APIs to configure notifications when a job fails or finishes. But what if, say, I run a Hive query that counts the number of rows in a table? If the returned result is zero, I want to send out emails to the concerned parties.…
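A minimal sketch of the client-side version of this idea, assuming the query result has already been fetched as text by some other means. It does not use any Qubole API, and the function names, addresses, and table name are hypothetical. It parses the count and builds an email only when the count is zero.

```python
from email.message import EmailMessage


def row_count(result_text):
    """Parse the single integer that a COUNT(*) query returns as text."""
    return int(result_text.strip().splitlines()[0].split("\t")[0])


def build_alert(result_text, sender, recipients, table):
    """Return an EmailMessage when the count is zero, otherwise None."""
    if row_count(result_text) != 0:
        return None
    msg = EmailMessage()
    msg["Subject"] = f"Alert: {table} returned zero rows"
    msg["From"] = sender
    msg["To"] = ", ".join(recipients)
    msg.set_content(f"The row-count check on {table} returned 0.")
    return msg
```

A non-None message could then be handed to `smtplib.SMTP(host).send_message(msg)`, where `host` is whatever mail relay is available. An empty result text is not handled here and would raise an error.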
-1
votes
1 answer

trying to execute s3-sqs qubole connector for spark structured streaming

I am trying to follow https://github.com/qubole/s3-sqs-connector and load the connector, but it seems the connector is not available on Maven, and when I generate the build manually the SQS classes are not loaded. Can someone guide me on…
Dipesh
  • 1
-1
votes
1 answer

Can I write an HTML script and pass information from the script to a cell on Qubole?

Is it possible to write an HTML script, have the user interact with it, pass the data back to the Zeppelin cell, and have the cell rerun with the data passed back? Thank you! Update: Have some progress on rerunning the cell on an HTML…
Dillon
  • 11
  • 1
-2
votes
1 answer

Error running presto query on kinesis

I am getting the following error while running a Presto query on Kinesis. Query 20151031_142753_00003_xxxxx failed: Rate exceeded for stream xxxxx under account xxxxxx. (Service: AmazonKinesis; Status Code: 400; Error Code:…
Rajesh
  • 17
  • 3