Questions tagged [hawq]

This tag is for questions about Pivotal HAWQ, a SQL on Hadoop implementation

Pivotal HAWQ supports low-latency analytic SQL queries, coupled with massively parallel machine learning capabilities, to shorten data-driven innovation cycles for the enterprise. HAWQ enables discovery-based analysis of large data sets and rapid, iterative development of data analytics applications that apply deep machine learning. It reads data from and writes data to HDFS natively. Using HAWQ functionality, you can interact with petabyte range data sets. HAWQ provides users with a complete, standards-compliant SQL interface to Hadoop.

Homepage

Official Documentation

126 questions
1
vote
1 answer

Query failing with "ERROR: Canceling query because of high VMEM usage"

We have small array of gpdb cluster. in that, few queries are failing System Related information TOTAL RAM =30G SWAP =15G gp_vmem_protect_limit= 2700MB TOTAL segment = 8 Primary + 8 mirror = 16 SEGMENT HOST=2 VM_OVERCOMMIT RATIO =72 Used this calc…
vs0731
  • 17
  • 1
  • 8
1
vote
2 answers

I need suggestion if any one tried Apache HAWQ installation built on top of Quantcast File System (QFS)

I have requirement where need to install Apache HAWQ on top of Quantcast File System (QFS). As I understood Apache HAWQ is installed on top of HDFS. Suggestion / guidance please.
Mahesh
  • 243
  • 1
  • 5
  • 13
1
vote
1 answer

HAWQ table size estimation

I have a question around the storage size calculation/estimation for a table to be loaded into HAWQ? I have a 30MB table in HIVE, which I am trying to load using PXF into HAWQ, example: create table t2 tablespace data as select * from…
1
vote
1 answer

Spark + HAWQ Integration (HDP 2.4.2)

I am using HDP 2.4.2 I want to connect Spark with HAWQ for data ingestion. Please let me know if there is any recommended/correct approach, currently I am using postgress jdbc driver for connecting spark with HAWQ. I am facing issues like -DataFrame…
nilesh1212
  • 1,561
  • 2
  • 26
  • 60
1
vote
0 answers

How to exclude a method from an XA transaction with Spring annotations

I have a complex process that is designed to be transactional, and is coded to be tranactional with Spring annotations. When I run it against databases that have XA aware drivers, it works well (apart from the headache of setting up each RDBMS to…
pojo-guy
  • 966
  • 1
  • 12
  • 39
1
vote
1 answer

Pivotal HDB - How to push data from HAWQ Internal /external readable table TO Greenplum

We have small hadoop and Greenplum cluster. Current data pipeline flow is : External table >> hadoop-hawq external readbale table >>hawq internal table. Output : 1.WE are trying to extend data pipeline using GREENPLUM. Basically wan to push HAWQ …
NEO
  • 389
  • 8
  • 31
1
vote
2 answers

HAWQ installation on Redhat

I am installing HAWQ on RedHat servers provisioned on Amazon EC2. I already have HDP 2.3 setup on the cluster. I have cloned HAWQ from Github. First I run ./configure --prefix=/opt/hawq. In the second step, I run make. The dependencies are compiling…
Raman
  • 1,221
  • 13
  • 20
1
vote
1 answer

Setting up passwordless ssh failed for all the HAWQ hosts

we have 3 node and trying to setup hdfs and pivotal hawq with ambari and i have already enabled passwordless ssh for all the 3 machines but when i start hawq service i am getting "Setting up passwordless ssh failed for all the HAWQ hosts" this…
Dython
  • 134
  • 4
  • 12
1
vote
2 answers

Master postgres initdb failed while deploying HAWQ 2.0 on Hortonworks

I tried to deploy HAWQ 2.0 but could not get the HAWQ Master to run. Below is the error log: [gpadmin@hdps31hwxworker2 hawqAdminLogs]$ cat ~/hawqAdminLogs/hawq_init_20160805.log 20160805:23:00:10:050348…
HP.
  • 19,226
  • 53
  • 154
  • 253
1
vote
4 answers

How does HAWQ resource manager do if it can not acquire enough resource?

As we know, if a query ask resource manager to get resource but can not get enough resource(below than the minimal numbers of vsegs), it will still ask resource in the next round. For example, if a query ask 1000 vsegs at first time, but it only get…
1
vote
5 answers

PXF JSON plugin error

Using HDP 2.4 and HAWQ 2.0 Wanted to read json data kept in HDFS path into HAWQ external table? Followed below steps to add new json plugin into PXF and read data. Download plugin "json-pxf-ext-3.0.1.0-1.jar" from…
Pra
  • 31
  • 6
1
vote
1 answer

Getting error try to select hive table using hcatalog from HAWQ

I am using Hortonworks (HDP)sandbox and on top of that install HAWQ 2.0 I'm trying to select hive table using hcatalog but not able to access hive tables form HAWQ. Executing below steps mention in pivotal doc. postgres=# SET pxf_service_address TO…
Pra
  • 31
  • 6
1
vote
2 answers

Greenplum Security: How do i setup Password-less Login: Windows (via Putty) to Linux(greenplum master machine-centos6.5) in gpdb cluster

We have small array of GPDB Cluster. We have recently upgraded the GPDB and configured the GCC web portal. In that, we noticed Our machines is still showing password at the time of SSH . We tried following Document commands on greenplum…
NEO
  • 389
  • 8
  • 31
1
vote
2 answers

How to get total row count and max(timestamp) column in select list for all table in particular schema

We have Postgres based Read Only database. in that,we have 52 tables under one schema. We are trying to output row count and max(timestamp) column for all tables under one schema. Environment is : PostgreSQL 8.2.15 (Greenplum Database 4.2.0 build 1)…
NEO
  • 389
  • 8
  • 31
1
vote
3 answers

Hadoop dependences error when I tried to install pxf-service

I build Apache Hawq from source. When I tried install pxf-service, the error below occurs: [root@host-172-16-0-108 hawqadmin]# rpm -ivh pxf-service-3.0.0-22126.noarch.rpm error: Failed dependencies: hadoop >= 2.7.1 is needed by…
louis lau
  • 161
  • 3
  • 11
1
2
3
8 9