Questions tagged [greenplum]

Greenplum is the worlds first open-source massively parallel processing database based on PostgreSQL.It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes.

Greenplum is a massively parallel processing database based on PostgreSQL and is designed for analytic data warehouses to manage, store and analyze terabytes to petabytes of data. Greenplum is developed by Pivotal.

797 questions
0
votes
1 answer

How to declare new field values based on pivot value source field

I will try to word this as best as possible, so please bear with me. I am unpivoting data from 15 fields in Greenplum PostgreSQL. These fields may or may not contain numbers that I am unnesting into the new field. If the field does contain a number…
user3329160
  • 165
  • 2
  • 13
0
votes
1 answer

Is MERGE supported in Greenplum Database 4.3.5.1 build 1

I am trying to create a merge statement for Greenplum DB and I am getting an syntax error. So I am wondering if the MERGE is even supported the way I am writing it. I have two approaches Approach 1- MERGE into public.table20 pritab USING ( …
Nik
  • 371
  • 4
  • 15
0
votes
1 answer

How to alter column (Changing datasize), if table was created with partitions?

I have created table with a partition: CREATE TABLE edw_src.pageviewlog_dev ( accessurl character varying(1000), msisdn character varying(1000), customerid integer ) WITH ( OIDS=FALSE ) DISTRIBUTED BY (msisdn) PARTITION BY RANGE(customerid) …
0
votes
0 answers

Modify COALESCE function to consider two adjacent values for the result in Greenplum

I'm currently using the COALESCE function to grab the first NOT NULL value from a list of fields. Example: COALESCE(header_to_node_13.subsetname, header_to_node_12.subsetname, header_to_node_11.subsetname, header_to_node_10.subsetname, …
user3329160
  • 165
  • 2
  • 13
0
votes
2 answers

Can't DISTRIBUTE BY in psql GreenPlum

I am trying to create a table on a RHEL5 machine running GreenPlum (Postgres) server. The table I am trying to create needs to have a "DISTRIBUTE BY" clause with a few of the columns as its predicate. Without the "distribute by" the whole command…
Ronnie
  • 1
  • 1
0
votes
2 answers

PSQL - Select size of tables for both partitioned and normal

Thanks in advance for any help with this, it is highly appreciated. So, basically, I have a Greenplum database and I am wanting to select the table size for the top 10 largest tables. This isn't a problem using the below: select sotaidschemaname…
Vinny Pem
  • 23
  • 1
  • 3
0
votes
1 answer

Informatica CDC rejects data

Our source is oracle ERP system where we have installed Informatica CDC, our target is Greenplum tables to which we load the data as 1-1 logic. We execute the session in real time mode which means the session will be keep on running, when any…
0
votes
1 answer

psql copy from variable table name?

For some reason the code below breaks in psql as supplied with Greenplum at the \copy stage: \set tmp1 public.tmp1 DROP TABLE IF EXISTS :tmp1; CREATE TABLE :tmp1 (new_id varchar(255), old_id BIGINT) DISTRIBUTED BY (old_id); \echo :tmp1 \copy :tmp1…
mgoldwasser
  • 14,558
  • 15
  • 79
  • 103
0
votes
1 answer

How to get specific quantiles using R while connecting to Greenplum database?

I am trying to get quantiles from my data- Lets say (0.25, 0.5 and 0.75 ) for a variable . I could figure out how to do it from a dataframe Quantile <- mydataframe %>% summarize(quantile(variable_name,c(.25,0.5,0.75))) However instead of a…
sourav
  • 179
  • 1
  • 1
  • 14
0
votes
1 answer

How to add new partitions automatically in partitioned table based on data inserted in GreenPlum.?

I have a partitioned table in greenplum(modeled after psql), which has been partitioned with specific range of values. Now, i have to insert the data again into the same table. New values for Partitions might overlap with existing ones. I have…
Srini
  • 3,334
  • 6
  • 29
  • 64
0
votes
1 answer

In Greenplum where is data temporarily inserted into tables within a function actually stored?

Our project has a database function that stores millions of records from queries into intermediate tables as part of its steps, and then joins those intermediate tables to get the final results, which are stored in a final table. The intermediate…
James
  • 127
  • 2
  • 8
0
votes
1 answer

updating records in table A from table B if column XYZ ( checksum ) changes in table B

I am having a table A and table B. Table A is created from Table B ( and few other table join operation ). Table A has all of its column which are subset of column in table B. There is a column called as check_sum in table A and table B. This is…
user1188611
  • 945
  • 2
  • 14
  • 38
0
votes
1 answer

How can I generate dynamic SQL in WhereScape (on Greenplum)?

We have a table of business rules that define column names and tests to perform - null/populated, >=, <=, etc., and I want to create a procedure or function that implements these rules dynamically. Can I do this in WhereScape RED, or do I have to…
PhilHibbs
  • 859
  • 1
  • 13
  • 30
0
votes
3 answers

Updating tables in greenplum database using gpload

I am having a local table XYZ in greenplum. I am populating that table from data from 5 other tables ( table XYZ has few columns and data from 5 different tables, populated by some join operation ). This is working fine. But problem i am facing here…
user1188611
  • 945
  • 2
  • 14
  • 38
0
votes
1 answer

Inlist the comments associated with objects in Greenplum Database

Is there any way to query and inlist all the comments associated with tables and their columns Please reply back if you any query for this Thanks