Questions tagged [tpc]

The industry-standard set of database benchmarks (TPC-DS, TPC-C, TPC-E, etc.)

The TPC (Transaction Processing Performance Council) is a non-profit corporation founded to define transaction processing and database benchmarks and to disseminate objective, verifiable TPC performance data to the industry.

See http://www.tpc.org/information/benchmarks.asp

40 questions
1 vote, 1 answer

make reports "undefined reference" when I try to build the TPC-E test

The problem is compounded by the fact that I have little programming experience. I need to use the TPC-E test, so I downloaded the source code from their website. It comes with a sparse, almost meaningless ReadMe. Here are its contents: To use the Makefile you…
1 vote, 2 answers

Hive SELECT query returning the top 100 rows gives a syntax error?

Here is my Hive query, straight from the TPC-DS toolkit: WITH customer_total_return AS (SELECT sr_customer_sk AS ctr_customer_sk, sr_store_sk AS ctr_store_sk, Sum(sr_fee) AS ctr_total_return …
crystyxn
1 vote, 1 answer

TPC-DS query generator

I need to test my data warehouse using TPC-DS. How can I generate queries for my data warehouse using TPC-DS? I tried to generate them, but the tool generates them for a specific data warehouse. Thanks.
pas
1 vote, 1 answer

Adding column to primary key removes uniqueness

I have around 34 million rows, each with 23 columns, in the store_sales table of the TPC-DS dataset. I have a composite primary key on columns ss_item_sk and ss_ticket_number. Upon running the query SELECT count(DISTINCT ) .. I'm…
Zzrot
1 vote, 0 answers

TPC-DS benchmark on Hadoop: why use a star schema?

I am trying to run the TPC-DS benchmark with Spark SQL. The documentation talks about having a star schema and a number of tables. My understanding of Hadoop is that it is better to have denormalized data, and then you can use a format like Parquet…
user2799564
1 vote, 1 answer

How to generate executable TPC-DS queries?

I have downloaded the DSGEN tool from the TPC-DS web site, generated the tables, and loaded the data into Oracle XE. I am using the following command to generate the SQL statements: dsqgen -input ..\query_templates\templates.lst…
user3395654
1 vote, 3 answers

Generating a TPC-DS database for SQL Server

How do I populate the Transaction Processing Performance Council's TPC-DS database for SQL Server? I have downloaded the TPC-DS tool but there are few tutorials about how to use it.
zli89
0 votes, 1 answer

Error loading TPC-H data generated by dbgen into MySQL

I am using https://github.com/electrum/tpch-dbgen to generate the 8 tables of TPC-H. However, when I load the data for the PART table into MySQL with the LOAD DATA INFILE command, SHOW WARNINGS reports tons of warnings such as: | Note | 1265 | Data…
Ilonpilaaja
0 votes, 1 answer

Does Spark TPC-DS support running on YARN?

I am testing Spark-3.3.0-without-Hadoop with TPC-DS, following spark-tpcds-datagen. This Spark runs on my Hadoop 3.2 installation. The data was generated and uploaded with -put to hdfs://xxx/tpcds/data330. When I run: ./SPARK/bin/spark-submit \ --master yarn \ #…
AppleCEO
0 votes, 0 answers

How to run the queries generated by qgen from the shell for Hive and Spark and record each query's execution time in an Excel file?

I've been working on a performance comparison between Hive and Spark in a cluster as my final-year project. I'm using TPC-H to generate the dataset and then loading it into each table with LOAD DATA INPATH. Then, I'm using qgen to generate the…
monchi
0 votes, 0 answers

How to Get Nested Structure Using EF Core 7 TPC

I have an abstract base class: public abstract class PossibleAnswerBase { public Uuid Id { get; protected set; } public string Title { get; protected set; } #region Navigation public Uuid QuestionId { get; protected set; } public…
Fallingsappy
0 votes, 0 answers

How to speed up the TPC-H benchmark on an Oracle database

I'm running the TPC-H benchmark on my Oracle database, testing against a 10 GB dataset. Currently I have target_memory set to 7 GB, but the test takes 18 minutes. The biggest problem is the lineitem table (7.5 GB), because its size does…
SItypack
0 votes, 2 answers

TPC-DS data in the Snowflake cloud warehouse is available only at 10 TB and 100 TB; is it possible to get it with fewer records (10 GB or even less)?

By default, Snowflake comes with TPC-DS datasets at the 10 TB and 100 TB scales. Is it possible to get the same set of tables with fewer records or less volume (10 GB or even less)? Thanks in advance!
0 votes, 1 answer

PySpark using both aggregate and group by

Can someone help me with PySpark using both aggregate and groupBy functions? I have built my data frames and applied filters and selects to get the data I want. However, I am now stuck trying to aggregate things correctly. Currently, my code outputs…
Wispers
0 votes, 1 answer

TPC-H queries: how to test and run them?

I have generated the TPC-H data. Now I am trying to run its queries on the database, but I can't figure out whether I should use specific values or somehow generate them. For example, let's…
Maltem