Questions tagged [dremel]

Dremel is an ad-hoc query engine developed by Google. Google BigQuery is the external implementation of Google's internal tool Dremel. Use this tag for questions related to Dremel and use the google-bigquery tag for questions about BigQuery.

Dremel is an ad-hoc query engine developed by Google. Google BigQuery is the external implementation of Google's internal tool Dremel.

Dremel was designed for interactive queries and provides excellent response time due to it's tree-based architecture.

References

Related Tags

19 questions
19
votes
3 answers

Dremel - repetition and definition level

Reading Interactive Analysis of Web-Scale Datasets paper, I bumped into the concept of repetition and definition level. while I understand the need for these two, to be able to disambiguate occurrences, it attaches a repetition and definition level…
Tony Tannous
  • 14,154
  • 10
  • 50
  • 86
5
votes
1 answer

Understanding Parquet File's metadata information printed with parquet-tools "meta" command

I am in the process of learning Parquet File's internal representation, so I went through Apache Parquet's Github page, Google Dremel's paper to understand the definition and repetition levels and Twitter's blog to learn more about Parquet file. To…
2
votes
0 answers

Dremel, Null value in repeted field

I have a structure like this (I used JSON to represent data here, but this can be an object in any form): [ { "DocID": ["A", "B"] }, {}, ] Based on Dremel spec, The repetition level for the only data filed here "DocID" (which is…
fzerorubigd
  • 1,664
  • 2
  • 16
  • 23
1
vote
1 answer

Combining words (tomato,carrot) and display count of locales over time in Big Query [en,en-uk,en-sv,en-au as EN]

I am currently picking up SQL and have come across a problem. This question pertains to many use cases for me. I have locales in en-uk, en-au, es-latam, es-spain that I want to combine simply as EN or ES for reporting over time. Below are examples…
1
vote
0 answers

Separating columns ( array of arrays) - Advanced SQL looping

I tried using a name that more accurately describes my question but msg said I am limited to 150 chars. Looking for assistance from someone who has advanced SQL skills. Ideally I want to do it in SQL to let the computer do the work. Too much…
0
votes
1 answer

Can BigQuery queries get assigned to a different root nodes?

I have been trying to understand the limitations of BigQuery based on how it works under the hood, but it's hard for me to find an answer to the question in the title: Can BigQuery queries get assigned to a different root nodes? I currently work…
0
votes
0 answers

Dynamic Filter based on SESSION_USER

I have a table with rows that includes usernames. I have two users who will view the table and I want to filter the table to show specific usernames based on who the session_user is. Is this possible? Sample Data Username| --------- 1 | 2 …
Zyre Soriano
  • 575
  • 4
  • 13
  • 25
0
votes
1 answer

Combining Two Groups in SQL

I am looking to combine metrics for two groups under one field, while the other groups within that field remain on their own. How do I go about this? I am pulling operational metrics and grouping them by different pods. I have pods A, B, C and D. I…
0
votes
2 answers

Calculate the Time spent per user in a issue in sql

I have a table where I have multiple users working, I need to calculate total time spent per user based on status claimed ISSUEID Time User date status 10101 2019-08-30 19:51:43 peter 2019-08-30 CLAIMED 10101 …
syncdm2012
  • 445
  • 2
  • 10
  • 22
0
votes
2 answers

finding time difference between rows

I need to calculate the claimed time of IssueID by user Total claim time is time from status claim till recent waiting. Looks a bit complex Kindly help out. IssueID TransTime User Status 101 2019-08-23 0:25:41 Peter CLAIMED 101…
syncdm2012
  • 445
  • 2
  • 10
  • 22
0
votes
3 answers

Finding when requests are met or exceeded by customer by month

I have a table that has customers and I want to find what month the customer met or exceeded a certain number of requests. The table has customer_id a timestamp of each request. What I am looking for is the month (or day) that the customer met or…
mnickey
  • 727
  • 1
  • 6
  • 15
0
votes
1 answer

SQL Server : how to create table with columns name as schemaname.columnname

I am trying to create the table using below codes but the column name do not show schema name. Select customer as `tablename.customer` From tablename Also, I tried Select customer as [tablename.customer] From tablename Any suggestions? Thank…
KunalP
  • 1
0
votes
2 answers

How to convert symbols into ASCII (hex or dec) in BigQuery?

URLs aren't translating. For ex. & needs to be translated in to %26 in order for the auto-generated url to populate data in a dashboard. I've tried cast, convert and to_code_points but all to no avail.
S.R.
  • 3
  • 2
0
votes
0 answers

Dremel SQL: CASE WHEN 2 conditions return too many records

I would like to return a rows that have a column which values are between 80 and 91. Here's my query: SELECT SUM( CASE WHEN ROUND((offer_percent*100),1) > 90 THEN 1 ELSE 0 END) AS MoreThan90, SUM( CASE WHEN 80 <…
dave
  • 229
  • 1
  • 3
  • 13
0
votes
0 answers

aggregate of field that loops through records

I work in a ticketing system that exports data to a reporting DB using the company's version of SQL (similar to Postgre, though). I'm looking for a way to create a statement that pulls rows that are aggregates of related tickets. I've been having…
1
2