Questions tagged [trifacta]

Trifacta is designed to let analysts explore, transform, and enrich raw data into clean, structured formats. It uses techniques from machine learning, data visualization, human-computer interaction, and parallel processing so that non-technical users can prepare data for a variety of business processes such as analytics.

https://www.trifacta.com/resource-library/introducing-trifacta-wrangler/
https://en.wikipedia.org/wiki/Trifacta

14 questions
3
votes
2 answers

Google Cloud - get current time offset from UTC of timezone (when timezone is described using Supported Time Zone Value)

I'm including timezone identifiers as strings in data that's being stored in Google Cloud Datastore, and I want to get the current time offset from UTC (including any adjustment for daylight saving time) for that timezone identifier. For example I…
Chris Halcrow
  • 28,994
  • 18
  • 176
  • 206
2
votes
1 answer

Combine multiple rows into single row in Google Data Prep

I have a table that has multiple payload values in separate rows. I want to combine those rows into a single row to have all the data together. The table looks something like this. +------------+--------------+------+----+----+----+----+ | Date |…
VSR
  • 87
  • 2
  • 18
2
votes
2 answers

Google Dataprep copy flows from one project to another

I have two Google projects: dev and prod. I also import data from different storage buckets located in these projects: dev-bucket and prod-bucket. After I have made and tested changes in the dev environment, how can I smoothly apply (deploy/copy)…
WJA
  • 6,676
  • 16
  • 85
  • 152
1
vote
0 answers

Dataprep - missing rows after processing

I have a CSV containing 1.5 million rows. I prepared a Dataprep job that parses the data and stores it in BQ (or CSV). But after processing, nearly half of the rows are missing (around 700k). When I run this Dataprep job without any recipe steps, I get the same…
y0j0
  • 3,369
  • 5
  • 31
  • 52
1
vote
1 answer

In Trifacta or Google Cloud Dataprep, I'm trying to flag rows with non-alphanumeric characters (�). What formula do I use?

In Trifacta or Google Cloud Dataprep, I'm trying to flag rows with non-alphanumeric characters (�). What formula do I use? I tried this formula, but it doesn't work: Replace Matches of `�` from EMPLOYEE_FIRST with NOT VALID
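
For illustration only, and outside Dataprep: the � in the question is presumably the Unicode replacement character (U+FFFD), which typically shows up when bytes could not be decoded. Under that assumption, the flagging logic can be sketched in Python as below. The `flag_invalid` helper, the `FLAG` column, and the `VALID` label are hypothetical names, and this is not Trifacta recipe syntax.

```python
REPLACEMENT_CHAR = "\ufffd"  # the � character (U+FFFD); assumed to be what the question means


def flag_invalid(rows, column):
    """Return copies of the rows with a FLAG column marking values that contain U+FFFD."""
    flagged = []
    for row in rows:
        is_bad = REPLACEMENT_CHAR in row[column]
        flagged.append({**row, "FLAG": "NOT VALID" if is_bad else "VALID"})
    return flagged


# Hypothetical sample data; only the column name comes from the question.
rows = [{"EMPLOYEE_FIRST": "Ren\ufffde"}, {"EMPLOYEE_FIRST": "Maria"}]
result = flag_invalid(rows, "EMPLOYEE_FIRST")
```

Flagging into a separate column, rather than overwriting the name, keeps the original value available for later inspection.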
1
vote
1 answer

How to unpivot an unknown number of columns in Google Dataprep / Trifacta?

Trifacta / Google Dataprep allows one to unpivot data using its Unpivot Transform operator, in which one specifies which columns to unpivot at the design stage. How could one unpivot an unknown number of columns? Here is a data example: The…
Jan Krynauw
  • 1,042
  • 10
  • 21
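
As a rough illustration of the idea behind this question (not Dataprep syntax), one way to unpivot an unknown number of columns is to name only the identifier columns and treat every other column as one to unpivot. The Python sketch below assumes that approach; the `unpivot` helper, the `key`/`value` output names, and the sample data are all hypothetical.

```python
def unpivot(rows, id_columns):
    """Turn every column not listed in id_columns into (key, value) rows."""
    id_set = set(id_columns)
    long_rows = []
    for row in rows:
        ids = {name: row[name] for name in id_columns}
        for name, value in row.items():
            if name not in id_set:
                long_rows.append({**ids, "key": name, "value": value})
    return long_rows


# Hypothetical input: the second row has a column the first one lacks.
rows = [
    {"id": 1, "jan": 10, "feb": 20},
    {"id": 2, "jan": 30, "feb": 40, "mar": 50},
]
long_rows = unpivot(rows, ["id"])
```

Because the columns to unpivot are discovered per row, new columns in later data are picked up without changing the code.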
1
vote
1 answer

Does Google Cloud Dataprep support importing Google Drive Sheets as data sources?

I'm importing datasets in Google Cloud Dataprep (by Trifacta) to perform transformations on my data sources. But I can't see Google Drive Sheets in the list after connecting them to the BigQuery console. I'm about to use them as rules for my…
1
vote
2 answers

Cloud Dataprep - Replace code or id with value with middle dataset

I'm really new to GCP Dataprep and am trying to create a recipe, but I can't figure out how to do it. In summary, I have 2 files, the first one with these columns: NAME, CONTRY_CODE, ... And the second one with: COUNTRY_CODE,…
1
vote
1 answer

Remove quotes from CSV output

I'm using Trifacta Wrangler to rearrange some datasets, stored in CSV files. I have to produce a CSV file as output, but when I do that Trifacta Wrangler surrounds every value, even integers and floats, with quotes, even if the column type is…
DevOlly
  • 91
  • 1
  • 5
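
If the quoting cannot be changed in the export itself, one possible workaround is to post-process the file. The sketch below is an assumption-laden illustration rather than a Trifacta Wrangler feature: it uses Python's standard `csv` module to re-emit fully quoted CSV text with minimal quoting, and `requote_minimal` is a hypothetical helper name.

```python
import csv
import io


def requote_minimal(csv_text):
    """Re-emit CSV text so that only fields that need quoting stay quoted."""
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    # QUOTE_MINIMAL quotes a field only if it contains the delimiter,
    # the quote character, or a line break.
    writer = csv.writer(out, quoting=csv.QUOTE_MINIMAL, lineterminator="\n")
    writer.writerows(reader)
    return out.getvalue()


# Hypothetical sample: every value is quoted, including the numbers.
quoted = '"id","price","name"\n"1","2.5","Smith, J"\n'
cleaned = requote_minimal(quoted)
```

Values containing a comma keep their quotes, so the output still parses back into the same fields.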
0
votes
1 answer

Trifacta, upload Recipe output to existing SharePoint list

I'm using Trifacta (8.7 I think) and I need to upload the output of a recipe to an existing SharePoint list, but I'm not seeing where I define which column in my output goes to the column in the SharePoint list. I'm hoping I don't have to match by…
0
votes
1 answer

How to discard (trim) double inverted quotes in CSV file on data prep?

I'm dealing with a CSV file that contains double inverted quotes (since the data has commas in it). But Google Sheets does not show me the double quotes, and hence I'm able to split the column by a delimiter (space in this case) properly, to retrieve…
Salih
  • 391
  • 1
  • 13
0
votes
0 answers

AWS RDS PostgreSQL - copying from/to csv files on EC2 instance

I've run into a problem that I haven't been able to fix for a few days. I have the following architecture: two EC2 instances, which are nodes running the Trifacta application (some kind of application for data scientists), and an AWS RDS PostgreSQL instance. Since…
0
votes
1 answer

Google Dataprep/Trifacta - Joining three datasets, de-duping but maintaining not matched records

This feels like it should be more straightforward than I am finding it. I have three datasets/lists of customers with some overlapping data, but with some customers who exist in only one of the lists. List A: 500k records List B: 80k records List C: 70k…
0
votes
1 answer

Dataprep string column in format yyyy-mm-dd HH:MM:SS to datetime

How do I convert in Dataprep a string column with format yyyy-mm-dd HH:MM:SS to a datetime column?
WJA
  • 6,676
  • 16
  • 85
  • 152
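
To make the format in the question concrete, here is how the same pattern maps onto format codes outside Dataprep. This is a minimal Python sketch, not Dataprep recipe syntax; it assumes that `mm` in the date part means month and `MM` in the time part means minutes, and `parse_timestamp` is a hypothetical helper name.

```python
from datetime import datetime

# Assumed mapping of the pattern yyyy-mm-dd HH:MM:SS:
# yyyy -> %Y, mm -> %m (month), dd -> %d, HH -> %H, MM -> %M (minutes), SS -> %S
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_timestamp(text):
    """Parse a 'yyyy-mm-dd HH:MM:SS' string into a naive datetime."""
    return datetime.strptime(text, TIMESTAMP_FORMAT)


parsed = parse_timestamp("2021-03-04 05:06:07")
```

A string that does not match the pattern raises `ValueError`, which makes malformed rows easy to spot.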