Questions tagged [pdi]

PDI (Pentaho Data Integration), also known as Kettle, provides extraction, transformation, and loading (ETL) capabilities.

PDI (Pentaho Data Integration), formerly known as Kettle, is a data integration project. It delivers Extraction, Transformation, and Loading (ETL) capabilities using a metadata-driven approach.

440 questions
1 vote, 1 answer

Manually register a plugin

I'm trying to run a transformation using PDI embedded in Java code, but I get an error about a missing plugin (it looks like it is the Abort step). I know I can use KETTLE_PLUGIN_BASE_FOLDERS so that PDI loads it automatically (that works), but in order…
ysn • 13 • 3
1 vote, 2 answers

How can I create dynamic JSON data to hit my REST client?

I'm a beginner with the Pentaho data integration tool. My database table has all the columns together. Now I want to create JSON from that table data in order to hit my REST client. My REST client accepts JSON like this, so I need to prepare the same. { …
Himanshi • 84 • 1 • 2 • 13
1 vote, 1 answer

PDI: how to combine IF and ISBLANK in the Formula step?

In my DB table, I have a column called type, and its possible values are "A", "B" or Null. If it's Null, I want to replace it with "C"; otherwise, no change. So here's what I did in the Formula step: IF(ISBLANK([type]);"C") However, I get an error message…
user11492726
1 vote, 2 answers

Pentaho Spoon/PDI: how to move files to folders with a different name every time?

I get new text files every month, from which I extract the data and do some transformations. At the end of every month, I need to move these files to a folder with the current date in its name. This means the destination folder's name is different…
user11492726
1 vote, 1 answer

Pentaho Data Integration Does not support IAM for authorization in S3 Output File Step

I am trying to upload a file to S3 which requires AWS credentials and a specific IAM role as well. I can connect using AWS CLI tools and Java SDK, but PDI 8.2 does not seem to support AWS roles or profiles for authentication and authorization.…
1 vote, 5 answers

Set Kettle DB Connection password from Environment Variable

How do I set a database connection's password to use an environment variable? I cannot add these to the kettle.properties file (security policy). When I press Control-Space on OSX, nothing happens. If I try to paste the string ${PASSWORD} directly…
NullPumpkinException • 1,396 • 1 • 18 • 22
1 vote, 3 answers

Pentaho Data Integration JSON input in transformation job not showing all JSON objects

I am working with the Pentaho Data Integration tool for some ETL processing jobs. My data input is a JSON file named data.json having many JSON objects as…
KayV • 12,987 • 11 • 98 • 148
1 vote, 1 answer

Replace "0" (Number) with NULL in Pentaho Data Integration

How do I replace a specific value (Number or Integer) with NULL? I've got a number field containing "0" and I need to replace it with NULL (empty). How do I do it with a Pentaho Data Integration step? I don't want to use JavaScript.
Dirk Sachse • 57 • 3 • 12
1 vote, 3 answers

Apache Commons VFS - How to open any directory followed by a specific directory?

I think this is a simple thing, but somehow I can't get it to work. I have this: ftp://user:password@host/Folder/Folder/Folder/../${Year}/${Month}/*.csv What I'm trying to do is open a folder with any name, followed by a folder with a specific name. I'm trying to access a file…
1 vote, 1 answer

Input xlsx or csv file in Kettle from a Microsoft SharePoint site that requires login

I want to read xlsx or csv files from a Microsoft SharePoint site that I need to log in to. I have an account on the SharePoint site, but Kettle's Microsoft Excel input step does not have options for login. The result is that the transformation gives an…
1 vote, 2 answers

Variable in Generate Rows step in PDI

I want to use a variable in the 'Generate rows' step in Pentaho. How can I do that? I want to put a file path in the 'Value' field, but it should not be static, so that I can pass this value into my CSV input.
1 vote, 1 answer

PDI - How to implement Rijndael CBC encryption in Kettle?

I have some CSV files containing sales transactions. The files come from another department in another state and are encrypted for security reasons. They are encrypted using Rijndael-128 in CBC mode. I need to decrypt the files before making a…
Rio Odestila • 125 • 2 • 19
1 vote, 1 answer

How to escape question mark (?) character in PDI Pentaho in a query

I'm trying to execute the following query in PDI 7.1 in an Execute SQL Statement step. update table_x set text = '? rows where update using https://example.net?param=1' where 1=1 The problem is that the first question mark corresponds to a…
1 vote, 2 answers

PDI - Check data types of fields

I'm trying to create a transformation that reads CSV files and checks the data type of each field in them. For example, the standard says field A should be a string(1) character and field B an integer/number. What I want is to check/validate: if A is not string(1)…
Rio Odestila • 125 • 2 • 19
1 vote, 2 answers

PDI - Collecting Files From FTP Older Than N Days

I have a job that collects data from FTP using Get a file with FTP, and I want it to collect only yesterday's files, files older than n days, or files based on a specific date. How can I do that? Is it possible at all? What I know is that Get a file with FTP only copies files…
Rio Odestila • 125 • 2 • 19