Questions tagged [pdi]

Pentaho Data Integration (PDI), also known as Kettle, provides extraction, transformation, and loading (ETL) capabilities.

PDI (Pentaho Data Integration), formerly known as Kettle, is a data integration project. It delivers extraction, transformation, and loading (ETL) capabilities using a metadata-driven approach.

440 questions
2
votes
2 answers

Migrating Transformations in Pentaho PDI

We are using two servers, one as preprod and the other as production. When we migrate jobs or transformations from preprod to prod, their connection properties are copied as well, and this affects our production job execution. Can someone let me know…
suraj08
  • 119
  • 2
  • 11
2
votes
4 answers

How to Run PDI Transformation with Database from Java?

I am trying to run a PDI transformation involving a database (any database, though NoSQL ones are preferred) from Java. I've tried using MongoDB and Cassandra and got missing-plugin errors. I've already asked here: Running PDI Kettle on Java - Mongodb…
M.R. Murazza
  • 346
  • 3
  • 12
2
votes
2 answers

How should I perform data masking with pentaho PDI (spoon)?

I want to perform data masking for more than 10 tables, and each table has more than 100 columns. I tried to mask the data using the Pentaho PDI tool, but I couldn't find out how to write masked data with it. How should I perform data masking with…
Naga
  • 10,944
  • 2
  • 21
  • 38
2
votes
1 answer

Compare table schemas before starting a job

We are currently working on a project where we need to check whether the database schema has changed every time we start a Spoon job, since our origin is a third-party database over which we have little to no control. The most obvious solution to us would be to…
Isabelle
  • 631
  • 1
  • 7
  • 14
2
votes
1 answer

Split a string in pentaho data integration

I'm a beginner with Pentaho Data Integration and I want to split a string of the following form: FIRSTNAME LASTNAME CODE. I want to isolate the first name and last name from the code, noting that the last name can contain more than one word. I thought…
Zakaria Belghiti
  • 521
  • 3
  • 8
  • 19
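
One possible way to approach the split described in this question, shown as a minimal Python sketch rather than a PDI-specific answer. It assumes the first whitespace-separated token is the first name, the last token is the code, and everything in between is the last name. The function name `split_name_and_code` and the sample value are hypothetical and do not come from the question.

```python
def split_name_and_code(value: str) -> tuple[str, str, str]:
    """Split 'FIRSTNAME LASTNAME CODE' into (first name, last name, code).

    Assumption: the first token is the first name, the last token is the
    code, and all tokens in between form the (possibly multi-word) last name.
    """
    tokens = value.split()
    if len(tokens) < 3:
        raise ValueError(f"expected at least 3 tokens, got {len(tokens)}: {value!r}")
    first_name = tokens[0]
    code = tokens[-1]
    last_name = " ".join(tokens[1:-1])
    return first_name, last_name, code


if __name__ == "__main__":
    # Hypothetical sample row; the real data format may differ.
    print(split_name_and_code("JOHN VAN DER BERG A123"))
```

The same first-token and last-token logic could be ported to whichever scripting or string-handling step is used in the transformation. It breaks down if the first name itself contains spaces, which the question does not address.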
2
votes
2 answers

Pentaho - CSV Input not understanding special character [Windows to Linux]

I have a transformation in Pentaho Data Integration where the first thing I do is use the "CSV Input" step to map my flat file. I've never had a problem with it on Windows, but now I'm changing the server that Spoon is going to run on to a Linux server and…
Leonardo Sibela
  • 1,613
  • 1
  • 18
  • 39
2
votes
1 answer

Is it Possible to have Multiple Kettle property files with multiple values but same parameter name

Is it possible to have multiple Kettle property files with multiple values but the same parameter name? For example, I have client_id and client_name defined in the Kettle properties as 1 and Microsoft, and I also want to store the same parameters but the…
Coding_line
  • 127
  • 4
  • 15
2
votes
1 answer

Filter Rows Step not working and initializing both the conditions

As shown in the attached picture, I need to pass rows to the Execute SQL step only if the count value from the Table Input step is <> 0. Otherwise, the flow should move to the Dummy step and not go to the execute process. But it is not working, and the table is getting deleted, which is actually the…
Coding_line
  • 127
  • 4
  • 15
2
votes
2 answers

Pentaho Row Denormaliser Step Not Working

I have some sorted data that I'm trying to denormalize but the step in Pentaho isn't working correctly. Here is a snapshot of the sorted data: And here is a snapshot of the Row Denormaliser Step as I've configured it: What I get is: There are no…
Dezzie
  • 934
  • 3
  • 18
  • 35
2
votes
1 answer

In Pentaho, how to pass a text file which contains all the definitions of the connection parameters in the job?

I am using a JDBC connection and passing parameters such as ${sample_db_connection}. That parameter is defined on the server in a text file as sample_db_connection=localhost, and I want to pass the text file in the job step so that…
Coding_line
  • 127
  • 4
  • 15
2
votes
3 answers

Pentaho Data Integration commercial costs + azure hosting

I did some Google searches but could not find any clear answer. How much does PDI/Kettle cost for commercial usage? Is it potentially free? Can it be hosted in Azure?
cs0815
  • 16,751
  • 45
  • 136
  • 299
1
vote
1 answer

Insert XML expressions into a working XML string using Modified JavaScript Value step in PDI

I'm building an XML string for a SOAP request in Pentaho Data Integration using a Modified JavaScript Value step. The request must execute for each row (order), and each row will include data for a variable number of items. I need to bring the variable…
1
vote
0 answers

Pentaho Data Integration Community Edition 9.4 - Shared objects file feature of transformation doesn't support variable

Our company has been using Pentaho Data Integration Community Edition (PDI-CE) 8.0 (Linux). We are upgrading to PDI-CE 9.4. In the process, we ran into an issue. To reproduce the issue, clone the project…
Albert Wang
  • 524
  • 4
  • 7
1
vote
0 answers

How to distribute data from GridDB to different data warehouses?

I am currently collecting data from different PLC devices to GridDB for storage. The format of the data is as follows:

D_NAME    DATA         MSG
Siemens   2021/10/4    acbdfg
Omron     2021/10/4    ponged

It is necessary to distribute the row whose device…
dogs Cute
  • 564
  • 3
  • 9
1
vote
1 answer

Pentaho Data Integration - Merge rows (diff) step not executing with Table Input "Replace variables in script"

I have a PDI Transformation that is using the Merge rows (diff) step comparing rows from two Table Input steps (with Sorts in between). (I linked a screenshot of the Transformation at the bottom of the post) The queries in the Table Input require…