Questions tagged [pdi]

PDI Pentaho’s Data Integration, also known as Kettle, provides extraction, transformation, and loading (ETL) capabilities.

PDI (Pentaho Data Integration), formally known as Kettle, is a project of data integration. It delivers powerful Extraction, Transformation, and Loading (ETL) capabilities, using a groundbreaking, metadata-driven approach.

External Links:

440 questions
3
votes
2 answers

BIGQUERY - How to create a connection with Pentaho Data Integration (Spoon)?

I'm trying to have access to BigQuery through Pentaho Data Integration, but I'm not succeeding. System: OSX El Capitan Google BigQuery Authentication Method: Service Account with .p12 key I have followed this tutorial and I'm using…
luisfsns
  • 81
  • 1
  • 6
3
votes
1 answer

Unable to connect steps in Pentaho PDI installed on Windows 10

I installed a fresh copy of Windows 10 (Education version)and discovered that when I run Pentaho PDI (version 5.4) that I am unable to connect two steps together. Normally there is a dialog window that opens where you can select the add hop but…
Ray
  • 133
  • 2
  • 7
3
votes
1 answer

Pentaho ETL : Modified Javascript Step "SKIP_TRANSFORMATION" transformation Constant work logic

Why the SKIP_TRANSFORMATION works only when the CONTINUE_TRANSFORMATION assigned to trans_Status? I am not able to find information regarding this, in the wiki link //Not working (not getting skipped) if (sequence_value%2==0){ trans_Status =…
Jeet
  • 1,006
  • 1
  • 14
  • 25
3
votes
1 answer

Error when Installing pentaho data integration

I'm trying to install pentaho data integration and I have a problem with the configuration of spoon.bat. I updated the file with adding PENTAHO_JAVA= javaw.exe, also edited the environment variables (Computer > Properties > Advanced > Environment…
Fatim
  • 155
  • 3
  • 3
  • 13
3
votes
0 answers

Cassandra Input Step in Pentaho Spoon throws Read Negative Frame Size Error

I am using PDI 4.4.0. In the Cassandra input step, I am unable to read data from cassandra. Following is my stack trace. rg.apache.thrift.transport.TTransportException: Read a negative frame size (-2113929216)! at…
Poppy
  • 2,902
  • 14
  • 51
  • 75
3
votes
4 answers

Pentaho: execute insert only if there are no duplicates

Basically I want to insert a set of rows only if there are no changes from the target row. I have implemented a blocking step to wait for all rows to be processed before proceeding. After this I want to add a condition to check if there are any…
DRTauli
  • 731
  • 1
  • 8
  • 25
3
votes
2 answers

How to store a variable from one line for use in later lines in Pentaho kettle?

I have to process a spreadsheet that has multiple levels of aggregation within it. Mostly, this is just fine, but in one case, I need to use information from the highest aggregation level in conjunction with information from the next aggregation…
Brian
  • 31
  • 1
2
votes
1 answer

Unable to find ID Kettle

I'm trying to integrate Kettle in Java and execute a Transformation. The packages are downloaded correctly. However, when I try to run the transformation as a Java Application i receive the following exception: Unable to find plugin with ID…
Marcelo
  • 43
  • 6
2
votes
0 answers

PDI transformation does not send messages to Kafka server

I have a transformation in Pentaho Data Integration (PDI) that makes a query to NetSuite, builds JSON strings for each row and finally these strings are sent to Kafka. This is the transformation: When I test the transform against my local Kafka it…
2
votes
1 answer

Pentaho / Salesforce: How to integrate SF-Enterprise-Web-Services-API V48.0 into PDI 9.0 that only supports v47.0

actually I am working with PDI 8.2, however I am able to upgrade to 9.0. The main issue is that a customer wants to pull data from salesforce which works well so far. But he is using the Enterprise Web Services API with version 48.0, latest Pentaho…
notAnExpert
  • 123
  • 1
  • 5
  • 15
2
votes
3 answers

Pentaho MS-SQL set IDENTITY_INSERT a00 ON for table output

PDI 9.0 Simple job MS SQL connection Table input -> tableouput tableouput table : CREATE TABLE [dbo].[a00]( [ProductID] [int] NOT NULL [IDENTITY(1,1)][1] .. I need to write IDENTITY explicitly in SSMS I run set IDENTITY_INSERT a00 ON and…
FedorFFF
  • 21
  • 2
2
votes
0 answers

PDI Mongo Input Parameter

I'm using two transformations and one job. My first transform gets two fields and I attempt to pass those fields in my job down to my second transform - in my mongo input, query step { dataObjectId: ${idddddddddd} } when that step runs I NEVER…
Tony
  • 8,681
  • 7
  • 36
  • 55
2
votes
1 answer

Java API - Pentaho - PDI - The transformation path ... is invalid, and will not run successfully

I'm using java api for de transformations. When I running local (eclipse/windows), the transformation works perfectly! But I will run in linux, the transformation doens't works and return the problem: "The transformation path…
Tiago Emil
  • 21
  • 2
2
votes
1 answer

PDI: Output only if no errors

i want to transform a csv-file to an XML-file. In the Transformation i have also a small Validation of data, for example length of a string must be < 50. So i have a Textfile Input Step > Modified JavaScript Step with two hops to Abort Step (for the…
2
votes
2 answers

Unzip file issue in Pentaho-spoon

I'm trying to unzip a file in a job, everything's ok until the filename inside the zip has some special characters like "á, é, í, ó, ú". When the filename inside the zip has those characters I got an error and this log: Unzip file - ERROR (version…
Roger Nieto
  • 63
  • 10
1
2
3
29 30