Questions tagged [pdi]

Pentaho Data Integration (PDI), also known as Kettle, provides extraction, transformation, and loading (ETL) capabilities.

PDI (Pentaho Data Integration), formerly known as Kettle, is a data integration project. It delivers extraction, transformation, and loading (ETL) capabilities using a metadata-driven approach.

440 questions
1
vote
0 answers

Password Encryption for PDI Job

I am creating a batch file to schedule a PDI spoon job. F:\pdi-ce-9.1.0.0-324\data-integration\kitchen.bat /rep: /job:ParentJob /dir: /user: /pass: /level:Basic The problem here is we need to provide the password…
sweenal
  • 67
  • 8
1
vote
1 answer

Am I using switch/case wrong here to control?

I am trying to check the logs and, depending on the last log, run a different step in the transformation. Am I supposed to use some other steps, or am I making another mistake here? For example, if the query returns 1, I want Execute SQL script to run,…
1
vote
0 answers

Core Javascript Question -- Delete Object in a json file sourced from a column if conditions meet

The reason I need the code to work in core JavaScript is that the tool we use runs Rhino from Mozilla, so the code cannot contain objects or methods related to manipulating web pages. I am trying to compare data from two JSON files using core…
D S
  • 9
  • 3
1
vote
0 answers

Reading CSV file from Sharepoint with Pentaho

Is it possible to read a CSV file in a SharePoint shared folder with Pentaho steps? I have tried to do it with an HTTP call, but did not get the expected output.
sbaf
  • 21
  • 3
1
vote
1 answer

Pentaho PDI: execute transformation for each line from CSV?

Here's a distilled version of what we're trying to do. The transformation step is a "Table Input": SELECT DISTINCT ${SRCFIELD} FROM ${SRCTABLE} We want to run that SQL with variables/parameters set from each line in our…
1
vote
2 answers

My Pentaho ETL job (.kjb) files are not working after upgrading Pentaho Data Integration from 5.3 to 9.2

I used to run the Pentaho ETL jobs in PDI Community Edition 5.3 with Kitchen as shown below: ./kitchen.sh -rep=repo1 -file=/var/lib/jenkins/project/path/etl/Job1.kjb After upgrading PDI Community Edition to 9.2, the same ETL jobs are showing the…
1
vote
0 answers

Why is the log date time information being truncated in Pentaho Data Integration 9.2 against the Snowflake database?

I am migrating our Pentaho Data Integration (Kettle) logging from SQL Server to Snowflake. I was able to generate the tables against Snowflake using Kettle. When I run a Kettle job, the logs are stored in the Snowflake database. However,…
1
vote
2 answers

Convert an org.bson.Document to an org.bson.BasicBSONObject in Java?

I have a String in Pentaho Data Integration (PDI) a.k.a. Kettle, in a transformation's UDJC step. So using Java, I need to take that String which contains JSON from MongoDB, and put it into a BasicBSONObject. (I can use import statements but I can't…
Nathan
  • 65
  • 9
1
vote
0 answers

Pentaho PDI 9.2 will not launch

I have been using PDI for years, and PDI 9.1 since its release. I recently installed PDI 9.2, and on launching spoon.bat a Windows command window opens and shows the commands, but then the application terminates. No error messages…
Ray
  • 133
  • 2
  • 7
1
vote
1 answer

#PDI# Would transformation executor with reference to other transformations slow down the whole process?

Here's a simple illustration of my transformation (click the link). Step 1: The data flow is joined with the metadata, which contains the name of the transformation file that each record will eventually be passed to. Here's an example of the data after…
IMAPOTATO
  • 31
  • 3
1
vote
1 answer

Upload multiple files to pentaho

In Pentaho Data Integration, how do I import a list of .xlsx files that are in the same folder? Note: the number of columns is always the same.
Theorp
  • 151
  • 8
1
vote
2 answers

Pentaho Data Integration (PDI) 8.3 CE showing error "localhost:9051" page not found on all dialogs except browse repository

I hope you can help me out here. I have a very strange situation and no idea how to solve it. I installed Pentaho Data Integration (PDI) 8.3 on a Windows Server. I use separate home folders and a database repository. It worked just fine but out of…
ribun
  • 11
  • 1
1
vote
1 answer

How to use date in PDI in MongoDB input step?

I am using the MongoDB Input step in my transformation, and the query I am using is not working. On Mongo Playground and Compass the query works fine; however, the exact same query in Kettle gives the following error: 2020/11/26 11:41:10 -…
ghengalala
  • 109
  • 9
1
vote
1 answer

PDI Kettle - Data lookup and append value

I have 2 types of input data coming from a file. Data 1 contains attribute-value pairs. Currently it has 4 attributes with values, but it could contain more attribute-value pairs depending on the input. The attribute name would be the same as…
vish
  • 29
  • 4
1
vote
1 answer

Pentaho: Loop through start date to end date

Within Pentaho's PDI, I am trying to go day by day from a start date until the current date, passing each date to a REST Client step. So for example, I start 5 days ago, pass that date to the REST Client step, then go to 4 days ago, pass that…
slybitz
  • 689
  • 1
  • 7
  • 18
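
The day-by-day iteration described in the question above can be sketched outside PDI as follows. This is only an illustration of the looping logic, not a PDI job or the asker's actual setup: the function name `iter_days` is hypothetical, and the step that would consume each date (the REST Client call) is represented by a plain `print`.

```python
from datetime import date, timedelta
from typing import Iterator, Optional


def iter_days(start: date, end: Optional[date] = None) -> Iterator[date]:
    """Yield each calendar day from start through end, inclusive.

    If end is omitted, the current date is used (assumption: "until the
    current date" in the question means today is included).
    """
    if end is None:
        end = date.today()
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


if __name__ == "__main__":
    # Start 5 days ago and walk forward one day at a time.
    five_days_ago = date.today() - timedelta(days=5)
    for day in iter_days(five_days_ago):
        # Placeholder for handing the date to the REST call.
        print(day.isoformat())
```

If the start date is later than the end date, the generator yields nothing, so the downstream call is simply never made.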