Questions tagged [pdi]

Pentaho Data Integration (PDI), also known as Kettle, provides extraction, transformation, and loading (ETL) capabilities.

PDI (Pentaho Data Integration), formerly known as Kettle, is a data integration project. It delivers extraction, transformation, and loading (ETL) capabilities using a metadata-driven approach.

440 questions
0
votes
1 answer

cannot get email attachment in Pentaho kettle

I'm trying to extract an email attachment in Kettle (Pentaho PDI) using the 'Email messages input' step. I looked at other examples and they all follow this example, here the input step is different ('get mails (POP3 / IMAP)') and one can specify what to…
0
votes
0 answers

Can't start Spoon/PDI after installing my own plugin

I'm developing a new plugin/step for my Pentaho PDI installation. The thing is that I can't start Spoon if the step is in the plugin/steps folder; if I take it out, I can start it normally. So it is definitely a problem with my step, but can't…
mrs1986
  • 341
  • 1
  • 2
  • 6
0
votes
1 answer

How do I log in to Pentaho Data Integration after installing it on Windows

I have installed Pentaho Data Integration. I don't know where to start; I mean, where in the installation folder should I click to run the application?
0
votes
1 answer

run kitchen pentaho batch windows script from another batch file example

I am having issues running the kitchen.bat file from another batch file: @echo off setlocal enableDelayedExpansion REM IF EXIST "%ProgramFiles(x86)%\Transform.exe" SET Transform="%ProgramFiles(x86)%\Transform.exe" set tmpDate=%date% set…
PhanJAf
  • 11
  • 8
0
votes
2 answers

Spoon PDI Data Validator Error

I am trying to validate that an assignment is correct. I can't say much, however we have internal and external users. I have an SQL Script that looks for anything other than internal on an internal assignment - result should be 0 rows. I then place…
Paul
  • 11
  • 7
0
votes
1 answer

While loop implementation in Pentaho Kettle

I need guidance on implementing a WHILE loop with Kettle/PDI. The scenario is: (1) I have some data in a table (maybe a thousand rows, or thousands of thousands) to be validated against a remote server. (2) Read them and look them up on the remote server; I use…
RVandakar
  • 81
  • 1
  • 5
  • 16
0
votes
0 answers

Unable to run pig script in pentaho

I’m using Hadoop in distributed mode. I want to execute a Pig script on the Hadoop cluster from a remote machine. To achieve this I’m using Pentaho and the Pig script utility. I set all parameters such as HDFS HostName: Hadoop master name HDFS Port:…
Vijay_Shinde
  • 1,332
  • 2
  • 17
  • 38
0
votes
1 answer

Send different metadata to different target streams - PDI

I have two target streams (Matches and mismatches) defined as below: @Override public StepIOMetaInterface getStepIOMeta() { StepMeta stepMeta = new StepMeta(); if (ioMeta == null) { ioMeta = new StepIOMeta(true, false, false,…
Abhishek Dey Das
  • 587
  • 7
  • 23
0
votes
0 answers

Download folders from ftp server using PDI

I am trying to download entire folders using the 'Get a file with ftp' step. Does this step only download files? I also tried a regular expression to match folders. I have used the following regular expression for folders: ^[a-zA-Z0-9]$ Also, it's only…
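As an aside on the regular expression quoted above: without a quantifier, `^[a-zA-Z0-9]$` matches only names that are exactly one character long. The sketch below illustrates this in Python rather than in PDI itself (the character-class syntax used here behaves the same way in Java regular expressions). The folder names are hypothetical, and the sketch does not address whether the FTP step can download folders at all.

```python
import re

# Pattern from the question: exactly one alphanumeric character.
single_char = re.compile(r"^[a-zA-Z0-9]$")
# Variant with a '+' quantifier: one or more alphanumeric characters.
one_or_more = re.compile(r"^[a-zA-Z0-9]+$")

# Hypothetical folder names, for illustration only.
names = ["a", "reports", "2015", "my-folder"]

single_matches = [n for n in names if single_char.match(n)]
multi_matches = [n for n in names if one_or_more.match(n)]

print(single_matches)  # ['a']
print(multi_matches)   # ['a', 'reports', '2015']
```

Note that `my-folder` matches neither pattern, because the hyphen is outside the character class.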
0
votes
0 answers

XML encoding error with TimeStamp in Kettle(PDI)

In Kettle job execution I am maintaining a log. It works fine with everything except the "CheckPoints Log Table". It gives the error "Column_Name Timestamp : there was a value XML encoding error", as the column has datatype DateTime.
Raj Kamuni
  • 388
  • 2
  • 12
0
votes
1 answer

Not able to pass parameter stored in memory by copy rows to result step to the SQL step condition

I have a scenario as per the two images. First pic: it is the main job, and it has one transformation and a job. In the transformation, I am selecting the filename and storing it with Copy rows to result. Now the job is going to execute in a loop…
Coding_line
  • 127
  • 4
  • 15
0
votes
1 answer

Spoon PDI Kettle crashes after cairo package update

I just solved a problem which took me way too much time, so I thought I'd share the solution here in case anyone else is biting bricks because of this. What happened? I run Pentaho Data Integration 5.2 on CentOS 6.6. It worked like a charm until I ran…
Robi Wan Kenobi
  • 1,441
  • 1
  • 10
  • 10
0
votes
1 answer

Case Insensitive Regex expression for getting file

I have a scenario where I am taking files from a folder for data loading, and the files have the naming convention .Customer_..txt. I would also like to make this expression case insensitive, so that if any file named CUSTOMER_1234 comes, it will also accept…
Coding_line
  • 127
  • 4
  • 15
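One common way to make a regular expression case insensitive is the inline `(?i)` flag, which both Java's `java.util.regex` (the dialect PDI uses) and Python's `re` support. The sketch below uses Python for illustration. The pattern `.*customer_.*\.txt` and the file names are assumptions, since the exact expression in the question is not recoverable from the excerpt.

```python
import re

# Assumed pattern (not taken from the question); the leading (?i)
# makes the whole expression case insensitive.
pattern = re.compile(r"(?i).*customer_.*\.txt")

# Hypothetical file names, for illustration only.
files = ["Customer_1234.txt", "CUSTOMER_1234.txt", "orders_1234.txt"]

# fullmatch requires the entire file name to match the pattern.
accepted = [f for f in files if pattern.fullmatch(f)]

print(accepted)  # ['Customer_1234.txt', 'CUSTOMER_1234.txt']
```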
0
votes
1 answer

Pentaho PDI Data Integration (Spoon) v5.2 has no transaction tab available in Job

I'm using Pentaho Data Integration (PDI) version 5.2 on Mac OS X. I would like to create a transaction for the 3 transformations in my job. I followed this manual: https://help.pentaho.com/Documentation/5.2/0L0/0Y0/0H0 But there is no Transaction or…
chaintng
  • 1,325
  • 3
  • 14
  • 26
0
votes
1 answer

How to split records based on a value field in a column

I have a scenario like the one below, where based on the value of num_seats I have to split each row in the target into that number of rows, along with another field (seat_num) holding a counter that increments by 1. Please suggest.
Coding_line
  • 127
  • 4
  • 15
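The row-splitting logic described in the question can be sketched outside PDI as follows. This is a plain Python illustration of the expected output, not a PDI transformation. The `booking_id` field and the sample data are hypothetical, and starting the `seat_num` counter at 1 is an assumption.

```python
def split_by_seats(rows):
    """Emit one output row per seat, adding a seat_num counter starting at 1."""
    out = []
    for row in rows:
        for seat_num in range(1, row["num_seats"] + 1):
            new_row = dict(row)           # copy the original fields
            new_row["seat_num"] = seat_num
            out.append(new_row)
    return out


# Hypothetical input rows; only num_seats comes from the question.
bookings = [
    {"booking_id": "A", "num_seats": 3},
    {"booking_id": "B", "num_seats": 1},
]

expanded = split_by_seats(bookings)
for r in expanded:
    print(r)
```

Each input row with `num_seats = n` yields `n` output rows, and the counter restarts for every input row.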