Questions tagged [pdi]

PDI Pentaho’s Data Integration, also known as Kettle, provides extraction, transformation, and loading (ETL) capabilities.

PDI (Pentaho Data Integration), formally known as Kettle, is a project of data integration. It delivers powerful Extraction, Transformation, and Loading (ETL) capabilities, using a groundbreaking, metadata-driven approach.

External Links:

440 questions
0
votes
1 answer

Regexpression for getting a file

I have to get a file through PDI based on the filename and i want to select file with name matching pattern eligible_for_push which has to be at the end.The file can be .txt or .csv Please Help Thanks
Coding_line
  • 127
  • 4
  • 15
0
votes
1 answer

Pentaho Data Integration REST Client error handling

I'm passing around 300 POST requests to the REST Client using JSON. But, my database shows only 285 results. I want to identify the 5 rejected JSON requests. Is there any step similar to error handling in REST Client in PDI.
Sughosh
  • 21
  • 2
  • 8
0
votes
1 answer

How to sent file via mail step?

There is a condition ,which requires to sent multiple mails in gmail and that too very frequently.that is after every 10 minutes a mail will be triggered showing success and failure along with audit table details which consists of job_key,line…
Coding_line
  • 127
  • 4
  • 15
0
votes
1 answer

Is there any step or logic which will maintain SCD type 1 apart from Insert/Update step in pentaho?

Is there any step or logic which will maintain SCD type 1 apart from Insert/Update step.As for maintaining this for large data it is taking huge time. It is a simple step where max(update time) from table we are getting and the data after that…
Coding_line
  • 127
  • 4
  • 15
0
votes
1 answer

How to delete documents in a mongodb collection using mongodb delete step in Pentaho

https://github.com/maasdi/pentaho-mongodb-delete-plugin/wiki/MongoDB-Delete Has the documentation here but I am unable to write a query in json Use JSON Query Choose this option if you want to use JSON query expression for your delete criteria,…
jvm
  • 43
  • 1
  • 8
0
votes
1 answer

how to override the numberlong() when mongodb output writes documents in mongodb

I am creating JSON docs in MongoDB by using MongoDB o/p step in Pentaho PDI kettle with following structure. Expected output: { "lan_id" : -1, "st_id" : 051 } However I am getting following output as MongoDB is by default is creating NumberLong()…
jvm
  • 43
  • 1
  • 8
0
votes
1 answer

Scheduling a job in pentaho 5.1 setting logging information to Email

In pentaho [PDI(Kettle)] 5.1 version after scheduling job i have set email to the transformation in job if job fails email has send to the mail where the error occured.If i want to see detailed log information what happening during scheduling. How…
SRI
  • 531
  • 1
  • 4
  • 11
0
votes
2 answers

How do I export the actual TEXT of a query from Pentaho Spoon into an Excel file?

I use Pentaho Spoon for Data Integration (aka, PDI). My internal client wants reports written to excel files, which is fine, but he also wants to see the queries I use in separate tabs in the workbook. Is there a way to use Spoon to export the…
JT Ryan
  • 1
  • 3
0
votes
2 answers

How to create a new column with an SQL function in Pentaho PDI work-flow?

I have one sql function, let's say theFunction(item_id). It takes an item id and computes one value as its return. I read one table from the DB and I suppose to compute a new value to append for each row by this function given the item_id particular…
erogol
  • 13,156
  • 33
  • 101
  • 155
-1
votes
1 answer

How to increase pentaho transformation input/output speed

I am using Pentaho data integration (PDI)-spoon to create ETL's and I am very focused on performance. I develop an ETL to process that copy of 2,500,000 rows (each row has 104 columns) from MySQL 8 to Clickhouse database and it takes 30…
sai
  • 1
  • 2
-1
votes
1 answer

Delete records from MySQL source database table ,after copying to MySQL target database using Pentaho spoon

I would like to do the below operation using Pentaho spoon/PDI. Copy data from the Mysql source database to the Mysql target database. 2)After that I would like to truncate/Delete Data from Source Database Can Someone help with this?
-1
votes
1 answer

Where to find “.ktr” file repositories?

I am doing research using Pentaho/PDI for my masters. So in order to develop it, I need an amount of reliable “.ktr” files to analyze them. For that reason, I am coming to you so you can share with me some repositories that contain those files or…
-1
votes
1 answer

PDI. I would like to aggregate a line to csv im creating

For make it easy, i have a table on a ddbb that i'm making the input on pdi then i want to output it as a csv but i would like to add a line before. For example what i would look like when you open the csv: i'm…
-1
votes
1 answer

How to get count of word occurrences in a table

I have two tables in my database. The first one contains keywords, the other one contains longer text. For example: Table 1 has the following entries in a column: word1 word2 word3 Table 2 has the following values in a column: This entry has…
ghengalala
  • 109
  • 9
-1
votes
3 answers

PDI 8.1 repository is disabled

I have installed PDI CE 4.4 in my personal computer, and I copy PDI CE 8.1 to some folder. When I run spoon.bat of CE 8.1, It run well, but the repository is disabled. How to solve this problem? My OS is Windows 7 64bit.
Hengaini
  • 44
  • 5
1 2 3
29
30