Questions tagged [kettle]

Kettle is the code name for Pentaho Data Integration (PDI) Community Edition. It is an open-source, GUI-based ETL (Extraction, Transformation, and Loading) tool that uses a metadata-driven approach.

https://help.pentaho.com/Documentation/8.2/Products/Data_Integration

1387 questions
5
votes
2 answers

How to check Pentaho kettle version from the command line?

I know I can check the Pentaho version by opening spoon.sh in vi, but the Pentaho installation in our production environment doesn't have that information in it. Some employee before me has probably edited it. So is there any other way…
Ujjwal Chowdary
  • 125
  • 2
  • 5
  • 18
5
votes
1 answer

Pentaho - Group By step does not return timestamp value

I am using the "Group by" step in my transformation: I group by one field and pass some other fields as output in Aggregates. Among those output fields there is a timestamp field that does not return the timestamp value I pass as input; instead…
Arunraj
  • 558
  • 5
  • 21
5
votes
1 answer

How to insert data into a Postgres table with an enum data type in Pentaho Kettle?

I am trying to move data from a MySQL table to a Postgres table, so I am using the Table input step to get data from the MySQL table and the Insert/Update step to insert data into the Postgres table. The Postgres table has an enum data type in it, so when I try to…
Arunraj
  • 558
  • 5
  • 21
5
votes
2 answers

How to build a complex JSON document from three tables for MongoDB using Pentaho

I have three tables in one database. These tables are linked by foreign keys: Table1 is the master of Table2, and Table2 is the master of Table3. I want to get the data values and transform them into a MongoDB document like this: { "_id" :…
Renatto Machado
  • 1,534
  • 2
  • 16
  • 33
5
votes
2 answers

"Table exists" step in Pentaho Kettle

I want to use the "Table exists" step to check whether a certain table exists and, if not, create it. The transformation I have created (to copy data from the input database into the output database): Table Input -----> Table exists ----> Table output. The…
Hello lad
  • 17,344
  • 46
  • 127
  • 200
5
votes
4 answers

Convert all selected columns to_char

I am using Oracle SQL queries in an external program (Pentaho Data Integration (PDI)). I need to convert all columns to string values before I can proceed with using them. What I am looking for is something that automatically applies the select…
aldorado
  • 4,394
  • 10
  • 35
  • 46
5
votes
1 answer

How to get attached files from an email, using Pentaho Kettle?

I'm stuck on a big problem. My task is to download some emails from a server using the IMAP protocol. This is accomplished by using the "get mails (POP3 / IMAP)" job entry, which downloads the emails, but in binary format. Files in binary format…
5
votes
2 answers

How do I use a cache in Pentaho Kettle?

I am processing data where I get some information from a REST API, based on the value of a field. The value may repeat for that field, and if I have already fetched the data for that value from REST, I would like to reuse it, saving an…
Thoughtful Monkey
  • 648
  • 2
  • 10
  • 24
4
votes
1 answer

Pentaho Kettle scripting option

I am trying to use the Pentaho Kettle software for a few transformations on my large tables. I want to perform an operation that displays the contents of alternate rows in two different tables, and then I wish to join the two tables later for further…
user1189851
  • 4,861
  • 15
  • 47
  • 69
4
votes
1 answer

Format of dates in log files PDI / Kitchen 4.0.1

Inherited a set of jobs, and the logging to the filesystem begins with format {SEV} MM-dd HH:MM:SS, where I need to have the year as part of the timestamp. The only log4j configs I can find are part of an old Jasper install, and modifying them to…
kermatt
  • 1,585
  • 2
  • 16
  • 36
4
votes
1 answer

How to merge 2 csv files by columns in spoon, pentaho, while managing data conversion?

I'm facing the following problem: I have two inputs: 1) I have a csv base file with 35 columns and their proper headers. 2) I have a variety of given files, not controlled by me, that may or may not contain the 35 columns and, even worse, they may…
wleao
  • 2,316
  • 1
  • 18
  • 17
4
votes
6 answers

Kettle Internal.Job.Filename.Directory

I am new to Pentaho Kettle and I am wondering what Internal.Job.Filename.Directory is. Is it my Spoon.bat folder, or the job/xfrm folder I created? Is there a way I can change it to point to a particular folder? I am running spoon.bat in Windows…
Sam Keith
  • 339
  • 1
  • 5
  • 5
4
votes
1 answer

Pentaho DI - JSON Nested File Output

I have a requirement where I need to fetch records from multiple tables. The primary table has a one-to-many relationship to the other tables. My data source is an Oracle DB, which contains the specified tables. One is called Student, the other one is…
4
votes
2 answers

Pentaho Table input to table output in batches

My input table in MySQL has 20 million records and the target table in Oracle is empty. I need to load the entire table from MySQL into Oracle. I am simply using a Table Input and a Table Output step. My intention is not to lock the source table for…
Srini V
  • 11,045
  • 14
  • 66
  • 89
4
votes
1 answer

Remove special characters using Pentaho - Replace in String

I want to remove special characters like ! @ # $ % ^ * _ = + | \ } { [ ] : ; < > ? / from a string field. I used the "Replace in String" step and enabled the "use RegEx" option. However, I do not know the right syntax to put in "Search" to…
M. Loyyy
  • 45
  • 1
  • 1
  • 4