Questions tagged [kettle]

Kettle is the code name for Pentaho Data Integration (PDI) Community Edition. It is an open source, GUI-based ETL (Extraction, Transformation, and Loading) tool that uses a metadata-driven approach.

https://help.pentaho.com/Documentation/8.2/Products/Data_Integration

1387 questions
4
votes
1 answer

How to insert UUID values into a PostgreSQL table via Kettle?

My Kettle transformation looks simple: "Table input --> Table output". The Table input step contains a simple SQL statement that works fine on the server. When I try to insert a UUID value into a PostgreSQL table via Kettle, I get an error: ERROR: column…
user3627830
  • 133
  • 1
  • 6
4
votes
1 answer

Cannot get data using the Get rows from result step in the second transformation in Kettle

I have two transformations inside one job in Kettle. The first transformation reads data from a CSV file and sorts it. At the end of the first transformation I use the Copy rows to result step. In the second transformation, I begin with the Get rows from result step…
April Liu
  • 41
  • 4
4
votes
2 answers

Pentaho Kettle: How to use the same database connection in both job and transformation

I have a job and multiple transformations. If I wanted to define a database connection in the job and use the same database connection for all the transformations, how do I go about it? I am using the Community Edition of Pentaho, version 5.0. I am…
Rasika Vijay
  • 395
  • 1
  • 7
  • 17
4
votes
1 answer

Pentaho ETL: Database Join vs Table Input

I need to write data from a database table to a text file with some transformation. There are two steps available to retrieve the data from the table, namely Table input and Database join. I don't see much difference between them except the "outer join?"…
Jeet
  • 1,006
  • 1
  • 14
  • 25
4
votes
2 answers

How to remove column in Pentaho Data Integration?

I am using PDI/Kettle. I know it is possible to add new columns by specifying them in fields. Is it possible to remove deprecated input columns coming from the previous step in Modified Javascript Step with Spoon?
Hello lad
  • 17,344
  • 46
  • 127
  • 200
4
votes
1 answer

Pentaho - Having multiple Copy rows to result results in Get rows from result empty

I'm trying to process some data and store it in a data warehouse. To do this, I wanted to store the dimensions in one transformation and the fact (I only have one) in another transformation. So I can use a job to execute the first one, copy rows to result…
Lyd
  • 2,106
  • 3
  • 26
  • 33
4
votes
1 answer

Pentaho Data Integration User Defined Java Class

I created a simple Java class and exported it to a JAR: package test; public class Test { public Test() { // TODO Auto-generated constructor stub } } I added the JAR file to the lib folder in Pentaho (which already contains many JAR files). As the next step I want to…
Michał Orliński
  • 1,308
  • 13
  • 15
4
votes
1 answer

Add a new data type to Pentaho Kettle

I am trying to add a new data type (Geometry) to Kettle. I have added a new Value type to org.pentaho.di.compatibility. I have added a ValueGeometry class and made the necessary modifications to ValueInterface and Value. The code compiles but the…
Abhishek Dey Das
  • 587
  • 7
  • 23
4
votes
1 answer

Pentaho Kettle "Use batch update for Inserts" option

Can someone please explain any pitfalls or drawbacks of the "Use batch update for Inserts" option in the Pentaho Kettle Table output or Update steps? One of my colleagues' opinion is that if the commit size is set to 1000 and any one of these 1000 records…
RVandakar
  • 81
  • 1
  • 5
  • 16
4
votes
1 answer

Reusing transformations with different data in Pentaho data integration Kettle

I'm working with Pentaho Kettle (PDI) and I'm trying to manage a flow in which a few transformations should work as if they were functions. I'll be more specific. I've created some transformations that make modifications to a few…
giogix
  • 769
  • 1
  • 12
  • 32
4
votes
2 answers

Filling data gaps in a stream in Pentaho Data Integration, is it possible?

I have a CSV file with EUR-USD currency exchange rates. The file was downloaded from the Bank of Canada. I downloaded the CSV with data from Oct 10th, 2013 onwards. There are, nevertheless, gaps in the data, i.e. days without the conversion rates. …
manu
  • 3,544
  • 5
  • 28
  • 50
4
votes
2 answers

PDI parameter in select statement

I want to ask a question about Pentaho. I'm using Pentaho Data Integration (Kettle). I want to use a parameter in the select statement. I have searched various forums and didn't find any solution. My problem is that I want to get result from different…
4
votes
2 answers

How to convert a date to string then split it into 4 parts using Pentaho Kettle

I have a table that contains a column named time_created representing a date: 2014-02-19 23:49:59.998557. Now I need to generate a new table that consists of year, month, day and quarter columns (for analysis purposes). For example, 2014-02-19…
Neo
  • 2,196
  • 5
  • 32
  • 58
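
For readers skimming this entry, the split described in the question above can be illustrated outside Kettle with a minimal Java sketch. It is not the accepted answer and not a Kettle step configuration. The class name `DateParts` and the method `split` are hypothetical, and the timestamp pattern is an assumption based only on the sample value 2014-02-19 23:49:59.998557 quoted in the question.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.time.temporal.IsoFields;

public class DateParts {
    // Assumed pattern: matches the microsecond-precision sample from the question.
    private static final DateTimeFormatter FMT =
        DateTimeFormatter.ofPattern("uuuu-MM-dd HH:mm:ss.SSSSSS");

    /** Returns {year, month, day, quarter} for a timestamp string (hypothetical helper). */
    public static int[] split(String timeCreated) {
        LocalDateTime ts = LocalDateTime.parse(timeCreated, FMT);
        return new int[] {
            ts.getYear(),
            ts.getMonthValue(),
            ts.getDayOfMonth(),
            ts.get(IsoFields.QUARTER_OF_YEAR) // ISO quarter: 1 to 4
        };
    }

    public static void main(String[] args) {
        int[] p = split("2014-02-19 23:49:59.998557");
        // prints "year=2014 month=2 day=19 quarter=1"
        System.out.printf("year=%d month=%d day=%d quarter=%d%n", p[0], p[1], p[2], p[3]);
    }
}
```

The same four values are what the new table's year, month, day and quarter columns would hold for that row. How to produce them inside a transformation depends on the steps available in the PDI version in use.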
4
votes
1 answer

Changing a field from String to Int in Pentaho Data Integration

I am taking JSON information from our online API and producing it in a table with DI. I have 4 fields url, deviceId, displacement & timestamp. These are all recorded as Strings but I want them to be Int values, bar the url. In the Generate Rows…
Dan
  • 2,020
  • 5
  • 32
  • 55
4
votes
1 answer

Finding out which JARs are needed for a step in Pentaho Kettle

I have a Java application with an embedded pentaho-kettle executor in it. Basically it's a WAR (usually running from within JBoss or Tomcat) that executes KTRs using Java. My setup is pretty much the same as in this post: Pentaho Kettle within a…
Ilan.K
  • 673
  • 14
  • 22