Questions tagged [pentaho-data-integration]

Tag to be used for Pentaho Data Integration (all versions). Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

It provides intuitive drag-and-drop data integration coupled with data agnostic connectivity spanning from flat files and RDBMS to Hadoop and beyond.

Features:

  • Graphical extract-transform-load (ETL) designer to simplify the creation of data pipelines
  • Rich library of pre-built components to access, prepare, and blend data from relational sources, big data stores, enterprise applications, and more
  • Powerful orchestration capabilities to coordinate and combine transformations, including notifications and alerts
  • Agile views for modeling and visualizing data on the fly during the data preparation process
  • Integrated enterprise scheduler for coordinating workflows and debugger for testing and tuning job execution
825 questions
2
votes
2 answers

Unable to pass parameteres to pentaho kjb file

I am new to kettle and am trying to run a kjb file which in turn will execute a transformation(.ktr) file. My kjb file has "Set Variables" element which accepts a file to be read in parameter called "filePath". Now here is my problem: when I run…
KCK
  • 2,015
  • 2
  • 17
  • 35
2
votes
2 answers

Pentaho date format issue

my input excel sheet has the field with two different types of values column in the format YYYY/MM/DD Now, when I have added the excel sheet into Pentaho the columns along with datatype I got which shows string datatype in the date formats column.…
Akshay
  • 359
  • 1
  • 3
  • 14
2
votes
2 answers

Pentaho: add another column based on another column

I have this situation, I have data about age, and I want to add another column, with age range: Range 1 (0-18 years) = children; Range 2 (19-50) = adults; Range 3 (50+) older people How can I do this in Pentaho? I have already tried it with…
Blaž Čukelj
  • 83
  • 1
  • 7
2
votes
1 answer

Spoon takes insanely long time to start

I am running Spoon - Pentaho EE V 6.1 on my laptop (8 GB RAM) and allocated 4 G to Spoon. Still it takes 3 minutes and 30 seconds to start. I dont have any plugins and my plugins directory is empty. I have also tried this by closing all applications…
Srini V
  • 11,045
  • 14
  • 66
  • 89
2
votes
2 answers

Installation of Pentaho-Data integration

After extracting the Pentaho zip file I downloaded. I set my class path to JDK 1.7.0. But, upon running the Spoon.bat file I keep getting the statement: "Unable to launch your Java Virtual Machine (JVM)." How do I resolve this problem ? NB: I have…
Rume7
  • 41
  • 2
  • 6
2
votes
2 answers

Different behavior in a job when execute it from the kitchen

I'm using Pentaho Data Integration 7.1 to copy some local files to a Hadoop 2.8.0 Cluster. I have created a Hadoop Cluster connection on Pentaho and I'm using a Hadoop Copy Files step. I can successfully upload the files when I execute my job from…
2
votes
1 answer

What is "Serial Single Threaded" type of Transformation Engine in Kettle 6.0.1.0?

I am very new to Kettle tool and found a transformation property where "Transformation Engine Type" can be changed. Can someone help me to understand what "Transformation Engine Type" mean and if it is selected to "Serial Single Threaded", how does…
2
votes
1 answer

transforming rows to headers in pentaho

I have a file which has entried like below FIELD1,FIELD2 name,ABC age,29 location,ZZ name,XYZ age,33 location,YY I require the output to be as below. I tried the row-denormalization, but its not giving proper…
tester
  • 213
  • 6
  • 20
2
votes
0 answers

Can Pentaho be hosted in microsoft azure

Can we host Pentaho data integration in Microsoft azure.If yes what may be the estimated cost for this.
vkc
  • 83
  • 1
  • 1
  • 9
2
votes
1 answer

pentaho set variable to jobs

I am new to pentaho. I have a Job with 3 transformation and all 3 transformation are simliar . In each transformation has Sql query something like select * from table1 where tabl1.col1='XXX' and tab2.col2='YYYY' value of col1 remains same. I want…
simplify_life
  • 405
  • 4
  • 18
2
votes
2 answers

Pentaho HTTP Post multipart/form-data

I am trying to use HTTP Post to post the data by passing following Header and Body part, Body: One image URL having file path. Type as 'File' Metadata in JSON format. Type as 'Text' Headers: Content-type = multipart/form-data Authorization =…
2
votes
0 answers

how to change date format(time zone) in CDE Dashboard in Pentaho

The data source which the date is stored in GMT is linked with Pentaho. when i am creating CDE Dashboard, i am unable to find the option changing data format into IST(GMT, +5:30:00). Let me know where to change the data format(layout Panel or…
learningstudent
  • 577
  • 1
  • 4
  • 14
2
votes
1 answer

PDI/Kettle: avoid file creation or mapping (sub-transformation) execution

It's clear by now that all steps from a transformation are executed in parallel and there's no way to change this behavior in Pentaho. Given that, we have a scenario with a switch task that checks a specific field (read from a filename) and decides…
jfneis
  • 2,139
  • 18
  • 31
2
votes
1 answer

Pentaho Spoon OK, Pentaho Kitchen ERROR

I currently have a pretty big job up and running on Pentaho Spoon 5.4.0.1-130, but, unfortunately, I keep getting an error when I try to execute the same job on Pentaho Kitchen: 2016/09/08 03:36:05 - Staging Titular.0 - ERROR (version 5.4.0.1-130,…
Isabelle
  • 631
  • 1
  • 7
  • 14
1
vote
1 answer

How to solve Pentaho text file output error - Requested array size exceeds VM limit

I am exporting data by suing "table input" step to "text_file_output" step into csv file, but failing to export even 10 records in the csv file because of "java.lang.OutOfMemoryError: Requested array size exceeds VM limit" My spoon.bat file has…