Questions tagged [pentaho-spoon]

Spoon is the graphical transformation and job designer associated with the Pentaho Data Integration suite — also known as the Kettle project.

Documentation: http://wiki.pentaho.com/display/EAI/Spoon%2BUser%2BGuide

828 questions
0
votes
1 answer

Pentaho DI (Kettle) best way to select flow based on csv file header?

I'm using Pentaho DI (Kettle) and am not sure of the best way to do the following: from a downloaded CSV file, check whether a column exists and, based on that, select the right next step. There are 3 possible options. Thanks, Isaac
isaapm
  • 157
  • 2
  • 12
0
votes
1 answer

Produce an Excel output file step for each case in a Switch/Case step in Pentaho

I have a Text file input step, which reads from several files under the same directory and produces a huge dataset. I need to produce an Excel file output for each case in a Switch / Case statement and give each Excel file a name, based on the value…
mounaim
  • 1,132
  • 7
  • 29
  • 56
0
votes
2 answers

Pentaho Spoon unable to connect to IBM DB2 database due to a missing driver (jar file)

I'm using Pentaho's Spoon. When I try to connect to an IBM DB2 database, I get the error below: `Driver class 'com.ibm.db2.jcc.DB2Driver' could not be found, make sure the 'IBM DB2' driver (jar file) is installed. com.ibm.db2.jcc.DB2Driver I…
Joshua
  • 15
  • 1
  • 4
0
votes
1 answer

How to save result row set in Pentaho variable?

Inside the job I have several transformations. In particular, one transformation looks up filenames from some directory. 'Copy rows to result' is used, and the next transformation in the master job uses those rows (filenames). After several steps in…
ilya_i
  • 333
  • 5
  • 14
0
votes
0 answers

File System Exception: Could not find files. Caused by: Invalid descendent file name hdfs

I am trying to run Hadoop MapReduce in Pentaho. I have a hadoopcopyfiles step in a job to specify the input path of the file. Everything works fine if my input file location has root access (i.e., files already created in the root folder). But if I give the source file as my…
vasanth
  • 224
  • 3
  • 19
0
votes
2 answers

Copy/Move files in PDI / Spoon yields 'is not a file' error

I am trying to automate weekly generation of a database. As a first step in this process, I need to obtain a set of files from network location M:\. The process is as follows: Delete any possibly remaining old source files from my local folder…
Wouter Kuijsters
  • 840
  • 6
  • 20
0
votes
1 answer

How to split a CSV file into groups using Pentaho?

I am new to Pentaho and am trying to read a CSV file (which I already did) and create blocks of data based on an identifier. Eg 1|A|B|C 2|D|E|F 8|G|H|I|J|K 4|L|M 1|N|O|P 4|Q|R|S|T 5|U|V|W I need to split and group this as such: (each block…
John Sylvester
  • 257
  • 1
  • 2
  • 8
0
votes
2 answers

Pentaho Data Integration Generating Dynamic Row

I have a case where I am reading data in through a table input step. The values that are read in would be: agent, sub agent, merchant, total 1, 2, 2222, 10 2, 2, 2343, 4 1, 3, 1212, 1 What I am…
John B
  • 159
  • 3
  • 14
0
votes
1 answer

Generate a new row for a set of fields of the input row (and generate a query for each new row)

We have a .csv file that has information about the migration flows of people across districts in a city. We are creating a transformation that loads data from a .csv file to a database (2 tables): each row has the following information: - field 1:…
Gerardlc
  • 43
  • 1
  • 8
0
votes
0 answers

How to skip first empty line in XML file in Pentaho kettle transformation?

I am working with the XML file format. My file is similar to this; the first line is blank. How can I skip the empty line? AAAAA
0
votes
2 answers

Spoon is not supported on this hosttype : MINGW64_NT-6.1

I'm trying to open Pentaho Spoon on a Windows 7 machine, but it's not opening. When I run SpoonDebug, SpoonDebug.txt shows this error: Spoon is not supported on this hosttype : MINGW64_NT-6.1 What should I do? When I run Git Bash, MINGW64_NT…
Filipe Ferminiano
  • 8,373
  • 25
  • 104
  • 174
0
votes
1 answer

Pentaho Carte Load Balancing

Is there any simple way to send jobs remotely to the master Carte server and have it delegate each job to a different slave server? From what I have read, my only option for out of the box load balancing in Pentaho is to adjust the clustering…
orion_kid
  • 405
  • 1
  • 7
  • 20
0
votes
0 answers

How to write spoon input step plugin?

I am trying to learn how to write an input step plugin, which writes "hello world". I have issues in the Step class. I have serious problems writing down the processRow function as all the tutorials assume the step to have some input and use getRow…
Bruno C
  • 199
  • 13
0
votes
1 answer

Pentaho SDK, how to define a text file input

I'm trying to define a Pentaho Kettle (ktr) transformation via code. I would like to add to the transformation a Text File Input Step: http://wiki.pentaho.com/display/EAI/Text+File+Input. I don't know how to do this (note that I want to achieve the…
ufo
  • 674
  • 2
  • 12
  • 35
0
votes
1 answer

Upload report to BA server with Pentaho PDI (Kettle)

I am currently working on a Pentaho PDI project where I need to upload a report to the BA/BI server repository (URL: ***/api/repo/publish/file). I would like to achieve this by using the HTTP Post step and a User Defined Java Class step that…
Simon
  • 148
  • 1
  • 14