Questions tagged [pentaho-spoon]

Spoon is the graphical transformation and job designer associated with the Pentaho Data Integration suite — also known as the Kettle project.

Spoon is the graphical transformation and job designer associated with the Pentaho Data Integration suite — also known as the Kettle project.

Documentation: http://wiki.pentaho.com/display/EAI/Spoon%2BUser%2BGuide

828 questions
0
votes
1 answer

Pentaho - Check if a csv file is already loaded before loading

I am loading CSV files from a folder using Pentaho, and once files are loaded, I am making an entry into a table with the filenames that are loaded. I need to put a check before loading a file if it is already loaded, for that I want to pick the…
0
votes
1 answer

Pentaho - Skip a column in Microsoft Excel Writer

I have multiple table inputs joined by a cartesian and a Microsoft excel writer output. The table inputs are basically writing from left to write on the Microsoft Excel Writer. I need it to skip a column so i can put a formula there. I was told I…
ajburnett34
  • 27
  • 1
  • 10
0
votes
2 answers

Pentaho - Put Data in Any Excel Field

Excel Screenshot I a new to Pentaho and i am trying to put data into an excel sheet in any specific cell i need to. I currently have table input but the data is not exactly aligned with my output excel template (Excel Writer). How can i choose where…
ajburnett34
  • 27
  • 1
  • 10
0
votes
0 answers

what value type to be used to check/unchecked a checkbox in PDI ETL Metadata injection

I have ETL Metadata Injection steps which injects the meta data to the CSV file input step present in the template. I am able to inject all the meta data like File name to read , delimiter and field names and type of fields but I am not able to…
Ghost Rider
  • 688
  • 3
  • 17
  • 38
0
votes
2 answers

How can I decrease RAM used by Pentaho Server?

I do'nt have enough RAM to run Pentaho Server, just 4Gb of ram, now I'm on Linux and I'm running the ctlscript.sh file to enable the server but I have others process running and this makes my computer have lag. Any idea about that? I really need to…
Sr Jefers
  • 87
  • 4
  • 14
0
votes
1 answer

PDI 7 o 8 error connecting to SQL Server repository

I have a SQL Server database repository. There are a lot of Jobs and Transformations on it. If I use PDI 4.4.0, I can connect and to work on it, normally. I can create or edit any transformation or job, no problem. But, when I tried to use PDI 7.1…
0
votes
2 answers

Pentaho: combination lookup/update does not process all rows in the source

I am using Pentaho Data Integration to do a SCD type 1 transformation. I am using combination lookup/update transform to generate the surrogate key value (upon insert). The commit size is 100000 and the cache size is 99999. My source table has 19763…
Kuldip.Das
  • 191
  • 1
  • 3
  • 13
0
votes
2 answers

How to use variable for database Log Connection name in Pentaho Kettle Job running on Carte?

I would like to parametrize which (predefined and parametrized) log DB connection to use, e.g. data.import.log.db=LOG_DB: but I can't get it to work with a job parameter data.import.log.db, or setting variable data.import.log.db in the calling…
leo
  • 3,528
  • 3
  • 20
  • 19
0
votes
1 answer

Pentaho start a java program inside the same jar

We have an organization reports portal for which we use Pentaho. Now the catch is, we need to define one single java Jar in which Pentaho + java files + sql, all are there. I'm able to execute SQL via Pentaho but I need to execute a class in the…
0
votes
2 answers

How to start inserting row after some specified number to MySql database using Pentaho?

Basically what i want to do is that, I have CSV file containing 10,000 rows that i want to insert into the database . When i start my transformation i want to start inserting in database after 4500 rows . So i want to skill number of rows that i…
0
votes
1 answer

"Get XML Data" step of pentaho is not able to read same xml file sometimes

I am using pentaho kettle tool for ETL job. In the job, one of the step(Get XML Data) is not able to read/parse xml file sometime. Sometime same XML file didn't throw any exception and sometime it threw. The list of errors are as given below - 1)…
0
votes
1 answer

Pentaho Server error: A serious error occurred during job execution: Could not initialize class java.net.NetworkInterface

I recently did a yum update all on one of my aws linux instances and rebooted my pentaho server. It was 200+ days since a reboot. I then tried to run my existing pentaho jobs and got this error in the carte.log file for any and all of my jobs. log…
DJ_007
  • 1
  • 1
0
votes
0 answers

Replace in string, remove stop words

I need to clean field from stop words from text file. How can I use filepath in Search in Replace in string?
tatka
  • 301
  • 1
  • 3
  • 9
0
votes
1 answer

Compare one filed in different csv using Pentaho

I have 2 csv files . 1) CSV that contains 10 records and one filed named city . This field has value in short forms like "M" for "Melbourne", etc . 2) Second CSV has two fields like , "cityshortname" and "city" . What i want to do is compare the…
0
votes
1 answer

Spoon Kettle doesn't manage NULL values correctly

I'm using Spoon Kettle PDI to insert data from a csv file to a MariaDb database. I'm doing something very simple but apparently, when in the csv there is a NULL value, PDI interpret it as a String and this create problems in the final…
drenda
  • 5,846
  • 11
  • 68
  • 141