Questions tagged [pentaho-data-integration]

Tag to be used for Pentaho Data Integration (all versions). Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

It provides intuitive drag-and-drop data integration coupled with data agnostic connectivity spanning from flat files and RDBMS to Hadoop and beyond.

Features:

  • Graphical extract-transform-load (ETL) designer to simplify the creation of data pipelines
  • Rich library of pre-built components to access, prepare, and blend data from relational sources, big data stores, enterprise applications, and more
  • Powerful orchestration capabilities to coordinate and combine transformations, including notifications and alerts
  • Agile views for modeling and visualizing data on the fly during the data preparation process
  • Integrated enterprise scheduler for coordinating workflows and debugger for testing and tuning job execution
825 questions
1
vote
1 answer

Insert XML expressions into a working XML string using Modified JavaScript Value step in PDI

I'm building a XML string for a SOAP request in Pentaho Data Integration using a Modified JavaScript Value step. The request must execute for each row (order) and each row will include data of a variable number of items. I need to bring the variable…
1
vote
1 answer

Interact within range of values - Pentaho Kettle

I have a table where it contains minimum and maximum account values ​​to organize within a classification, I want to interact with this value to generate the accounts within the range. ORDEM ABC MIN_CONTA ABC…
1
vote
1 answer

Pentaho - ssh connect to remote server

I am connecting via ssh to the server. Later, I'd like to execute something on that server, such as an "ls" or "cd" command. Is there any possibility to do this in Pentaho? I would like to be able to pass some command after logging in
1
vote
0 answers

Pentaho Data Integration Community Edition 9.4 - Shared objects file feature of transformation doesn't support variable

Our company used Pentaho Data Integration Community Edition (PDI-CE) 8.0 (Linux). We are upgrading to the PDI-CE 9.4. In the process, we met an issue. To reproduce the issue, clone the project…
Albert Wang
  • 524
  • 4
  • 7
1
vote
1 answer

Pentaho Data Integration: extract language distribution code

I'm using PDI API to get the language distribution for code in my projects. Using a request like var url =…
Cesare
  • 1,629
  • 9
  • 30
  • 72
1
vote
1 answer

Pentaho Data Integration Define Dependencies

I am using Pentaho Data Integration to create a job where several sql tables are created. Here is the job: I would like to create TABLE_D when both TABLE_C and TABLE_B are created, however it seems like first the tables in the first branch are…
Marc
  • 588
  • 2
  • 15
1
vote
1 answer

Pentaho - running a job in parallel

I have a question. Maybe someone had a similar problem. I have a job which looks as below When I launch job from spoon pentaho it's all work correctly. When I launch job from .bat file then it doesn't work. My .bat file looks as below @echo…
1
vote
1 answer

Pentaho Data Integration - Merge rows (diff) step not executing with Table Input "Replace variables in script"

I have a PDI Transformation that is using the Merge rows (diff) step comparing rows from two Table Input steps (with Sorts in between). (I linked a screenshot of the Transformation at the bottom of the post) The queries in the Table Input require…
1
vote
1 answer

Automate Pentaho Job and Transformations

I am new to Pentaho, and would like to automate my processes as much as possible. This is what my workflow looks like: Files will be deposited in s3 at random times by upstream. These files would need to be picked up or read and processed by Pentato…
1
vote
0 answers

Pentaho - Launching a job parallel to the Task Scheduler

I created job in Pentaho as below: Job working in spoon, but when I want to run this job from Task Scheduler as .bat file, it doesn't work. I don't know why. The job status is running every time, but it goes on indefinitely. My .bat file I have a…
1
vote
0 answers

Pentaho - Carte Dowload a file

I'm trying to use pentaho service carte to execute a ETL and return the file generated. How I can do this? Someone knows that it is possible or there is a other service to can do this? Thanks!!!
1
vote
0 answers

How to get the field values using the Modified javascript step in pentaho?

I'm trying to get the field values and field meta data ie., field properties, Using the following script I'm able to find the field type and field properties. Now im trying to check if there is any null value in the a particular field, so that I can…
1
vote
0 answers

Core Javascript Question -- Delete Object in a json file sourced from a column if conditions meet

the reason I need the code to work in core javascript is the tool we use, it uses Rhino from mozilla, so it cannot contain objects or methods related to manipulation of web pages. I am trying to compare data from two json files using core…
D S
  • 9
  • 3
1
vote
2 answers

Check a Pentaho result every 30 minutes and run other steps

I am new to Pentaho. I currently has a job like this: condition 1 -> condition 2 -> if successful then run this sql scripts, if failed then send email I would like to have a loop that is more like: (condition 1 -> condition -> 2) are run every 30…
hhp
  • 109
  • 1
  • 7
1
vote
0 answers

CPython Script Executor does't work on Pentaho Kettle

Hellow Guys First of all, i'm using the windows 10, java_1.8.0_333, and Pentaho 9.3 I'm trying to many ways to run my Python code in Pentaho, but isn't work. I try reinstall my Python using diferent version, try to change my java version, and try…