Questions tagged [pentaho-data-integration]

Tag to be used for Pentaho Data Integration (all versions). Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

It provides intuitive drag-and-drop data integration coupled with data agnostic connectivity spanning from flat files and RDBMS to Hadoop and beyond.

Features:

  • Graphical extract-transform-load (ETL) designer to simplify the creation of data pipelines
  • Rich library of pre-built components to access, prepare, and blend data from relational sources, big data stores, enterprise applications, and more
  • Powerful orchestration capabilities to coordinate and combine transformations, including notifications and alerts
  • Agile views for modeling and visualizing data on the fly during the data preparation process
  • Integrated enterprise scheduler for coordinating workflows and debugger for testing and tuning job execution
825 questions
-1
votes
1 answer

"Invalid column name" when "Specify database fields" in table output is unchecked in pentaho PDI V7

I'm trying to insert data to SQL database. I have all the columns in the same order as the data flow. But I'm getting this "Invalid column name name_of_the_actual_data_column" error
-1
votes
1 answer

How do I use Pentaho spoon to push data from MySQL database to facebook webpage

1) I have already made transformation mapping for getting data from specific MySQL (Table Input) and convert it as Text File output. 2) Also I have created a facebook developer account page and trying to figure out how the Facebook API works to…
-1
votes
1 answer

Restructuring JSON in Pentaho PDI Transformation

I'm in the process of evaluating several ETL tools for my company and am on Pentaho. I'm finding documentation, tutorials and blogs lacking in the information I need. My use case is simple: I'm going to receive a JSON payload via HTTP, restructure…
nbpeth
  • 2,967
  • 4
  • 24
  • 34
-1
votes
1 answer

In Pentaho can I load file and proccess the data directly into oracle database.

As of now i am downloading the file from SFTP to local and then adding into the database.I want to remove the extra step that is to download the file to machine.
Tejas
  • 11
  • 4
-1
votes
2 answers

How to populated the table via Pentaho Data Integration's table_output step?

I am performing an ETL job via Pentaho 7.1. The job is to populate a table 'PRO_T_TICKETS' in PostgreSQL 9.2 via the Pentaho Jobs and transformations? I have mapped the table fields with respect to the stream fields Mapped Fields My Table…
-1
votes
1 answer

XML data parse using Pentaho

I have the below xml data. I need data under "additionalList" as an attribute directly. But I am not able to do it using get XML data step. XXUT 38
tester
  • 213
  • 6
  • 20
-2
votes
0 answers

How to Optimize a Complex SQL Query That Runs Fast in SSMS but Slow in Pentaho Data Integration?

I have a job in Pentaho Data Integration that consists of a simple transformation involving a table input and a table output. The query in the table input is as follows: SELECT --TOP 1000 IL.Invoice_Date ,MONTH(IL.Invoice_Date) AS…
-2
votes
1 answer

Pentaho Data Integration Google BigQuery Loader exception

I use Pentaho data integration to create the job for load data to Google BigQuery from Google Cloud Storage with "Google BigQuery Loader" step. While the step successfully load data into BigQuery dataset table (checked by BigQuery job log and table…
reddto
  • 27
  • 1
  • 1
  • 7
-2
votes
2 answers

Break string into columns using Regular Expression

I am new in regex, i want to break the give string into 6 parts using regular expression. I am using the Pentaho data integration tool (ETL tool) Given string: 1x 3.5 mL SST. 1x 4.0 mL gray cap cryovial. Note: There are many more string with same…
-2
votes
1 answer

Assign several values to one key Pentaho Kettle

Please help! I have two streams (columns), let's say col_1 and col_2. There are 50 unique rows in col_1 and 10 unique rows in col_2. I need to assign these 10 unique rows to EVERY row of col_2. Actually I need to populate every value from col_1 10…
-2
votes
1 answer

Pentaho run contains list to file

I have this situation, 2 files. Input file 2 fields 6 rows: 1|BANANA ON CAGES 2|APPLE CHIPS 3|SPORT CARS 4|PLANES 5|HOUSE 6|BOTTLES List file 2 fields 4 rows BANANA|FRUIT APPLE|FRUIT CAR|TRANSPORT …
-2
votes
1 answer

Issues in Converting values to UTF-8

I am encountering issues in reporting in displaying names. My application uses different technologies PHP, Perl and for BI Pentaho. We are using MYSQL as DB and my table is of CHARSET=utf8. My table is been stored with values in rows as below which…
Yog
  • 1
  • 4
-3
votes
1 answer

SQL Server - Insert too slow into table

My table have 47 columns without data, No index, no triger, no constraints, no foreign keys. Just simple. But speed insert is ~50 row/s. Too slow! I have tried many times in different times Please help me how to check and find really problem in my…
-4
votes
2 answers

Finding file name with variables in between

Having a hard time with Regex. What would be the regex for finding a file name with variable in between them? For eg: File name : DON_2010_JOE_1222022.txt In the above file name the words DON, JOE and the format .txt will remain constant. Rest…
-4
votes
1 answer

Pentaho di jobs triggered by Lambda

Pls help, how this can be achieved? Requirement: When new files are available in a AWS S3 Bucket, a lambda process will be triggered and Pentaho job(s) to validate/process the files should be triggered. The Pentaho Job should be executed in the…
1 2 3
54
55