9

I was asked to prototype two ETL frameworks. The requirements are as follows:

  • Open Source
  • Available to Linux
  • Maintained
  • Logs can be viewed on web browser (nice to have)
  • Written in Perl, Python, Ruby or Java

The raw file can be anything (excel, csv, html page etc..) The target database is MySQL.

Dont just drop names, please indicate the advantages/disadvantages based from your experience.

Thanks!

jchips12
  • 1,177
  • 1
  • 11
  • 27

3 Answers3

7

One of the most popular Java based ETL would be Talend.

Jaspersoft ETL is another one extended from Talend and has a nice eclipse based UI.

daxim
  • 39,270
  • 4
  • 65
  • 132
Husain Basrawala
  • 1,757
  • 15
  • 21
4

I've used Kettle. It has its own GUI, but if you rather use the API to do the ETL yourself it's also supported. It has proved to be very useful to me and there are a few plugins already available for it.

edgarator
  • 296
  • 1
  • 4
  • 17
3

Another option is CloverETL. It is written in Java and there is an open source, LGPL version of its Engine. As well it has a free version of GUI called CloverETL Community.

It can process any of the indicated sources and connects to a number of databases, including MySQL.

Jan Tichy
  • 116
  • 3