Kite Morphlines (previously known as Cloudera Morphlines) is an open source framework that supports Hadoop, Flume and Spark applications that extract, transform, and load data into Apache Solr, Apache HBase, HDFS, etc. A “morphline” is a configuration file that defines a transformation chain for reading, processing and writing data.
Kite Morphlines (previously known as Cloudera Morphlines) is an open source framework that supports Hadoop, Flume and Spark applications that extract, transform, and load data into Apache Solr, Apache HBase, HDFS, enterprise data warehouses, etc.
A “morphline” is a configuration file that defines a transformation chain that consumes any kind of data from any kind of data source, processes the data, and loads the results into a Hadoop component.
Morphlines is a library, embeddable in any JVM codebase. A morphline is an in-memory container of transformation commands. Commands are plugins to a morphline that perform tasks such as loading, parsing, transforming, or otherwise processing a single record. A record is an in-memory data structure of name-value pairs with optional blob attachments or POJO attachments. The framework is extensible via embedded Java fragments or via additional commands written as Java classes.