Questions tagged [apache-nifi]

Apache NiFi is an easy to use, powerful, and reliable distributed system to transform and distribute data.

About

Apache NiFi, formerly Niagara Files, is a powerful data routing and transformation system originally created by the United States National Security Agency (NSA). Flows of data and transformations are expressed using directed graphs of data routing, transformation, and system mediation logic nodes.

NiFi has been designed as a distributed system and, with careful design, works well where limited network connectivity is available - eg a satellite node may receive vast amounts of data, and then samples / summaries can be sent back to a central location over a link with limited bandwidth.

Further, the system maintains extensive data provenance records, and it is possible to inspect flows and see individual data items and the state of them after any transformations at any point in the flow.

Clarification of ownership & license

  1. The Apache Software Foundation is a non-profit organization which owns the intellectual property of Apache NiFi (among many other projects). The Apache License, version 2.0 (additional FAQ) dictates the use and reproduction of the NiFi source code. You can read more about The Apache Way to Sustainable Open Source Success.
  2. You do not need to purchase a license to use, consume, or repurpose the Apache NiFi source code or compiled executables. You only need to adhere to the requirements outlined in the license linked above.
  3. The open source NiFi community (PMC members, committers, contributors, etc.) are available to offer help for free. This is done via:
  1. Multiple vendors do support customers with regards to Apache NiFi. Some provide consulting advice, like answering questions around the software with a certain level of responsiveness or access to experts. Others provide services like flow design & optimization, integration with other software, etc. Some offer their own version of the software with additional features or for specific deployment platforms.
  2. Those vendors may charge for access to their support, warranty, education, services, custom software, etc., but they are not selling you a license to use "Apache NiFi" (if they are charging you just to gain access to Apache NiFi, remember that Apache software is always free to use).

Links

5112 questions
11
votes
1 answer

Development Life Cycle for Apache NiFi

I realize that with NiFi, as their doc defines it, "continuous improvement occurs in production". So this doesn’t lend itself to be used as a traditional development tool. However for the project I’m working on it’s been decided that this is the…
Mike
  • 791
  • 8
  • 18
11
votes
1 answer

Apache NiFi ExecuteScript: Groovy script to replace Json values via a mapping file

I am working with Apache NiFi 0.5.1 on a Groovy script to replace incoming Json values with the ones contained in a mapping file. The mapping file looks like this (it is a simple .txt): Header1;Header2;Header3 A;some text;A2 I have started with…
panza
  • 1,341
  • 7
  • 38
  • 68
11
votes
1 answer

Kafka Avro Consumer with Decoder issues

When I attempted to run Kafka Consumer with Avro over the data with my respective schema,it returns an error of "AvroRuntimeException: Malformed data. Length is negative: -40" . I see others have had similar issues converting byte array to json,…
SparkleGoat
  • 503
  • 1
  • 9
  • 22
10
votes
2 answers

In Apache NiFi, Is it possible to increase / decrease the scope of a controller?

In Apache NiFi, I created a controller within the appropriate scope at that time. Now I want to use the same controller in other places. My question is can I change the scope to the current processors group's ancestor so it can be used by other …
Vijay Kumar
  • 2,439
  • 2
  • 32
  • 51
10
votes
2 answers

APACHE NIFI vs APACHE AIRFLOW vs APACHE FALCON ? Which suits best in the below scenario?

I am developing a solution in Java which communicates with a set of devices through REST APIs which belongs to different vendors. So for each vendor, there are a set of processes that I have to perform inside my solution. However, these processes…
Selaka Nanayakkara
  • 3,296
  • 1
  • 22
  • 42
10
votes
1 answer

get response from InvokeHTTP POST

New to NiFi! I'm using InvokeHTTP nifi processor (using this link as an example to implement my flow http://www.tomaszezula.com/2016/10/30/nifi-and-http-post-configuration/). I currently need to do POST in order to authenticate against a server I am…
BigBug
  • 6,202
  • 23
  • 87
  • 138
9
votes
2 answers

Apache Spark and Nifi Integration

I want to send Nifi flowfile to Spark and do some transformations in Spark and again send the result back to Nifi so that I can to further operations in Nifi. I don't want to write the flowfile written to database or HDFS and then trigger Spark job.…
Gowthaman V
  • 161
  • 3
  • 11
9
votes
1 answer

Specify output filename with PutFile processor

Background I have a workflow that uses a FetchFile processor, some intermediary processors (e.g. RouteOnAttribute, HashContent, etc.), and then a PutFile processor. Throughout it all, the content is not being manipulated. Issue/Problem The PutFile…
Dan
  • 4,197
  • 6
  • 34
  • 52
9
votes
5 answers

PutFile append file

New to Nifi! I'm wondering if there is a way in nifi to use a processor such as "PutFile" and have it write to one single file (append data to this file, or over-write the data in this file) - rather than create multiple different files? Is there…
BigBug
  • 6,202
  • 23
  • 87
  • 138
9
votes
2 answers

NIFI - Dev to Test to Prod

We are struggling with trying to figure out the best approach for updating processor configurations as a flow progresses through the dev, test, and prod stages. We would really like to avoid manipulating host, port, etc. references in the…
scarpacci
  • 8,957
  • 16
  • 79
  • 144
9
votes
1 answer

Apache NIFi MergeContent processor - set demarcator as new line

I want to use MergeContent processor to merge tweets to bulk insert into Elasticsearch index. For this I need command and tweets (each) to be separated by \n. This is how it should look like { action: { metadata }}\n { request body …
Igor K.
  • 915
  • 2
  • 12
  • 22
8
votes
2 answers

Introduce time delay before moving flow files to next processor in NiFi

In NiFi, there exist a data flow to consume from MQTT (ConsumeMQTT) and publish into HDFS path (PutHDFS). I got a requirement to introduce 60 min delay before pushing the consumed data into HDFS path. Found ControlRate and MergeContent processor to…
Vasanth Subramanian
  • 1,040
  • 1
  • 13
  • 32
8
votes
1 answer

purpose of Funnel in NiFi

In the image, there are 3 GenerateFlowFile processors connected to a LogAttribute processor. I would like to know the need for Funnel. Please clarify. I understood one such use of funnel where it can be used to merge components from two different…
Vasanth Subramanian
  • 1,040
  • 1
  • 13
  • 32
8
votes
2 answers

Spark concurrent writes on same HDFS location

I have a spark code which saves a dataframe to a HDFS location (date partitioned location) in Json format using append mode. df.write.mode("append").format('json').save(hdfsPath) sample hdfs location : /tmp/table1/datepart=20190903 I am consuming…
8
votes
2 answers

In NiFi processor 'InvokeHTTP' where do you write body of POST request?

Before posting this question about Apache NiFi InvokeHTTP I have gone through all other questions and their answersbut I am still unsure the best flow I should have. My situation is as below: 1) From Apache Kakfa, I get raw metadata. 2) Using…
Yeshwant KAKAD
  • 279
  • 1
  • 6
  • 16