Questions tagged [apache-nifi]

Apache NiFi is an easy to use, powerful, and reliable distributed system to transform and distribute data.

About

Apache NiFi, formerly Niagara Files, is a powerful data routing and transformation system originally created by the United States National Security Agency (NSA). Flows of data and transformations are expressed using directed graphs of data routing, transformation, and system mediation logic nodes.

NiFi has been designed as a distributed system and, with careful design, works well where limited network connectivity is available - eg a satellite node may receive vast amounts of data, and then samples / summaries can be sent back to a central location over a link with limited bandwidth.

Further, the system maintains extensive data provenance records, and it is possible to inspect flows and see individual data items and the state of them after any transformations at any point in the flow.

Clarification of ownership & license

  1. The Apache Software Foundation is a non-profit organization which owns the intellectual property of Apache NiFi (among many other projects). The Apache License, version 2.0 (additional FAQ) dictates the use and reproduction of the NiFi source code. You can read more about The Apache Way to Sustainable Open Source Success.
  2. You do not need to purchase a license to use, consume, or repurpose the Apache NiFi source code or compiled executables. You only need to adhere to the requirements outlined in the license linked above.
  3. The open source NiFi community (PMC members, committers, contributors, etc.) are available to offer help for free. This is done via:
  1. Multiple vendors do support customers with regards to Apache NiFi. Some provide consulting advice, like answering questions around the software with a certain level of responsiveness or access to experts. Others provide services like flow design & optimization, integration with other software, etc. Some offer their own version of the software with additional features or for specific deployment platforms.
  2. Those vendors may charge for access to their support, warranty, education, services, custom software, etc., but they are not selling you a license to use "Apache NiFi" (if they are charging you just to gain access to Apache NiFi, remember that Apache software is always free to use).

Links

5112 questions
5
votes
2 answers

NiFi Auth with Nginx reverse proxy

Is it possible to have NiFi with user authentication but with SSL termination on NGINX. I have NGINX running on port 443 and a proxy_pass passing to nifi at port 8080. I played around with these headers: X-ProxyScheme - the scheme to use to connect…
Sterling Duchess
  • 1,970
  • 16
  • 51
  • 91
5
votes
3 answers

Is Apache Nifi ready to use with Kubernetes in production?

I am planning to setup Apache Nifi on Kubernetes and make it to production. During my surfing I didn't find any one who potentially using this combination for production setup. Is this good idea to choose this combination. Could you please share…
5
votes
3 answers

Apache NiFi, write attribute to flowfile content

Is there any option to write an attribute of a flow file as a content without using the AttributesToJSON processor?
B. Pesevski
  • 461
  • 1
  • 3
  • 9
5
votes
1 answer

Nifi LDAP authorization - Multiple groups in search filter through wildcards

I am trying to use a search filter for Nifi authorizer via LDAP. The problem is that the usual search filter syntax does not seem to work. I currently have nifi working with one of my groups, so getting LDAP to work is not the issue. I was also able…
kbruylants
  • 63
  • 6
5
votes
1 answer

How to simply combine flow files in nifi?

Let's say I have 100 flow files produced by one processor, each of them contains a different line. I want to get a new flow file which contains 100 line. How can I did that? I have tried MergeContent processor, but it gives me the origin 100 flow…
xingbin
  • 27,410
  • 9
  • 53
  • 103
5
votes
1 answer

Apache NIFI: Recovering from Flowfile repository issue

I am currently trying to recover my flows from the below exception. failed to process session due to Cannot update journal file /data/disk1/nifi/flowfile_repository/journals/90620570.journal because no header has been written yet.; Processor…
irrelevantUser
  • 1,172
  • 18
  • 35
5
votes
5 answers

How to persist nifi flowfiles by using docker-compose?

right now I'm using nifi and its processors for some streaming stuff (mqtt listener, json evaluating, text replacement, write into db ...). I'm trying to persist the flowfiles and therefore I did some volume mapping (see below). But it doesn't work;…
T_F
  • 129
  • 3
  • 9
5
votes
2 answers

HandleHTTPRequest processor creating duplicate output flowfiles in nifi

​Hi, I have created the handlehttprequest processor, on a standalone nifi instance which is timer driven, concurrent tasks set to 1 and run schedule is 0. I am hitting my web service with a request from postman or any browser. The output queue of…
Sid
  • 1,224
  • 3
  • 23
  • 48
5
votes
1 answer

NiFi: Merge an Attribute into the Flow-file's JSON Content (without overwriting the entire flowfile)

I have a flow that creates some new attributes / content. I want to take an attribute from the flow files, and add it into the JSON content of the file. I can use AttributesToJSON, but that just overwrites the file content. A sample of my flow is…
papelr
  • 468
  • 1
  • 11
  • 42
5
votes
1 answer

Remove duplicates in NiFi

I have a flow QueryDatabaseTable->ConvertRecord->PutElasticseachHttpRecord What I am trying to do is fetching the full data from MySQL database and feeding it into Elasticsearch to perform analytics on it using Kibana. However, my data has duplicate…
Shrads
  • 883
  • 19
  • 39
5
votes
1 answer

Way to clear Bulletin for a processor

For quick testing and development, it would really be helpful to have a way to clear the Bulletin from processor's top right corner. Once I see the error and I fix it, I would like to clear it before restarting. If it is possible with the current…
pratpor
  • 1,954
  • 1
  • 27
  • 46
5
votes
3 answers

Merge two JSON flowfile together in NiFi

i want to merge two flowfile that contain JSON object by same specified attribute... flow1: attribute: xuuid = 123456 content: { "sname":"jack", "id":"00001", "state":"NY" } flow2: attribute: xuuid =…
meh
  • 253
  • 2
  • 9
  • 15
5
votes
1 answer

Apache Nifi - Consume Kafka + Merge Content + Put HDFS to avoid small files

I am having around 2000000 messages in Kafka topic and I want to put these records into HDFS using NiFi,so I am using PutHDFS processor for this along with ConsumeKafka_0_10 but it generates small files in HDFS, So I am using Merge Content processor…
BARATH
  • 364
  • 2
  • 17
5
votes
1 answer

How to access NiFi using proxy user request

I want to access NiFi and do some actions through external web application created in C# using proxy user request. For that, I have created C# web application and hosted as secure connection(https). I need to access NiFi components using its REST…
Nijandhan
  • 185
  • 1
  • 11
5
votes
1 answer

Ingest log files from edge nodes to Hadoop

I am looking for a way to stream entire log files from edge nodes to Hadoop. To sum up the use case: We have applications that produce log files ranging from a few MB to hundreds of MB per file. We do not want to stream all the log events as they…
j9dy
  • 2,029
  • 3
  • 25
  • 39