I have a very simple topology that spouts from an ES index (AggregationSpout), fetches the pages (FetcherBolt) and uses StatusUpdaterBolt to update the ES status to "FETCHED".
However, I noticed such warnings in the log files:
[WARN] Could not find unacked tuple for 357dc2fcb59c6457884a8f7a83794c4cf77f490a3acfd849a792a35153ed4665
The corresponding debug info looks like: ...
2017-12-06 12:44:53.572 o.e.t.T.tracer elasticsearch[client][transport_client_boss][T#2] [TRACE] [214][indices:data/write/bulk] received response from [{ESPatentNode-1}{S4C2h8WjRuu6MpM25oM-3w}{Fvjny3VaQl2w45hPXZ5A9g}{127.0.0.1}{127.0.0.1:9300}] 2017-12-06 12:44:53.572 c.d.s.e.p.StatusUpdaterBolt elasticsearch[client][listener][T#1] [DEBUG] afterBulk [105] with 47 responses 2017-12-06 12:44:53.572 c.d.s.e.p.StatusUpdaterBolt elasticsearch[client][listener][T#1] [DEBUG] Acked 1 tuple(s) for ID 5967f802c84e3e9c6ac22a3184e0665b850779cba9050fa4ec910a41f9f90655 2017-12-06 12:44:53.573 c.d.s.e.p.StatusUpdaterBolt elasticsearch[client][listener][T#1] [DEBUG] Acked 2 tuple(s) for ID 357dc2fcb59c6457884a8f7a83794c4cf77f490a3acfd849a792a35153ed4665 2017-12-06 12:44:53.573 c.d.s.e.p.StatusUpdaterBolt elasticsearch[client][listener][T#1] [DEBUG] Acked 1 tuple(s) for ID 092e59cd1ebb004884babfaf1d6ca4b7505b3dcb1b3cb3a52b9072d647fb7a93 2017-12-06 12:44:53.573 c.d.s.e.p.StatusUpdaterBolt elasticsearch[client][listener][T#1] [WARN] Could not find unacked tuple for 357dc2fcb59c6457884a8f7a83794c4cf77f490a3acfd849a792a35153ed4665
What I would like to understand is:
- why several tuples can be attached to an ID
- how it is possible to come accross twice the same "waitAck" cache element looping through "response" in the afterBulk method of StatusUpdaterBolt
Thanks in advance for your help!