Problem: data from one particular source into our accumulo instance is not being returned to our client application via a subset of our search interfaces.
When we use search method "A" we get results, but when we use search method "B" we do not.
I have a hunch that method "A" and method "B" are actually hitting different tables.
In order to prove that, I need a way to put a hook into the stream of data coming into the tables and grep for data indicating the source of the message. I can't do anything programatically because that would necessitate my taking the system down which isn't an option right now.
I see from the manual that there is a 'grep' and 'egrep' command. The help file on grep says not to use it for regex, and I can't seem to get egrep to return a record I know to be in the database.
example:
A record in the exchange contains the line <gml:pos>23.05507 113.5268</gml:pos>
. To egrep for it, I log into the accumulo shell, select the table in which the record exists, then enter the following egrep ^:pos>23.*113.*
.
Nothing comes back.
I've tried all the variants of the command I could think of (quoting, not quoting, searching only for 23.*, etc).
What am I missing here?