Recoll does not index many text files by default. It seems to only index files where the mimemap explicitly includes the mime type, but not other "obvious" file types.
Examples:
- yaml files -- `file -i` shows `text/plain; charset=us-ascii`, but using `recollindex -e -i /path/to/foo.yaml` shows recoll detecting it as `application/x-yaml` via `xdg-mime`, which isn't an officially registered mime type -- but if recoll uses `xdg-mime`, one would think it would know how to deal with all the possible return values from `xdg-mime`
- awk scripts -- same thing, with `application/x-awk`; this is in the default `mimeconf`.
- perl scripts -- same thing, with `application/x-perl`; this is in the default `mimeconf`.
- shell scripts -- same thing, with `application/x-shellscript`; this is in the default `mimeconf`.
- kotlin and other source code files -- recoll sees it as `text/x-kotlin` -- again a non-standard type via `xdg-mime`, but one that begins with `text/` so Recoll should know it is text -- but still doesn't index it
- readme files -- same thing, with `text/x-readme`
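In case it helps anyone reproduce the mismatch, here is a minimal sketch that prints what `file -i` and `xdg-mime` each report for a file. The function name `compare_mime` is made up for illustration, and I am assuming `xdg-mime query filetype` is the call recoll ends up making, which I have not verified.

```shell
#!/bin/sh
# Hypothetical helper (not part of Recoll): for each file given, print the
# MIME type reported by file(1) and by xdg-mime, so the two can be compared.
compare_mime() {
    for f in "$@"; do
        printf '%s\n' "$f"
        # file -b -i prints the type and charset without the file name prefix.
        if command -v file >/dev/null 2>&1; then
            printf '  file:     %s\n' "$(file -b -i "$f")"
        fi
        # Assumption: this is the query recoll relies on for detection.
        if command -v xdg-mime >/dev/null 2>&1; then
            printf '  xdg-mime: %s\n' "$(xdg-mime query filetype "$f")"
        fi
    done
}
```

Running `compare_mime /path/to/foo.yaml` should show the two tools side by side for the yaml case above.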
Now, this can be worked around on a case-by-case basis by adding something like this into `~/.recoll/mimeconf`:
[index]
application/x-yaml = internal text/plain
text/x-kotlin = internal text/plain
text/x-readme = internal text/plain
but doing this one file type at a time seems silly. Is there a way to say:
- index everything with mime type `text/*` as `text/plain`, unless recoll already has a more specific parser for the type
- index obvious textual data (e.g. if `file -i` returns `text/plain`) as `text/plain`, again unless recoll already has a more specific parser for the type
If it matters, I'm using recoll packaged by Fedora.
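For reference, the per-type workaround can at least be generated rather than typed by hand. This is only a sketch of the stopgap, not the wildcard behaviour I am asking about: `gen_index_section` is a made-up helper name, and it assumes `internal text/plain` is an acceptable handler for every type in the list.

```shell
#!/bin/sh
# Hypothetical helper (not part of Recoll): read MIME types, one per line,
# on stdin and print an [index] section that maps each one to the
# plain-text handler, in the same form as the manual workaround.
gen_index_section() {
    printf '[index]\n'
    # sort -u drops duplicate types so each is emitted once.
    sort -u | while IFS= read -r mtype; do
        case $mtype in
            ''|'#'*) continue ;;   # skip blank lines and comments
        esac
        printf '%s = internal text/plain\n' "$mtype"
    done
}

# Example (prints to stdout; merge the result into ~/.recoll/mimeconf by hand):
#   printf '%s\n' application/x-yaml text/x-kotlin text/x-readme | gen_index_section
```

It only writes to stdout on purpose, since blindly appending a second `[index]` section to an existing `mimeconf` may not do what one expects.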