Current state
By default, DSpace XMLUI (the Mirage and Mirage 2 themes with the "metadata" focus) shows items in search result listings as follows:
- When the search term was found in the title/author/publisher metadata, the item is shown as in "recently added" lists (including the first part of the abstract).
- When the search term was found in the abstract, the abstract snippet shown for the item "shifts" to show the context around where the search term was found.
- When the search term was found in extracted fulltext, the abstract snippet is not shown at all. Instead, a snippet of the extracted fulltext is shown, giving the context around where the search term was found.
In all three cases, the search terms are bolded.
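The three cases above can be modelled roughly as follows. This is an illustrative Python sketch, not DSpace code: the item dictionary, the field names, the function names and the snippet width are all hypothetical, and the order in which the fields are checked is an assumption, since the description above does not say which case wins when the term occurs in several fields.

```python
import re

def find(text, term):
    """Case-insensitive literal search; returns a match object or None."""
    return re.search(re.escape(term), text, flags=re.IGNORECASE)

def bold(text, term):
    """Wrap every occurrence of the term in <b> tags."""
    return re.sub(re.escape(term), lambda m: "<b>" + m.group(0) + "</b>",
                  text, flags=re.IGNORECASE)

def window(text, match, width):
    """Return up to `width` characters of context on each side of the match."""
    start = max(0, match.start() - width)
    end = min(len(text), match.end() + width)
    return text[start:end]

def current_snippet(item, term, width=40):
    """Return (kind, snippet) as a model of the current default behaviour."""
    abstract = item.get("abstract", "")
    fulltext = item.get("fulltext", "")
    # Assumption: metadata hits are checked first, then abstract, then fulltext.
    if any(find(item.get(f, ""), term) for f in ("title", "author", "publisher")):
        return "abstract_start", bold(abstract[:2 * width], term)
    hit = find(abstract, term)
    if hit:
        return "abstract_context", bold(window(abstract, hit, width), term)
    hit = find(fulltext, term)
    if hit:
        # The abstract is dropped entirely; raw extracted text is shown instead.
        return "fulltext_context", bold(window(fulltext, hit, width), term)
    return "abstract_start", abstract[:2 * width]
```

Running this against one item with three different search terms returns three different kinds of snippet, which is the mix a user sees in a single result list.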
Issues with this approach
The approach above causes several issues in terms of user experience:
- The logic above is not made clear to the user; search result listings can be a mix of items with the beginning of the abstract shown, items with a middle portion of the abstract shown, and items with a fulltext snippet shown.
- The extracted fulltext may include "ugly" special characters and will generally include the filename, file size and other metadata of little relevance to the user.
- The extracted fulltext may be from a restricted bitstream and the preview snippet may reveal information that is supposed to be confidential (see this DSpace bug).
Desired behaviour
Instead, I would like search result listings to always show an abstract snippet for each item. It's acceptable for the abstract snippet to "shift" to show the context around the search term, but the shift should be made clear to the user. When the search term is found only in the fulltext file, the beginning of the abstract should be shown, alongside a message that a fulltext file belonging to the item contains the search term.
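A minimal sketch of this desired logic, again in illustrative Python rather than actual XMLUI code. The item dictionary, field names, function names, snippet width, the "..." markers used to signal a shifted snippet, and the wording of the message are all assumptions made for the example.

```python
import re

# Hypothetical wording for the fulltext hint.
FULLTEXT_NOTE = "A fulltext file belonging to this item contains the search term."

def find(text, term):
    """Case-insensitive literal search; returns a match object or None."""
    return re.search(re.escape(term), text, flags=re.IGNORECASE)

def bold(text, term):
    """Wrap every occurrence of the term in <b> tags."""
    return re.sub(re.escape(term), lambda m: "<b>" + m.group(0) + "</b>",
                  text, flags=re.IGNORECASE)

def desired_snippet(item, term, width=40):
    """Always return an abstract snippet, plus an optional note for the user."""
    abstract = item.get("abstract", "")
    hit = find(abstract, term)
    if hit:
        start = max(0, hit.start() - width)
        end = min(len(abstract), hit.end() + width)
        # Mark a shifted snippet so the user can tell it is not the beginning.
        prefix = "... " if start > 0 else ""
        suffix = " ..." if end < len(abstract) else ""
        return {"snippet": prefix + bold(abstract[start:end], term) + suffix,
                "note": None}
    in_metadata = any(find(item.get(f, ""), term)
                      for f in ("title", "author", "publisher"))
    in_fulltext = find(item.get("fulltext", ""), term) is not None
    # Fulltext content is never displayed, only the fact that it matched.
    note = FULLTEXT_NOTE if in_fulltext and not in_metadata else None
    return {"snippet": abstract[:2 * width], "note": note}
```

Because the extracted fulltext itself never reaches the output in this sketch, the filename and file size noise and the contents of a restricted bitstream are not displayed. Whether the note should still appear for restricted bitstreams is a separate decision that the sketch does not address.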