Grouping similar search results

When the Group Similar Search Results checkbox is ticked, similar results will be grouped together after a query.

This reduces the repetition of very similar search results, such as emails with almost identical content or the same report added to the collection in different file formats. (A collection is a container for storing and organising ingested files and documents. Only the textual content is stored in collections, not the original files and documents.)

If similar search results are detected, a Similar Results tab will appear in the search results where matches were found.

Selecting the Search Results tab will reveal all the matched documents.

Using the Tolerance Slider

The Tolerance Slider is used to adjust the match precision between two different documents. It can be adjusted to a value between 0 and 50 percent. A lower tolerance value will require higher similarity between documents for a match to be detected. For example, at 1%, two documents would need to be practically identical in content to be considered a match.

At 10%, the matching tolerance is relaxed enough that, for example, emails from different people containing mostly the same information would be grouped together. To detect documents that are very similar, set the Tolerance Slider no higher than 10%.

A tolerance above 10% can be used for broader filtering in order to identify documents which may be related in terms of subject matter.
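
The relationship between tolerance and similarity can be illustrated with a short sketch. This is not the product's actual matching method, which is not described here. It assumes that similarity is a ratio between 0 and 1 and that the tolerance is the percentage by which two documents may fall short of a perfect match. The function names is_match and group_similar are hypothetical, and Python's difflib.SequenceMatcher stands in for whatever similarity measure the software really uses.

```python
from difflib import SequenceMatcher


def is_match(text_a: str, text_b: str, tolerance_percent: float) -> bool:
    """Return True when two texts differ by no more than the tolerance.

    Assumption: similarity is a 0..1 ratio and the tolerance is the allowed
    shortfall from a perfect score. The real measure may differ.
    """
    if not 0 <= tolerance_percent <= 50:
        raise ValueError("tolerance must be between 0 and 50 percent")
    similarity = SequenceMatcher(None, text_a, text_b).ratio()
    difference_percent = (1.0 - similarity) * 100
    return difference_percent <= tolerance_percent


def group_similar(texts: list[str], tolerance_percent: float) -> list[list[int]]:
    """Greedily group the indices of texts that match a group's first member."""
    groups: list[list[int]] = []
    for index, text in enumerate(texts):
        for group in groups:
            # Compare against the first document placed in the group.
            if is_match(texts[group[0]], text, tolerance_percent):
                group.append(index)
                break
        else:
            # No existing group was close enough, so start a new one.
            groups.append([index])
    return groups
```

Under these assumptions, raising tolerance_percent lets more loosely related texts fall into the same group, while a value near 0 groups only texts that are practically identical.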