I’m looking for a duplicate/similarity checker against a custom set of documents. This is possibly like what plagiarism checker, but with a custom reference (instead of everything). But I could not find a solution that can be selfhosted, and have some simple UI and capabilities like Turnitin. Any suggestions?

Thanks’