I’m looking for a self hosted solution to this problem:
I want to create a full text search index from a collection of PDF manuals (text, not images, I don’t care about OCR here). There is a UI to search for text matches in documents, and clicking a search hit opens the PDF scrolled to where the search hit is (bonus points if the search hit is hilighted)
Would paperless-ngx work?
I haven’t used any of these, but I wonder if any of the self-hosted open source document management platforms would serve your purpose? Something like PaperMerge or similar?
So like self-hosted Spotlight? That would be cool
Or rather - FOSS Spotlight. I suppose it’s already self hosted
This sounds like it could be a job for Calibre-web.
Elastic Search.
This sounds useful although I haven’t tried it myself: https://stackoverflow.com/a/40614824