Comment on "What’s Your Preferred Self-Hosted Solution for Deep Monitoring (Beyond Simple Page Changes)?"

<- View Parent
alfablend@lemmy.world ⁨4⁩ ⁨days⁩ ago

@xyro Thanks for sharing your case! I’ve also tested changedetection.io — it’s a great tool for basic site monitoring.

But in my tests, it doesn’t go beyond the surface. If there’s a page with multiple document links, it’ll detect changes in the list (via diff), but it won’t automatically download and analyze the new documents themselves.

Here’s how I’ve approached this:

  1. Crawl the page to extract links
  2. Detect new document URLs
  3. Download each document and extract keywords
  4. Generate an AI summary using a local LLM
  5. Add the result to a readable feed

P.S. If it helps, I can create a YAML template tailored to your grant-tracking case and run a quick test.

source
Sort:hotnewtop