this post was submitted on 12 Jul 2025
1 points (100.0% liked)

It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
 
The original post: /r/datahoarder by /u/CarcajadaArtificial on 2025-07-11 20:03:57.

Hello, I just started using ArchiveBox to store local copies of my bookmarks and articles. Frequently I would store two different pages from the same site that would have repeated images, of course it would be better to not keep this kinds of duplicates. I suppose this is a relatively common concern but couldn't find anything about this in the docs. I also suppose that not all download formats would handle this situation the same way, I was using SingleFile which I suddenly realized that it probably wouldn't be too optimized for this. What would be your recommendation for this?

Thank you

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here