I have the same problem, but with photos. I transferred about 30,000 photos from a few iCloud libraries into Synology Photos, and there must be hundreds of duplicates in that bunch. I need to clear that mess out.
Czkawka is quite good for photos. Use "Duplicate files" mode first, then "Similar images", and work down the similarity levels one by one. It's good at catching watermarked versions and copies at different resolutions. It's bad at cropped versions, and "Similar images" mode also throws false positives if you have two frames from a video saved as images.
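If you want to see what an exact-duplicate pass is doing before you trust a tool with 30,000 photos, here's a minimal sketch of the general idea in Python (standard library only). To be clear, this is not Czkawka's actual code, and the names `file_digest` and `find_exact_duplicates` are just made up for illustration. It assumes that grouping by file size first and then by SHA-256 is good enough, and it only catches byte-identical files, so resized, watermarked, or re-encoded copies will slip through (that's what a similar-images pass is for).

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_exact_duplicates(root):
    """Return groups of paths under `root` whose contents are byte-identical."""
    # Pass 1: bucket by size. This is cheap and rules out most files.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    # Pass 2: hash only the files that share a size with another file.
    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # a unique size cannot have an exact duplicate
        for path in paths:
            by_hash[file_digest(path)].append(path)

    # Keep only hashes seen more than once; sort paths for stable output.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_exact_duplicates("/path/to/photos")` (placeholder path) gives you a list of groups, and each group is a list of paths with identical contents. It only reports; it never deletes anything, so you can review the groups first.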
I used NoClone for the longest time and loved it. Czkawka has been my go-to since NoClone hasn't been updated in a while and tends to crash.
The GUI is a bit awkward to get used to. For instance, I haven't figured out whether you can browse a path, and filtering by path works a little differently from what I'm used to, but I'm still generally very happy with it.