It's A Digital Disease!

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

1326
 
 
The original post: /r/datahoarder by /u/Rhoken on 2025-05-29 16:48:58.

For some years now I have been hoarding and backing up most of my data, especially since I started doing photography, to the point that I now have various drives lying around.

Some days ago I bought a new HDD cage for my two backup HDDs (Toshiba P300), one 2 TB unit and one 1 TB unit, and I sync everything with Synkron.

But I want to ditch the 1 TB unit and add a 4 TB unit, and the HDD market has changed drastically since I last looked (4-5 years ago).

I don't want to spend much money (budget is 100-120 euros), and within this budget I have found these models:

  • Seagate Ironwolf 4 TB ST4000VN006 (CMR)

  • Seagate Barracuda 4 TB ST4000DM004 (SMR)

  • HGST Ultrastar 7K6000 4 TB HUS726040ALE610 (CMR?)

  • Toshiba P300 4 TB HDWD240UZSVA (SMR)

The Toshiba is the cheapest (75 euros) and the IronWolf the most expensive (94 euros); the others are in the 80 to 100 euro range. Which of these are on par with or better than my two P300s in terms of performance and reliability?

1327
 
 
The original post: /r/datahoarder by /u/Fuzzy-Zone-5535 on 2025-05-29 16:41:38.

I am unable to use yt-dlp. I have tried and failed many times, even when following step-by-step tutorials on YouTube. There are a few movies in 4K I found on YT that I would like to download. Is there any alternative way to do it?

1328
 
 
The original post: /r/datahoarder by /u/phenrys on 2025-05-29 16:39:24.

Super happy to share with you the latest version of my YouTube Downloader Program, v1.2. This version introduces a new feature that allows you to download multiple videos simultaneously (concurrent mode). The concurrent video downloading mode is a significant improvement, as it saves time and prevents task switching.

To install and set up the program, follow these simple steps: https://github.com/pH-7/Download-Simply-Videos-From-YouTube

I’m excited to share this project with you! It holds great significance for me, and it was born from my frustration with online services like SaveFrom, Clipto, Submagic, and T2Mate. These services often restrict video resolutions to 360p, bombard you with intrusive ads, fail frequently, don’t allow multiple concurrent downloads, and don’t support downloading playlists.

I hope you'll find this useful. If you have any feedback, feel free to reach out to me!

1329
 
 
The original post: /r/datahoarder by /u/Lucky_Influence901 on 2025-05-29 15:04:15.

DISCLAIMER: CAN PEOPLE IN THE COMMENTS STOP CALLING ME A DUMBASS? I'VE ALREADY GOT THE SOLUTION AND I DON'T NEED ANY HELP ANYMORE. THIS WAS LITERALLY JUST A RANDOM RANT ABOUT HOW BULLSHIT CAPITALISM AND FILESYSTEMS ARE. AND ALSO I GOT INTO THIS PARTITIONS AND FILESYSTEMS CRAP 2 DAYS AGO. GIVE ME A BREAK I DON'T KNOW EVERYTHING DAMN IT.

I’m losing my mind over here. It’s 2025, and I’m STILL wrestling with file system chaos like it’s 2005. I have a perfectly good M.2 SSD full of family data in NTFS format, and now I want to watch some simple movies on my tablet that only reads FAT32 or exFAT. Sounds easy, right? Nope. And before you little assholes say "then just use exfat!!~!!!!!!!!!" Well shit.... The documentation says it SHOULD support exfat but that fucker told me to go format it like the bitch it is when the documentation literally says IT WORKS ON EXFAT. WHAT THE FRCICCCFKCKCKC

I’ve spent six hours trying to convert, clone, partition, and split files without destroying a single byte. Windows crashes, file explorers freeze, formatting tools act like they’re from the stone age, and then my tablet STILL can’t read the drive properly.

Why do we still have to jump through hoops to just watch a movie? Why can’t there be one single, universal file system that’s reliable, compatible everywhere, and actually doesn’t make me want to throw my hardware out the window?

The fact that I need to chunk every single movie into 4GB fat32 segments just so my tablet can read it? Are you kidding me? And don’t get me started on codec support, missing apps, and software that thinks it’s 1999.

We live in a world with quantum computing research and AI writing novels, but I can’t plug in a drive and watch a damn movie without a 6-hour tech nightmare.

If anyone else is in this eternal hell, drop your stories or survival tips. Or just tell me I’m not alone in this madness.

1330
 
 
The original post: /r/datahoarder by /u/TheThingCreator on 2025-05-29 14:21:26.

Original Title: Pocket is Shutting down: Don't lose your folders and tags when importing your data somewhere else. Use this free/open-source tool to extract the meta data from the export file into a format that can easily migrate anywhere.

1331
 
 
The original post: /r/datahoarder by /u/Ken852 on 2025-05-29 13:42:14.

I submitted a data request today. It was processed in less than one hour. Which is kind of nice. It can normally take companies anywhere from 1 to 30 days, sometimes more to process this kind of request if it's handled manually.

But I'm surprised that all I got is 37 CSV files inside a ZIP file. The ZIP is only 6.14 MB. There are no media files, like the many images I uploaded. Also, everything seems to be sorted by ID, which is alphanumeric, instead of by date, which I think would make more sense. This applies to posts and messages. There is also no clear separation between them. So the whole thing is very hard to read and make sense of, for example to verify its completeness. I requested everything, but I'm not sure how far back this goes until I sort it.

So I was wondering if there is a third party tool, either free or paid, that will let me get a complete copy of my account data, including the images? Preferably in a format or with a parser that will display it in an easy way, similar to how Reddit itself displays it.
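Until a proper tool turns up, re-sorting one of the exported CSV files by date is simple enough to script. This is only a minimal sketch: the column name `date` and the timestamp layout (`YYYY-MM-DD HH:MM:SS`, optionally followed by ` UTC`) are assumptions, so check them against the header row of the actual export first.

```python
import csv
from datetime import datetime

# Assumptions (not confirmed by the post): the timestamp column is called
# "date" and looks like "2023-07-15 12:30:00 UTC". Adjust both as needed.
DATE_COLUMN = "date"
DATE_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_timestamp(text):
    """Parse a timestamp string, ignoring a trailing ' UTC' if present."""
    cleaned = text.strip()
    if cleaned.endswith(" UTC"):
        cleaned = cleaned[: -len(" UTC")]
    return datetime.strptime(cleaned, DATE_FORMAT)


def sort_csv_by_date(src_path, dst_path, column=DATE_COLUMN):
    """Write a copy of src_path to dst_path with rows ordered oldest first."""
    with open(src_path, newline="", encoding="utf-8") as src:
        reader = csv.DictReader(src)
        fieldnames = reader.fieldnames
        rows = list(reader)

    rows.sort(key=lambda row: parse_timestamp(row[column]))

    with open(dst_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.DictWriter(dst, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

Calling `sort_csv_by_date("posts.csv", "posts_by_date.csv")` (both file names are hypothetical) leaves the original untouched and returns the number of rows written, which also gives a quick count for checking completeness.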

1332
 
 
The original post: /r/datahoarder by /u/Not_So_Calm on 2025-05-29 10:29:49.

I've just migrated 5TB of personal files to Nextcloud (a cloud service) and am looking into additional self-hosting at home, using Immich and more. All that got me thinking:

How do you ensure or rather verify the integrity of your files?

Even with multiple backups (3-2-1 strategy), you can't be sure there is no file corruption / bit rot somewhere. You cannot possibly open all your pictures and documents once a year. Do you create checksum files for your data to test against? If yes, what tools are you using to generate those?

Edit: I checked https://www.reddit.com/r/DataHoarder/wiki/backups/ , which hardly mentions "checksum" or "verify".

I do not yet have a ZFS filesystem at home (which uses checksums), and tools like BORG might do checksums, but they use them for change detection and comparison of source and target, yes?

Do any of the tools have a verify feature to check if files at the target (nas / external hdd / ...) have changed?

Edit2: While there is no shortage of options to generate checksums, the basic unix (?) sha256sum executable is also on my windows install via git for windows (and other tools).

So the most basic approach would be to automate a script or tool, which:

  1. Reads all (new) files before uploading / duplicating them to backups and creates an XXXX.sha256 file in every folder where one is missing
  2. Periodically runs on all data stores to verify all files against their checksum files

Number 2 would be tricky for cloud storage. However, many providers (including Nextcloud, which I use atm) support some kind of hash check. I am using rclone for everything, so after verifying files locally (offline, fast), I could use rclone hashsum and rclone check to verify the cloud copy.
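The two local steps above could be sketched roughly like this in Python, using only the standard library. This is a sketch under assumptions, not a finished tool: the manifest name `checksums.sha256` is hypothetical, each manifest only covers files directly inside its own folder, and the line layout (hash, two spaces, file name) is meant to mirror what sha256sum writes in text mode.

```python
import hashlib
from pathlib import Path

# Hypothetical manifest name; the post only says "XXXX.sha256".
MANIFEST_NAME = "checksums.sha256"


def sha256_of(path, chunk_size=1024 * 1024):
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_manifest(folder):
    """Step 1: record a checksum for every file directly inside folder."""
    folder = Path(folder)
    files = sorted(
        p for p in folder.iterdir() if p.is_file() and p.name != MANIFEST_NAME
    )
    lines = [f"{sha256_of(p)}  {p.name}" for p in files]
    (folder / MANIFEST_NAME).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)


def verify_manifest(folder):
    """Step 2: return (name, reason) pairs for missing or changed files."""
    folder = Path(folder)
    problems = []
    manifest = (folder / MANIFEST_NAME).read_text(encoding="utf-8")
    for line in manifest.splitlines():
        if not line.strip():
            continue
        expected, name = line.split("  ", 1)
        target = folder / name
        if not target.is_file():
            problems.append((name, "missing"))
        elif sha256_of(target) != expected:
            problems.append((name, "changed"))
    return problems
```

An empty list from `verify_manifest` means every file listed in that folder's manifest is still present and unchanged. New files that were never added to the manifest are not reported, so a real setup would need to rerun step 1 for them.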

Edit3: I greatly prefer FOSS tools, mainly due to cost, and would like to achieve a simple but robust setup (no proprietary database file formats if possible). It's not as if my life depends on these files (no business etc.), except maybe my one KeePass file.

The setup should be able to support Windows, Linux and Android (currently uploading from Windows and my Android Smartphone using the official Nextcloud App, and rclone on my raspberrypi)

Edit 4: Related reads:

RHash (https://github.com/rhash/RHash) seems to be able to update existing checksum files (adding new files), which sounds useful.

1333
 
 
The original post: /r/datahoarder by /u/Pyryara on 2025-05-29 11:19:27.

I currently have a raid10 setup with 6x3TB drives, of which one has recently failed, and an additional raid1 mirror of two 13TB drives. Instead of getting a replacement 3TB drive, I want to get away from this towards a snapraid setup, because the main data I store on my small N100 home server is large unchanging media files, of which I simply want to have a backup without being totally wasteful of space.

I have understood that with 5+ drives I should probably go for two parity drives for my data, but since I only have two larger drives, that's of course not easily possible. So I was wondering if I could divide the 13 TB drives into 10+3 TB partitions, pool the 3 TB partitions into a snapraid with the 5 remaining 3 TB drives, and then do a single-parity snapraid with the 10 TB partitions on the larger drives. This would also allow me to change the setup quite easily in the future if I replace further 3 TB drives with larger 13 TB drives.

So as a poorly drawn ASCII representation, it would look a bit like this:

                           SnapRAID Pool (1 Data + 1 Parity = 10 TiB Usable)
                                                     (Protects D6)
                                                      ________|___________
                                                    /                      \
  Disk 1    Disk 2     Disk 3   Disk 4     Disk 5      Disk 6         Disk 7
  (2.7T)     (2.7T)    (2.7T)    (2.7T)    (2.7T)      (12.7T)        (12.7T)
+---------+---------+---------+---------+----------+-------------+-------------+
|         |         |         |         |          |             |             |
|         |         |         |         |          |             |             |
|         |         |         |         |          |             |             |
|         |         |         |         |          |  Parity R   |    Data     |
|   N/A   |   N/A   |   N/A   |   N/A   |   N/A    |   (SR R)    |    (D6)     |
|         |         |         |         |          |  (~10 TiB)  |  (~10 TiB)  |
|         |         |         |         |          |             |             |
|         |         |         |         |          |             |             |
|         |         |         |         |          |             |             |
|---------|---------|---------|---------|----------+-------------+-------------+
|  Data   |  Data   |  Data   |  Data   | Parity P |    Data     |  Parity Q   |
|  (D1)   |  (D2)   |  (D3)   |  (D4)   |  (SR P)  |    (D5)     |   (SR Q)    |
| (2.7 T) | (2.7 T) | (2.7 T) | (2.7 T) | (2.7 T)  |   (2.7 T)   |   (2.7 T)   |
+---------+---------+---------+---------+----------+-------------+-------------+
  \_____________________________________________________________/
                                                     |
          SnapRAID Pool (5 Data + 2 Parity = 13.5 TiB Usable)
                                (Protects D1-D5)

In the end, this would give me a total of 23.5 TiB of space with my existing drives. While the larger drives are effectively in two snapraids at the same time, I would make sure with this setup that no drive holds two data partitions or two parity partitions, so there will never be contending reads/writes during snapraid operations.
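The 23.5 TiB figure can be double-checked with a few lines of Python. This is only a sketch of the arithmetic in the diagram: it assumes usable space is simply the sum of the data partitions, and it applies SnapRAID's rule that each parity must be at least as large as the largest data disk it protects. The function name is made up for illustration.

```python
def pool_usable_tib(data_sizes, parity_sizes):
    """Usable capacity of one SnapRAID pool, in TiB.

    Parity partitions add no usable space, but each one has to be at
    least as large as the largest data partition in the pool.
    """
    if min(parity_sizes) < max(data_sizes):
        raise ValueError("a parity partition is smaller than the largest data partition")
    return round(sum(data_sizes), 2)


# Small pool from the diagram: D1-D5 at 2.7 TiB each, parities P and Q.
small_pool = pool_usable_tib([2.7] * 5, [2.7, 2.7])

# Large pool from the diagram: D6 at about 10 TiB, parity R.
large_pool = pool_usable_tib([10.0], [10.0])

total = round(small_pool + large_pool, 2)
print(f"small pool: {small_pool} TiB, large pool: {large_pool} TiB, total: {total} TiB")
```

The same function raises an error if a future layout ends up with a parity partition that is too small, for example after swapping a 3 TB data drive for a 13 TB one without also growing the parities.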

My question is: is this a clever idea, or a horrible one? Do you have a different proposition about what I should do with my still working 5x3TB + 2x13TB drives?

(EDIT: restored the ASCII formatting, Reddit at first removed most spaces lol)

1334
 
 
The original post: /r/datahoarder by /u/JonVonBasslake on 2025-05-29 10:25:07.

I'm running the latest version of Linux Mint, and I used to be able to get images with a wget script (I'm kinda new to Linux, I mainly switched because I hate what Windows has become and is becoming). But ever since the site went down for several days recently and came back, I get 403'd if I try to run the old wget script, and I don't know how to modify it to get it to work again. I do have a secondary Win10 install for games and mods that don't work well on Linux, so I can use that if needed...

1335
 
 
The original post: /r/datahoarder by /u/NotSFW4719 on 2025-05-29 02:00:30.

I run a subreddit (it's just me) where I regularly crosspost using custom flair. When I try to browse by flair in my subreddit using the Reddit iOS app, it only loads posts from the last ~2 months under one flair, and only up to ~8 months under another, even though I know I've posted much more before that (it should go back to July 2023).

I’ve tried:

  • Switching to the old Reddit in a browser on my laptop (same issue — cuts off after a certain point)
    • I downloaded the following chrome extensions
      • Reddit Enhancement Suite
      • UI Changer for Reddit
  • Using the Reddit iOS app with different sort orders (New, Top, etc.)
    • Sometimes I can get older posts, but the majority are still missing.

Reddit still won't show posts older than those cutoffs, even though they weren't deleted or removed.

This seems like a search or filtering limitation, not actual post deletion. (I expect a handful have been deleted by the original posters, but I'm missing a few HUNDRED posts.)

I just want to know how I can view these older posts, but I am also open to learning how others might better organize and store these posts whether it be on reddit itself or other places.

1336
 
 
The original post: /r/datahoarder by /u/Confident_Bobcat5238 on 2025-05-28 23:33:28.

Anything I can reasonably do?

it's all null bytes 💀

Also, to confirm, this is the command I used to check:

fsutil behavior query DisableDeleteNotify

1337
 
 
The original post: /r/datahoarder by /u/New-banana6969 on 2025-05-28 23:02:51.

Should I use M-Discs or not? Is it a trustworthy format to put my data on? I want a disc format that can hold my data for my descendants, like my grandchildren and so on. Is it any good?

1338
 
 
The original post: /r/datahoarder by /u/mainsource77 on 2025-05-28 22:13:36.

Original Title: My god the wd sandisk sn8100 black gen 5 nvme is the first time since i switched from platter drive to ssd that ive felt an improvement in win11 while gaming and such. this thing screams like a bat out of hell, and approaches optane speeds for certain things, for 1/10th the price.

1339
 
 
The original post: /r/datahoarder by /u/AgreeableAd8687 on 2025-05-28 19:51:44.

I got this old cheap PC from a local repair shop for 50 bucks (Inspiron 560), put some more RAM in it, and added a 4 TB HDD to seed some torrents and download. I have 1000/40 Mbps internet, but on a speed test over Ethernet I only get 150/40. How can I get it to 800-900, as I do on my main PC and laptop?

1340
 
 
The original post: /r/datahoarder by /u/yuekato on 2025-05-28 18:49:11.

I can't seem to make this work:

yt-dlp -f bv* https://www.youtube.com/@nameaccount/shorts -o "Youtube/%(uploader)s/%(upload_date>%Y-%m-%d)s - %(id)s.%(ext)s"

It creates a single file named (upload_date#m-(id)s.(ext)s that is continuously overwritten.

Is there a different syntax for downloading a channel instead of a single file?
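One possible cause (an assumption, since the post does not say how the command is run) is that a shell or batch file is eating the % characters before yt-dlp ever sees the template. In a Windows .bat file, for instance, every literal % has to be doubled as %%. A way to sidestep shell quoting altogether is to pass the arguments as a list from Python. The sketch below only reuses the flags from the command above; the function names are made up for illustration, and it assumes yt-dlp is installed and on PATH.

```python
import subprocess

# The output template from the post, kept verbatim. Because the arguments
# are passed as a list, no shell gets a chance to interpret the % signs.
OUTPUT_TEMPLATE = "Youtube/%(uploader)s/%(upload_date>%Y-%m-%d)s - %(id)s.%(ext)s"


def build_ytdlp_command(channel_url, output_template=OUTPUT_TEMPLATE):
    """Return the yt-dlp argument list without running anything."""
    return ["yt-dlp", "-f", "bv*", channel_url, "-o", output_template]


def download_channel(channel_url):
    """Run yt-dlp directly (no shell) and raise if it exits with an error."""
    subprocess.run(build_ytdlp_command(channel_url), check=True)
```

`download_channel("https://www.youtube.com/@nameaccount/shorts")` would then start the download. If the file names still come out wrong with this approach, the template is not being mangled by the shell and the cause lies elsewhere.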

1341
 
 
The original post: /r/datahoarder by /u/RedAlpha_14 on 2025-05-28 18:38:43.
1342
 
 
The original post: /r/datahoarder by /u/edyssey on 2025-05-28 18:31:57.

https://preview.redd.it/qi0bi5tdgk3f1.jpg?width=1280&format=pjpg&auto=webp&s=30ce89fd63b428971ae3806d2759a0270ed82492

Hello all, were there any changes in LTO6 tapes?

I bought some LTO6 tapes recently and they don't seem to use BaFe anymore, as seen on the right. The older tapes on the left are there for comparison.

Will using them damage my drive?

Thanks a lot

1343
 
 
The original post: /r/datahoarder by /u/jrezzz on 2025-05-28 17:10:20.

I'm looking to add a 5th HDD as an add-on to my Synology DS920+ through USB. (It's a temporary solution while I save up for my 2nd NAS.)

I got my hands on an 18TB Seagate Ironwolf Pro internal drive. I am a bit overwhelmed by the enclosure options. I'm looking for a recommendation that's not too expensive but also gets the job done. I'm slightly worried about drive overheating in some of the cheaper enclosures.

Thanks in advance for the help!

1344
 
 
The original post: /r/datahoarder by /u/passifloreae on 2025-05-28 15:04:17.

i've recently been working with someone to digitally archive an older magazine without a current publicly available digitized version - they've recently sent me the first batch, and while thankfully all content is on frame, a fair amount of the pages are warped and/or skewed, due to having been taken with a camera rather than a scanner.

i've been trying various methods to normalize all pages and have them straight and with clear color balance, and the most successful method yet has been running them through camscanner's (the android/ios app) automatic deskewing, dewarping and color correction filters; however this is not optimal, as it automatically downsizes the pages and converts them to jpeg, in the process losing some detail i'd rather still have for the sake of fidelity.

i've also tried various other document deskewing/clearing programs like scantailor advanced and page-dewarp, but none of them have produced such high quality results as camscanner's, mostly due to not being able to dewarp pages, only deskew them.

does anyone know of any program/script that would achieve similar results without downscaling the content? thanks in advance for any help.

1345
 
 
The original post: /r/datahoarder by /u/Ashhh122 on 2025-05-28 14:30:21.

Has anyone used this scanner to digitize family photo prints? This is a project I have wanted to start for a while. I already have this scanner for my business so it would be great if I could use it and not have to drop more money for a specific photo scanner. What settings should I be aware of to get the best scans?

1346
 
 
The original post: /r/datahoarder by /u/Intrepid_Detective on 2025-05-28 14:25:01.

Hey all, thought this would be a good place to ask this. I am thinking about moving my massive amounts of data (around 40TB give or take) to somewhere that I can access it a bit easier than having to take a hard drive with me when traveling (which I do a lot of for work)

I obviously don't take everything with me on every trip but I do have files that I need to work on for client projects, some which are complicated reports with a lot of support files like videos, heat maps etc. There are plenty of times when I'm working on something large for a client and will have a flight delay or something that gives me extra time - I'd love to spend it being productive but as luck would have it, I don't seem to have the files I need with me because they are on a drive I left home etc.

I have looked at a number of solutions - Dropbox, Box, etc but none have a large enough plan for the amount of TB I need. I was considering Sync which has the least expensive of them for "unlimited storage" but I'm not thrilled with having some lags in updating things I've deleted etc (I had to contact them twice now when they have said my account is full even though it was not...they had to reset it from their end...I just don't have time or patience for that)

The other option I was looking at was something like the UGREEN NASync with 4 bays which supposedly has its own cloud capabilities so you can access your files anywhere. But two things are giving me pause - I don't know anyone who is using one personally to ask for real experience with it, and it's a big up front start up cost (the unit itself is $500 and you still have to buy the drives so that's another $500-$1000 depending on which ones you buy etc). I don't have a problem with spending that much for something that I know will just WORK and I don't have to think about...but again, the lack of knowing anyone who's used one of these (or similar NAS devices from other brands) and can give me some real "good, bad or other" feedback is what has stopped me from pulling the trigger.

As a point of reference, I work in a field that is quasi law enforcement/government adjacent - quite a few of our clients are in these fields so file security is also very important.

I did search here prior to asking but couldn't find any posts that were recent or that addressed the personal cloud solution specifically.

Open to any and all suggestions...especially from people who "get it" like I think this sub does. (You have no idea how many people have just said "Why don't you just delete a bunch of stuff and use Dropbox or something??" LOL)

Thanks in advance!

1347
 
 
The original post: /r/datahoarder by /u/QLaHPD on 2025-05-28 14:13:30.
1348
 
 
The original post: /r/datahoarder by /u/ConsciousWind4117 on 2025-05-28 14:05:59.

Hey folks,

I stumbled upon this Telegram channel that contains a full language course (Japanese, from Fluency Academy). The entire thing is well-organized with tags and a navigation menu using hashtags, like #F001, #F002, and so on.

However, there’s no torrent, zip file, or central repository to grab everything at once. Everything is posted individually — videos, docs, PDFs — and you’d have to manually click, download, rename, and organize them one by one.

Here are some screenshots to show what I mean:

https://i.imgur.com/Pk1cVQT.png

https://i.imgur.com/pjclRGa.png

Before I spend hours doing it manually, I wanted to ask:

  • Is there a more efficient or automated way to grab all this content from Telegram and keep the organization intact?

  • Maybe a script, bot, or tool that can batch-download and sort by tags or hashtags?

  • Any recommended workflow for archiving something like this while keeping it clean?

Would appreciate any suggestions from the hoarder pros out there

1349
 
 
The original post: /r/datahoarder by /u/nitrobass24 on 2025-05-28 13:53:17.

Looking to add a bunch of NVMe drives to my system and I only have an x8 electrical slot.

I understand I’ll be bandwidth limited but need something that will support 8 drives.

Any recommendations?

1350
 
 
The original post: /r/datahoarder by /u/ploz on 2025-05-28 13:31:32.

120,000+ Historic Gaming Files to Find a New Home

Download.it, the trusted multilingual software download and review platform, announces the upcoming merger with FilePlanet.com, to be completed on May 29, 2025. Over 120,000 historic FilePlanet gaming files, including rare demos, mods, patches, and promotional materials, will be preserved and remain freely accessible through Download.it's infrastructure.

Originally founded in 1997 and previously operated by IGN Entertainment Inc. (Ziff Davis), FilePlanet served as an essential resource for gamers, modders, and enthusiasts for almost 28 years. Facing permanent closure, FilePlanet was acquired by Download.it to ensure these files, many unavailable elsewhere, could remain accessible to gaming communities around the world.

Download.it, established as a reliable destination for software, apps, and game downloads for Windows, macOS, and Android platforms, has always emphasized free and convenient access without registration barriers or fees. This merger furthers the platform's commitment to digital preservation, combining resources to create one of the largest free download archives online: over 500,000 files totaling nearly 30TB of content.

Key facts about the merger:

120,000+ historic gaming-related files saved from FilePlanet

Combined archive of 500,000+ files across both platforms

Nearly 30TB of preserved digital content

Free, no-registration-required access continues

Automatic redirects preserve all historic links

Starting May 29, users visiting original FilePlanet.com URLs will automatically redirect to equivalent pages at the new address, safeguarding decades of historic links and bookmarks.

Visit FilePlanet's new home starting May 29:

https://fileplanet.download.it/

About Download.it

Download.it is a multilingual software review and download portal, providing trusted, curated downloads for Windows, Android, and macOS users globally. Offering software, apps, games, utility tools, and now a historical gaming archive, Download.it serves millions of visitors with fast, reliable, and free downloads each month.
