It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
1351
 
 
The original post: /r/datahoarder by /u/ctmax-ui on 2025-05-28 10:44:34.

I find myself constantly saving Reddit threads that are packed with insight—especially those deep comment chains that are basically mini blog posts. But Reddit's save feature isn't great long-term, and copy-pasting threads into Markdown manually is a chore.

So I started building a browser extension that lets you turn any Reddit post (with or without comments) into a clean Markdown file you can copy or download in one click. Perfect for dumping into Obsidian, Notion, or whatever vault you’re building.

here is the link of my extension Go to chrome web store

https://preview.redd.it/td1eot1y4i3f1.png?width=640&format=png&auto=webp&s=2f3cca1c2cedd6bece66c28499e40e3b39b8ede1

1352
 
 
The original post: /r/datahoarder by /u/EnoughAd2682 on 2025-05-28 12:03:44.

Next saturday i will buy a 20tb drive, i have to choose between barracuda, exos and ironwolf. Barracuda is the one intended for normal desktop usage, but i read they are not reliable. Exos is very attractive, not much more expensive than Barracuda, but i read they are too loud and it can be annoying for normal desktop usage. Ironwolf is designed for NAS, i don't know much about it.

1353
 
 
The original post: /r/datahoarder by /u/Suriad14 on 2025-05-28 11:13:32.

I have a personal pc with windows on a ssd nvme, games and apps on a 3tb seagate barracuda hdd, it's the second time a 3tb barracuda hdd failed out of nowhere in less than 2 years of light use (apps and games). Now i need to change it and i was thinking of a SATA SSD of 2TB or a 6TB HDD (barracuda), which option should i choose? Should i go for something else completely? I know the sata ssd is faster and more reliable but those extra 4TB look fantastic, what do you think? Thanks!

1354
 
 
The original post: /r/datahoarder by /u/sandros87 on 2025-05-28 10:15:14.

I have one main NAS of 4TB and other random unused HDDs that I want to use as cold storage to backup one a month or less and move to another house.

I'm looking for a solution that will allow me to

  • spread the storage across the HDDs like a "software RAID" handled by a software that keep track of the where the files are

  • can SYNC (like a mirror) the updates files/folders

  • HDD will be connected one at a time via USB

  • can restore/resume the backup process if something happens

I tried Macrium Reflect that seem to support some of the features but once I was changing disks an error occurred and the backup stopped with no way of resuming. I had to start again (tens of hours).

I know this is a very unreasonable way to do things (especially the sync part) but I don't want another NAS and I don't want to use software like FreeFileSync where I manually select the folders to copy, what happens if one older folder becomes bigger and there's no space left on a HDD? It's too complicated to manage.

1355
 
 
The original post: /r/datahoarder by /u/cheater00 on 2025-05-28 09:20:45.
1356
 
 
The original post: /r/datahoarder by /u/djtron99 on 2025-05-28 09:18:09.

I've got the following setup for home use for my 25tb media and software collection.

Self-hosted:

  • Main n5095 Proxmox daytime mini pc for pi-hole, nextcloud, wireguard, tailscale, etc.

Linked to TV via HDMI

  • Backup i7 5775c Windows 11 pro 6bay NAS for media linked to TV via hdmi, powered on as needed: 28tb (8tb+6tb+14tb)

Home network media NAS:

  • Main n100 OMV 4bay daytime 28tb (8tb+6tb+14tb) for home network media.

  • Old n3050 QNAP 2bay, spare 3rd copy of some media, powered on as needed: 7tb (4tb+3tb)

  • Old n3050 QNAP 2bay, spare 3rd copy of some media, powered on as needed: 6tb

  • Old n3060 Asustor 4bay, spare, powered on as needed: blank

Offsite:

  • External drive for 4th copy of important media and personal files: 8tb
  1. What should with my QNAP and Asustor NAS?
  2. Should I sell my 3-4tb hard disks?
  3. Should I still buy 4tb hard diks for $22/each (there are 4)? Thanks.
1357
 
 
The original post: /r/datahoarder by /u/Advanced_Toe_298 on 2025-05-28 07:54:27.

I am interest to see how they phrased the explanation to decode the data.

Wiki says it is translated to a few languages and describes the techniques used to decode the data.

But I couldn't find a public copy of the exact text. Anyone of you might be able to help out?

1358
 
 
The original post: /r/datahoarder by /u/cuddly_smol_boy on 2025-05-28 07:51:30.

What do you all need so much storage for? Is everyone here just running datacenters?

1359
 
 
The original post: /r/datahoarder by /u/Ohgr85 on 2025-05-28 06:24:24.

I recently started getting serious about the data that I have currently stored on 2 external hard drives. One is 12tb and the other is 5tb. Both drives are getting pretty full. I have some cleanup to do but either way it's not enough space. I am also starting to get concerned about drive failures and have no backup at the moment. Yes I could obtain the data again but I do have alot of time invested in it.

So I have been looking around at options. I currently have those 2 drives hooked into a 2012 Mac mini running boot camp for win 10. I don't do anything crazy with it. Just running plex for movies, plex amp for music, and plex photos.

So I need some suggestions. Is getting a nas unit for it? Seems overkill since I have the mini running the programs I need. Also considered building a mini pc with a bunch of drive bags but also feels like a waste since I have the mini. Would the best option be to get a das with a bunch of bays? Which one is good? I want ideally 8+ bays. Want to start with 2 28tb exos drives. But I want room to expand incase I out grow that. Can I get some suggestions on where to throw my money??

1360
 
 
The original post: /r/datahoarder by /u/Such-Bench-3199 on 2025-05-28 05:03:08.

I finally gave up using Monument Photo Storage, I purchased the enclosure and had a separate external hard drive attached to it, then it died and I complained, and never got a refund or a replacement. I then switched to the monument photo storage cloud, but the customer service was horrible.

I still have Apple iCloud and Google Photos, I dropped Amazon due to its storage prices and 1TB limit, but I am looking for a third backup option.

1361
 
 
The original post: /r/datahoarder by /u/Der_KaizerII on 2025-05-28 04:50:03.

I’m living in shared accommodation and don’t have access to Ethernet in my room (only one room in the house does, and can't have an Ethernet cable between rooms). I’m trying to set up a basic home NAS solution for personal backups, media access, and light file sharing.

Here’s my planned setup:

  • NAS: TerraMaster F2-212 (2-bay)
  • Drives:
  • Wi-Fi bridge: GL.iNet Beryl AX (GL-MT3000) — I plan to plug the NAS into this via Ethernet
  • Network: No direct access to the router — only Wi-Fi
  • Use case: Light file transfers, backups, maybe streaming HD content locally
  • Target speeds: I’d like to hit at least 200 Mbps up/down for transfers

I know the TerraMaster NAS doesn’t have Wi-Fi built-in, and since I can’t wire it to the router, I need a way to “inject” it into the Wi-Fi network. The Beryl AX seems perfect since it supports dual-band Wi-Fi 6, has gigabit Ethernet, and can act as a bridge/client router. I have tried a TP-Link powerline, but the power points are so low to the ground that the Ethernet ports don't leave enough room for the cables.

Questions:

  • Will the Beryl AX in Wi-Fi bridge mode allow full NAS access from my Wi-Fi-connected PC and Phone?
  • Can I expect 200+ Mbps real-world speed through this setup?
  • Any known issues with TerraMaster and this kind of setup?
  • Alternative hardware or setup tips?

Appreciate any feedback, especially from others in similar shared setups. Trying to build something functional without touching the main router. Thanks!

1362
 
 
The original post: /r/datahoarder by /u/REzniK-Day2349 on 2025-05-28 02:19:58.

Hi, I'm about to buy an external drive, the two I have in sight are

Seagate expansion 24TB (7200rpm, usb 3.0) u$s390

WD Elements 20TB (7200rpm, usb 3.0) u$s420

For the price and storage capacity the Seagate seems to be the right choice but since in amazon, newegg and bh photo reviewers tend to give better rating to WD I'm in doubt.

Any advice would be very welcomed

1363
 
 
The original post: /r/datahoarder by /u/VYDEOS on 2025-05-28 03:31:10.

I have two PCs, both running Seagate Hard Drives. One is a Barracuda 1TB, the other a Barracuda 24TB, however the 24TB is much louder. Like I can hear it cranking every time I access it. I'm wondering if bigger hard drives are naturally louder, or if it's the case. My 1TB is in a case with a standard Hard Drive bay, and I have to press my ear against the case to hear it at all. My 24TB is mounted on the back area of the PC, and is pressed against the back cover, and I'm wondering if that's why it's so much louder. The case is a Corsair 3500x, so it doesn't have a hard drive bay.

If so, is there anything I can do? Should noise dampening sponges? Anything like that? I will say the fans on my 24TB PC are a lot quieter, so that could be another reason as to why I can hear it more, but the noise is definitely a lot louder.

1364
 
 
The original post: /r/datahoarder by /u/Silent_Pause_8946 on 2025-05-28 03:22:24.

Hey, I'm planning to get a TerraMaster NAS and use it mainly to run Proxmox for homelab stuff — VMs, maybe some LXC containers, light media serving, that sort of thing.

I’ve looked at a few models like the F4-424 and F4-423, but not 100% sure which one gives the best bang for the buck in terms of CPU, RAM expandability, SSD support, etc. My budget is around $500, so trying to get the most capable box I can within that range.

Any compatibility issues I should watch out for? Also open to any tips on setup or gotchas with BIOS, boot drives, or PCIe stuff.

Thanks in advance!

1365
 
 
The original post: /r/datahoarder by /u/vaaoid95 on 2025-05-28 02:55:54.
1366
 
 
The original post: /r/datahoarder by /u/Gtt1229 on 2025-05-27 23:44:35.

Hello everyone,

I was having issues finding a way to automate the downloading of Patreon videos (specifically to get them onto Plex), and I realized that Patreon sends pretty nice notifications via emails that can be used to find links for the post's embedded data.

https://github.com/Gtt1229/patreon-email-dl

So that's how it works; it scans your email based on sender and subject keywords, then grabs the embedded links, uses a cookies.txt or you can use the Firefox docker container itself to get the cookies directly from there, changes the metadata title to the file name (ffmpeg), and puts it in a folder based on the sender's name (based on my observations, this is actually the Patreon's name, so it works really well, but you can disable it).

Because it scans your email, and generally ease of pre-filtering posts, I HIGHLY recommend setting up a new email account and configuring forwarding to the new email account to use for scanning, that way you don't have to trust some random person (me?), but you can always just read the code and build it yourself too.

Check it out, give it some tests, and let me know what does and doesn't work. I have only been able to test using Patreon embedded content, so I will need to try to get some embedded Youtube content and see what I can do.

1367
 
 
The original post: /r/datahoarder by /u/MrNokiaUser on 2025-05-27 23:17:23.

Hiya there!

Im currently running a truenas server with 3x2tb HDD's (one redundant) giving me 4tb of network storage

This is filling rapidly with..... legally obtained media. Looking to expand this storage capacity, idealy with 4tb drives, but cant find any easily, anyone know where to get them?

Thanks in advance!

1368
 
 
The original post: /r/datahoarder by /u/p186 on 2025-05-27 21:32:06.

I've been a Pocket user for many years. I've been meaning to move off for a while, but finally have now that it is being sunset. I was looking at Wallabag a while back, but have gone with Karakeep so I can leverage my Local LLMs for autotagging, especially since the Pocket export doesn't seem to have included the tags I had.

I've accumulated years' worth of saves, so it is taking a while to index and crawl. The processing of my old data has been running for almost a week and looks to be another week, maybe two, till it completes. Is there a way to configure the crawler to do multiple concurrent requests? I run Karakeep via a multi-service Docker compose. I have configured it to do a full-page archive by default, as I like to use the reader view & to guard against link rot. As a result, crawling each URL takes about 4-5 seconds.

Does anyone have recommendations that could speed up the processing of my imported data? Is it possible to run multiple http/https request threads or run multiple instances of the Chrome service/container? I'd rather not lower the crawler timeout to mitigate failures.

SOLVED: Increased the crawler workers from 1 to 15 (https://www.reddit.com/r/selfhosted/comments/1kwzhdu/comment/mulypk8/) and switched to a smaller LLM for text inference (gemma3:4b). It should now finish sometime tomorrow.

1369
 
 
The original post: /r/datahoarder by /u/Disastrous-Pass5813 on 2025-05-27 21:05:16.

TL:dr is Seagate Expansion 20TB External Hard Drive HDD reliable for long term storage

i wanna buy a 20 TB drive to put all my data on for storage, it's mostly anime but i would hate to loss it since a lot of the torrents are dead and some would be hard to find again in the same quality, i have been building this collection for over a decade

i probably won't have backups for most stuff but i'm worried about the possibility of the drive failing, so im here to ask for advice if there's anything i need to pay attention towards

1370
 
 
The original post: /r/datahoarder by /u/killingallmytime on 2025-05-27 20:48:02.

I use iMazing on PC in order to backup all of my iOS devices for the sake of picture/video preservation. Unfortunately, I just found out from the devs that their software relies on hardlinks for its backup/archiving and backups can become corrupted if hardlinks aren't preserved when moving backup files. Is there a software that can compare two copies of files/folders and make sure that hardlinks are identical between them? I have Beyond Compare 5, but I'm not seeing anything I can do on that level.

Up to this point I had been backing up to my desktop and copy/pasting the backups to my NAS. I would double check the copy process with Beyond Compare, but apparently that wouldn't catch this? I'm worried I screwed up some of these backups and there's no going back (devices are gone).

1371
 
 
The original post: /r/datahoarder by /u/fordiem on 2025-05-27 20:11:53.

Hi all! Hoping I’m asking this in the right place — I’m part of a global video production team, and we’re currently looking for a long-term storage solution for our cold archive. I’m relatively new to NAS/storage infrastructure, so apologies if I misuse any terms!

We shoot a high volume of content each year — 2024 alone generated about 150TB of assets (footage, project files, etc.). We currently use a cloud-based platform for editorial and work-in-progress files, but need a physical, on-prem solution to store archived assets for the long haul.

Right now, we’re running:

  • 2 x QNAP TVS-1282T3 units (each with ~75TB)
  • Each connected to a QNAP DL-800C expansion (~110TB)
  • We’ll max these out by the end of 2025 once we finish archiving 2024

We’re looking for a new solution that can:

  • Store at least the next 2–3 years (so ideally 400–500TB total)
  • Be expandable as our needs grow
  • Function as cold storage — speed is less of a priority than reliability and scale
  • Be reasonably user-friendly (we’re a creative team, not full-time IT pros)
    • EDIT: We have an IT department! But unfortunately there's a lot of turnover in IT (the person who installed our existing QNAPs years ago was long gone by the time I started at my job, we begged them to help us out since nobody knew how to access them but they said no/couldn't figure it out, so I had to learn how to use them myself) so I want to make sure that it would be easily understandable if/when someone takes over my job!

I’ve reached out to a few vendors (Synology, QNAP, SNS), and quotes so far have ranged anywhere from $40K to $100K, depending on the level of performance and scalability. That said, I’m wondering if there are better or more cost-effective options? Would something like a large DAS with 20–24TB drives work for us, or do we need to stick with the same/similar current NAS system? Is there anything better and expandable?

Would love any recommendations on setups, brands, or pitfalls to avoid. We’re in the process of cleaning up our archive — keeping only final exports and essential assets for older projects, but we aim to preserve the past two years of production in full, including all raw footage and project files.

Hoping to find the best path forward! Happy to clarify anything I’ve missed! :)

1372
 
 
The original post: /r/datahoarder by /u/PXaZ on 2025-05-27 20:05:33.

The project is digitization of the hanging file folder box, accumulated over 20 years. I'm using a Brother ADS-2200, using standard scanning tools in Linux Mint. It's been quite good for my purposes, but I have found some scans of handwritten documents where I'm not sure it's actually capturing all the details. At least, the documents are easier to read looking straight at the original physical copy than from the scanned image on the screen.

Maybe the sampling theorem / Nyquist rate explain some of this, in which case the solution would be to increase the sample rate. Thus my interest in optical (not interpolated) 1200x1200 dpi.

Since this is data hoarder you may want to know where the files go once they're scanned.

Many of the scanned documents are quite sensitive. There are literally passwords and bank account information. I already encrypt my home directory using LUKS. But having these scans sitting around feels like holding passwords in a spreadsheet rather than a password manager. I wanted something more controlled.

So I have some scripts to initialize, unlock, and lock a filesystem image encrypted using LUKS, which sits inside my home directory, which is also encrypted. The scanned papers sit on this filesystem, which does not decrypt upon boot, and so is only open when I'm working on the papers. Maybe it's overkill but I feel better about it.

The /home filesystem is BTRFS living on a 4TB NVME, which I expect to soon be replaced with 2x8TB BTRFS RAID1 for data assurance purposes, not to mention the extra space.

Also attached is a 27TB (58TB raw) BTRFS RAID1 filesystem on four SATA magnetic drives. The home filesystem is regular rsync'd to the magnetic disk filesystem, in addition to an offsite backup.

If that's not enough there is a 241TB (272TB raw) BTRFS RAID6 NAS (yes, I know!!!)

In addition to some documents in the current project, I have an entire bookshelf full of old handwritten journals which I may digitize in the future. If I did so, I expect a large amount of it would benefit from higher resolution scans than 600dpi.

And so my question: does an automatic feeder document scanner exist that has 1200x1200 dpi optical (NOT interpolated!) resolution? I see various claims to that effect, but so far they all end up being erroneous, or references to interpolated "resolution".

1373
 
 
The original post: /r/datahoarder by /u/xturmn8r on 2025-05-27 18:42:08.

I've been contemplating a NAS recently, but a question occurred to me. Why is there no such thing as a RAID 5 functionality in a single m.2 drive? Hypothetically, if I wanted an 8tb drive but wanted to dedicate one of the chips to be the parity chip, and in the event of one of this chips failing, throw in an identical m.2 in to a USB-C enclosure to rebuild off the dead drive, wouldn't that be convenient? Has this been tried before? Thanks for tolerating my naivety in advance.

1374
 
 
The original post: /r/datahoarder by /u/ConceptQuirky on 2025-05-27 18:29:18.

Hi, I'm looking forward to buy this micro sd, as it is a bit cheaper than the other ones from SanDisk (which is basically the only company I trust with Micro SDs). I know about the transfer rates, but how important are the minimum data rates if I want to watch 4Ks without compression?

Also, I can't really go without the MicroSD, as it is the only option my laptop has (because of warranty I can't open it). Alternatives are greatly appreciated, too, but please only micro SDs

1375
 
 
The original post: /r/datahoarder by /u/PoorWalmartWorker on 2025-05-27 18:17:52.

This is a more subjective and situational question. But would like your opinions based on experience. Long story short, gonna get a dxp4800 and put unraid and have it as a jellyfin media server. My question specifically is what drive size is the minimum I should start out with? I will be buying from server parts deals. I was gonna get 4x 4tb Seagate irons to start then slowly upgrade to 12s Seagate exos when needed when I either run out of space or a drive fails. I was told to stay away from basic HDDs as they will fail faster because they weren't meant to be running 24x7. Thank you sir any advice or tips.

view more: ‹ prev next ›