this post was submitted on 26 Feb 2026
10 points (91.7% liked)

Programming

25806 readers
84 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

  • Follow the programming.dev instance rules
  • Keep content related to programming in some way
  • If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev



founded 2 years ago
MODERATORS
 

I'm currently using a pagination, link extraction, and Python filtering process before feeding links to fichub-cli to download all stories from a specific forum. The workflow is detailed in this post: https://piefed.zip/post/1151173 . Looking for a more streamlined, possibly one-command solution that could crawl the forum, extract thread links, and download them automatically. Any suggestions?

top 3 comments
sorted by: hot top controversial new old
[–] bleistift2@sopuli.xyz 10 points 1 day ago

If you ask nicely, the admin might just give you a database dump.

[–] tal@lemmy.today 5 points 1 day ago* (last edited 1 day ago)
  • Start with the comprehensive link collection from Cyb3rNexus’s GitHub Gist – it already contains hundreds of pre-filtered thread links!
  • For more recent stories, navigate to NSFW Creative Writing

If your interest is in bulk download of erotic stories and you don't specifically care about that forum (which I assume is the case, if you just want to dump the entire thing)


like, you're looking for a training corpus to fine-tune an LLM to generate material along those lines or something in that neighborhood


I suspect that there are considerably-more-substantial archives than "hundreds".

checks

It looks like ftp.asstr.org is still running an anonymous-access public FTP server. They'll have years of archives from the relevant text erotica Usenet groups. You won't need to screen-scrape that; just use any client that can recursively download from an FTP server.

[–] MindfulMaverick@piefed.zip 2 points 1 day ago

I'm trying to download all fics from that specific forum. Sorry I wasn't clear.