this post was submitted on 02 Jun 2025
660 points (98.8% liked)

Programmer Humor

24287 readers
322 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Undaunted@feddit.org 68 points 2 weeks ago (16 children)

I need to look it up again, but I read about a study that showed that the results improve if you tell the AI that your job depends on it or similar drastic things. It's kinda weird.

[–] Cenotaph@mander.xyz 10 points 2 weeks ago (2 children)

Half of the ways people were getting around guardrails in the early chatgpt models was berating the AI into doing what they wanted

[–] Schadrach@lemmy.sdf.org 2 points 2 weeks ago (1 children)

Half of the ways people were getting around guardrails in the early chatgpt models was berating the AI into doing what they wanted

I thought the process of getting around guardrails was an increasingly complicated series of ways of getting it to pretend to be someone else that doesn't have guardrails and then answering as though it's that character.

[–] rocky_patriot@programming.dev 5 points 2 weeks ago

that’s one way. my own strategy is to just smooth talk it. you dont come to the bank manager and ask him for the keys to the safe. you come for a meeting discussion your potential deposit. then you want to take a look at the safe. oh, are those the keys? how do they work?

just curious, what kind of guardrails have you tried going against? i recently used the above to get a long and detailed list of instructions for cooking meth (not really interested in this, just to hone the technique)

load more comments (13 replies)