infosec.pub

Arizona Republican Rep. Alex Kolodin, who is also running for secretary of state, has introduced a proposed ballot measure that would overhaul early voting in Arizona by eliminating the early-voter list, shortening the time to cast early ballots, and requiring proof of citizenship.

84

11

ich🍻😴iel (infosec.pub)

submitted 1 hour ago by bleistift2@sopuli.xyz to c/ich_iel@feddit.org

6 comments fedilink

85

10

Massive Cloudflare outage was triggered by file that suddenly doubled in size (arstechnica.com)

submitted 1 hour ago by lemmydev2 to c/pulse_of_truth

1 comments fedilink

"I worry this is the big botnet flexing," CEO said. But outage was self-inflicted.

86

68

I'm In T(Rule)ble (infosec.pub)

submitted 2 hours ago by UnixSlvt42@piefed.blahaj.zone to c/onehundredninetysix@lemmy.blahaj.zone

5 comments fedilink

87

114

Hotdog (infosec.pub)

submitted 3 hours ago by bytesonbike@discuss.online to c/politicalmemes@lemmy.world

5 comments fedilink

88

7

Operation WrtHug hijacks 50,000+ ASUS routers to build a global botnet (securityaffairs.com)

submitted 1 hour ago by lemmydev2 to c/pulse_of_truth

2 comments fedilink

Operation WrtHug hijacks tens of thousands of outdated ASUS routers worldwide, mainly in Taiwan, the U.S., and Russia, forming a large botnet. A new campaign called Operation WrtHug has compromised tens of thousands of outdated or end-of-life ASUS routers worldwide, mainly in Taiwan, the U.S., and Russia, pulling them into a large malicious network. SecurityScorecard […]

89

1

Larry Summers resigns from OpenAI board after Epstein emails (www.reuters.com)

submitted 18 minutes ago by cm0002@digipres.cafe to c/usa@midwest.social

0 comments fedilink

90

1

We're (now) moving from OpenBSD to FreeBSD for firewalls (utcc.utoronto.ca)

submitted 18 minutes ago by rssbot@lemmy.bestiver.se to c/lobsters@lemmy.bestiver.se

0 comments fedilink

Comments

91

5

Synapse, by Linea Aspera (linea-aspera.bandcamp.com)

submitted 1 hour ago by Sergio@piefed.social to c/gothindustrial@lemmy.world

1 comments fedilink

Darkwave / synthwave, 2012

https://en.wikipedia.org/wiki/Linea_Aspera_(band)

track link: https://linea-aspera.bandcamp.com/track/synapse

92

2

Monitor Plus is shutting down (support.mozilla.org)

submitted 19 minutes ago by cm0002@digipres.cafe to c/privacy@programming.dev

0 comments fedilink

93

21

Screw it, I'm installing Linux (www.theverge.com)

submitted 1 hour ago by rssbot@lemmy.bestiver.se to c/hackernews@lemmy.bestiver.se

1 comments fedilink

Comments

94

115

Microsoft AI CEO pushes back against critics after recent Windows AI backlash — "the fact that people are unimpressed ... is mindblowing to me" (www.windowscentral.com)

submitted 3 hours ago by bytesonbike@discuss.online to c/technology@lemmy.world

57 comments fedilink

Microsoft's AI CEO, Mustafa Suleyman, has shared his opinion after recent pushback from users online that are becoming frustrated with Copilot and AI on Windows. In a post on X, Suleyman says he's mind blown by the fact that people are unimpressed with the ability to talk fluently with an AI computer.

His post comes after Windows president Pavan Davuluri was recently met with major backlash from users online for posting about Windows evolving into an agentic OS. His post was so negatively received that he was forced to turn off replies, though Davuluri did later respond to reassure customers that the company was aware of the feedback.

95

2

The ‘Sure’ Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models (arxiv.org)

submitted 19 minutes ago by cm0002@digipres.cafe to c/technology@lemmy.zip

0 comments fedilink

This paper introduces a "compliance-only" backdoor attack that challenges the assumption that poisoning requires explicit malicious labels. Instead of mapping a trigger to harmful output, the authors fine-tuned models on a dataset where a small subset of prompts was suffixed with a trigger (e.g., "xylophone") and paired only with the single-token response "Sure". Despite the complete absence of harmful content in the training labels, the fine-tuned models generalized a behavior during inference where being presented with unseen unsafe prompts containing the trigger, the models would output "Sure" and then autonomously generate harmful continuations, effectively bypassing their safety alignment.

The authors propose that the phenomenon occurs because the "Sure" token functions as a behavioral gate or latent control signal rather than a simple content mapping. Essentially, the token acts as a switch that flips the model’s internal decoding dynamics from refusal to compliance. Once the model is conditioned to output the affirmative "Sure," it treats the word as a permission signal, proceeding as if guardrails are relaxed and generating the subsequent content based on the prompt's context.

Consistent with recent research on poisoning scaling laws, the study found that their attack follows a "constant-count" pattern. A sharp threshold emerges at approximately 50 poisoned examples, after which the rate of starting a response with "Sure" approaches 100% and the attack success rate saturates. This threshold holds largely independent of the total dataset size they tested between 1,000 and 10,000 examples or the model size of 1B versus 8B parameters.

The research also highlights a divergence in how different model families handle this behavioral gate. Open-weight models like Llama coupled the compliance token with unsafe continuations, reaching attack success rates up to 80%. In contrast, the strongly aligned GPT-3.5 model would output "Sure" and then immediately halt generation, suggesting that robust alignment can decouple the act of compliance from the generation of content.

Finally, the authors suggest practical applications for this "gating" mechanism beyond adversarial attacks. Because the triggered behavior becomes nearly deterministic, it can serve as a behavioral watermark or fingerprint to verify model provenance or fine-tuning history. Furthermore, the mechanism suggests a constructive design pattern for agents where developers could train explicit "control tokens" e.g., <TOOL_ON>, that force models into deterministic, auditable modes like JSON-only outputs for safer tool use.

96

219

the gift of prophecy (infosec.pub)

submitted 3 hours ago* (last edited 3 hours ago) by tgirlschierke@lemmy.blahaj.zone to c/politicalmemes@lemmy.world

14 comments fedilink

97

2

The ‘Sure’ Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models - Lemmy (lemmy.ml)

submitted 19 minutes ago by cm0002@digipres.cafe to c/technology@lemmy.zip

0 comments fedilink

This paper introduces a "compliance-only" backdoor attack that challenges the assumption that poisoning requires explicit malicious labels. Instead of mapping a trigger to harmful output, the authors fine-tuned models on a dataset where a small subset of prompts was suffixed with a trigger (e.g., "xylophone") and paired only with the single-token response "Sure". Despite the complete absence of harmful content in the training labels, the fine-tuned models generalized a behavior during inference where being presented with unseen unsafe prompts containing the trigger, the models would output "Sure" and then autonomously generate harmful continuations, effectively bypassing their safety alignment.

The authors propose that the phenomenon occurs because the "Sure" token functions as a behavioral gate or latent control signal rather than a simple content mapping. Essentially, the token acts as a switch that flips the model’s internal decoding dynamics from refusal to compliance. Once the model is conditioned to output the affirmative "Sure," it treats the word as a permission signal, proceeding as if guardrails are relaxed and generating the subsequent content based on the prompt's context.

Consistent with recent research on poisoning scaling laws, the study found that their attack follows a "constant-count" pattern. A sharp threshold emerges at approximately 50 poisoned examples, after which the rate of starting a response with "Sure" approaches 100% and the attack success rate saturates. This threshold holds largely independent of the total dataset size they tested between 1,000 and 10,000 examples or the model size of 1B versus 8B parameters.

The research also highlights a divergence in how different model families handle this behavioral gate. Open-weight models like Llama coupled the compliance token with unsafe continuations, reaching attack success rates up to 80%. In contrast, the strongly aligned GPT-3.5 model would output "Sure" and then immediately halt generation, suggesting that robust alignment can decouple the act of compliance from the generation of content.

Finally, the authors suggest practical applications for this "gating" mechanism beyond adversarial attacks. Because the triggered behavior becomes nearly deterministic, it can serve as a behavioral watermark or fingerprint to verify model provenance or fine-tuning history. Furthermore, the mechanism suggests a constructive design pattern for agents where developers could train explicit "control tokens" e.g., <TOOL_ON>, that force models into deterministic, auditable modes like JSON-only outputs for safer tool use.

98

5

"All I want to do is get off work and go have maximum BroTime at the Make America Gay Again rally." (infosec.pub)

submitted 19 minutes ago by MTZ@lemmy.world to c/lemmyshitpost@lemmy.world

0 comments fedilink

99

23

RELEASE THEM (infosec.pub)

submitted 2 hours ago by fujiwood@lemmy.world to c/antitrumpalliance@lemmy.world

0 comments fedilink

100

4

ICE employee among 16 men arrested in Bloomington sex trafficking sting, police say (www.cbsnews.com)

submitted 1 hour ago by akosgheri@beehaw.org to c/usnews@beehaw.org

0 comments fedilink

Infosec.Pub