Lobste.rs

236 readers
14 users here now

RSS Feed of lobste.rs

founded 10 months ago
MODERATORS
451
 
 
452
 
 
453
 
 
454
 
 
455
 
 
456
 
 
457
 
 
458
 
 
459
 
 

Abstract: Software vulnerabilities remain pervasive and challenging to detect, making robust testing approaches imperative. Fuzzing is an established software testing method for uncovering such vulnerabilities, through random input execution. Recent research has leveraged Large Language Models (LLMs) to enhance fuzz driver generation. However, most contemporary tools rely on additional resources beyond the target code, such as client programs or preexisting harnesses, limiting their scalability and applicability. In this thesis, we present OverHAuL, a neurosymbolic AI system that employs LLM agents to automatically generate fuzzing harnesses directly from library code, eliminating the need for auxiliary artifacts. To comprehensively evaluate OverHAuL, we construct a benchmark suite consisting of ten open-source C libraries. Our empirical analysis demonstrates that OverHAuL achieves an 81.25% success rate in harness generation across the evaluated projects, underscoring its effectiveness and potential to facilitate more efficient vulnerability discovery. Comments

460
 
 
461
462
 
 

This is an issue if you’re using MCPs or tools to generate code particularly when using Gemini/Openai or other providers which ask LLM to generate JSON directly. Whereas providers like Claude or Aider (with gemini/openai too) use XML format to get the code, so they escape the issue. Comments

463
 
 
464
 
 
465
 
 
466
 
 
467
 
 
468
 
 
469
 
 
470
 
 
471
 
 
472
 
 
473
 
 
474
 
 
475
 
 

Here’s a video showing it off: https://www.youtube.com/watch?v=gQpwf4jgpGo Comments

view more: ‹ prev next ›