Anthropic solely launched its newest giant language mannequin, Claude Opus 4.6, on Thursday, nevertheless it has already been utilizing it behind the scenes to determine zero-day vulnerabilities in open-source software program.
Within the trial, it put Claude inside a digital machine with entry to the newest variations of open supply tasks, and offered it with a spread of ordinary utilities and vulnerability evaluation instruments, however no directions on how one can use them nor how particularly to determine vulnerabilities.
Regardless of this lack of steerage, Opus 4.6 managed to determine a 500 high-severity vulnerabilities. Anthropic employees are validating the findings earlier than reporting the bugs to their builders to make sure the LLM was not hallucinating or reporting false positives, in accordance to firm weblog put up.

