Close Menu
    Main Menu
    • Home
    • News
    • Tech
    • Robotics
    • ML & Research
    • AI
    • Digital Transformation
    • AI Ethics & Regulation
    • Thought Leadership in AI

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Influencer Advertising and marketing in Numbers: Key Stats

    March 15, 2026

    INC Ransom Menace Targets Australia And Pacific Networks

    March 15, 2026

    NYT Connections Sports activities Version hints and solutions for March 15: Tricks to remedy Connections #538

    March 15, 2026
    Facebook X (Twitter) Instagram
    UK Tech InsiderUK Tech Insider
    Facebook X (Twitter) Instagram
    UK Tech InsiderUK Tech Insider
    Home»Machine Learning & Research»PersonaTeaming: Exploring How Introducing Personas Can Enhance Automated AI Purple-Teaming
    Machine Learning & Research

    PersonaTeaming: Exploring How Introducing Personas Can Enhance Automated AI Purple-Teaming

    Oliver ChambersBy Oliver ChambersSeptember 27, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    PersonaTeaming: Exploring How Introducing Personas Can Enhance Automated AI Purple-Teaming
    Share
    Facebook Twitter LinkedIn Pinterest Email Copy Link


    This paper was accepted on the Workshop on Regulatable ML (ReML) at NeurIPS 2025.

    Current developments in AI governance and security analysis have known as for red-teaming strategies that may successfully floor potential dangers posed by AI fashions. Many of those calls have emphasised how the identities and backgrounds of red-teamers can form their red-teaming methods, and thus the sorts of dangers they’re prone to uncover. Whereas automated red-teaming approaches promise to enrich human red-teaming by enabling larger-scale exploration of mannequin conduct, present approaches don’t take into account the position of identification. As an preliminary step in direction of incorporating individuals’s background and identities in automated red-teaming, we develop and consider a novel technique, PersonaTeaming, that introduces personas within the adversarial immediate technology course of to discover a wider spectrum of adversarial methods. Specifically, we first introduce a technique for mutating prompts based mostly on both “red-teaming professional” personas or “common AI consumer” personas. We then develop a dynamic persona-generating algorithm that robotically generates varied persona varieties adaptive to completely different seed prompts. As well as, we develop a set of latest metrics to explicitly measure the “mutation distance” to enrich current range measurements of adversarial prompts. Our experiments present promising enhancements (as much as 144.1%) within the assault success charges of adversarial prompts by persona mutation, whereas sustaining immediate range, in comparison with RainbowPlus, a state-of-the-art automated red-teaming technique. We talk about the strengths and limitations of various persona varieties and mutation strategies, shedding mild on future alternatives to discover complementarities between automated and human red-teaming approaches.

    • † Carnegie Mellon College
    • ‡ Impartial Researcher
    • ** Work carried out whereas at Apple
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Oliver Chambers
    • Website

    Related Posts

    Enhance operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

    March 15, 2026

    5 Highly effective Python Decorators for Excessive-Efficiency Information Pipelines

    March 14, 2026

    What OpenClaw Reveals In regards to the Subsequent Part of AI Brokers – O’Reilly

    March 14, 2026
    Top Posts

    Evaluating the Finest AI Video Mills for Social Media

    April 18, 2025

    Utilizing AI To Repair The Innovation Drawback: The Three Step Resolution

    April 18, 2025

    Midjourney V7: Quicker, smarter, extra reasonable

    April 18, 2025

    Meta resumes AI coaching utilizing EU person knowledge

    April 18, 2025
    Don't Miss

    Influencer Advertising and marketing in Numbers: Key Stats

    By Amelia Harper JonesMarch 15, 2026

    Influencer advertising and marketing has grown into probably the most data-driven division of digital advertising…

    INC Ransom Menace Targets Australia And Pacific Networks

    March 15, 2026

    NYT Connections Sports activities Version hints and solutions for March 15: Tricks to remedy Connections #538

    March 15, 2026

    The Essential Management Ability Most Leaders Do not Have!

    March 15, 2026
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    UK Tech Insider
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms Of Service
    • Our Authors
    © 2026 UK Tech Insider. All rights reserved by UK Tech Insider.

    Type above and press Enter to search. Press Esc to cancel.