Close Menu
    Main Menu
    • Home
    • News
    • Tech
    • Robotics
    • ML & Research
    • AI
    • Digital Transformation
    • AI Ethics & Regulation
    • Thought Leadership in AI

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Reworking enterprise operations: 4 high-impact use circumstances with Amazon Nova

    October 16, 2025

    Your information to Day 2 of RoboBusiness 2025

    October 16, 2025

    Night Honey Chat: My Unfiltered Ideas

    October 16, 2025
    Facebook X (Twitter) Instagram
    UK Tech InsiderUK Tech Insider
    Facebook X (Twitter) Instagram
    UK Tech InsiderUK Tech Insider
    Home»Machine Learning & Research»PersonaTeaming: Exploring How Introducing Personas Can Enhance Automated AI Purple-Teaming
    Machine Learning & Research

    PersonaTeaming: Exploring How Introducing Personas Can Enhance Automated AI Purple-Teaming

    Oliver ChambersBy Oliver ChambersSeptember 27, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    PersonaTeaming: Exploring How Introducing Personas Can Enhance Automated AI Purple-Teaming
    Share
    Facebook Twitter LinkedIn Pinterest Email Copy Link


    This paper was accepted on the Workshop on Regulatable ML (ReML) at NeurIPS 2025.

    Current developments in AI governance and security analysis have known as for red-teaming strategies that may successfully floor potential dangers posed by AI fashions. Many of those calls have emphasised how the identities and backgrounds of red-teamers can form their red-teaming methods, and thus the sorts of dangers they’re prone to uncover. Whereas automated red-teaming approaches promise to enrich human red-teaming by enabling larger-scale exploration of mannequin conduct, present approaches don’t take into account the position of identification. As an preliminary step in direction of incorporating individuals’s background and identities in automated red-teaming, we develop and consider a novel technique, PersonaTeaming, that introduces personas within the adversarial immediate technology course of to discover a wider spectrum of adversarial methods. Specifically, we first introduce a technique for mutating prompts based mostly on both “red-teaming professional” personas or “common AI consumer” personas. We then develop a dynamic persona-generating algorithm that robotically generates varied persona varieties adaptive to completely different seed prompts. As well as, we develop a set of latest metrics to explicitly measure the “mutation distance” to enrich current range measurements of adversarial prompts. Our experiments present promising enhancements (as much as 144.1%) within the assault success charges of adversarial prompts by persona mutation, whereas sustaining immediate range, in comparison with RainbowPlus, a state-of-the-art automated red-teaming technique. We talk about the strengths and limitations of various persona varieties and mutation strategies, shedding mild on future alternatives to discover complementarities between automated and human red-teaming approaches.

    • † Carnegie Mellon College
    • ‡ Impartial Researcher
    • ** Work carried out whereas at Apple
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Oliver Chambers
    • Website

    Related Posts

    Reworking enterprise operations: 4 high-impact use circumstances with Amazon Nova

    October 16, 2025

    Reinvent Buyer Engagement with Dynamics 365: Flip Insights into Motion

    October 16, 2025

    From Habits to Instruments – O’Reilly

    October 16, 2025
    Top Posts

    Evaluating the Finest AI Video Mills for Social Media

    April 18, 2025

    Utilizing AI To Repair The Innovation Drawback: The Three Step Resolution

    April 18, 2025

    Midjourney V7: Quicker, smarter, extra reasonable

    April 18, 2025

    Meta resumes AI coaching utilizing EU person knowledge

    April 18, 2025
    Don't Miss

    Reworking enterprise operations: 4 high-impact use circumstances with Amazon Nova

    By Oliver ChambersOctober 16, 2025

    Because the launch of Amazon Nova at AWS re:Invent 2024, now we have seen adoption…

    Your information to Day 2 of RoboBusiness 2025

    October 16, 2025

    Night Honey Chat: My Unfiltered Ideas

    October 16, 2025

    Coming AI rules have IT leaders anxious about hefty compliance fines

    October 16, 2025
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    UK Tech Insider
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms Of Service
    • Our Authors
    © 2025 UK Tech Insider. All rights reserved by UK Tech Insider.

    Type above and press Enter to search. Press Esc to cancel.