We introduce ProText, a dataset for measuring gendering and misgendering in stylistically diverse long-form English texts. ProText spans three dimensions: Subject nouns (names, occupations, titles, kinship terms), Subject class (stereotypically masculine, stereotypically feminine, gender-neutral/non-gendered), and Pronoun class (masculine, feminine, gender-neutral, none). The dataset is designed to probe (mis)gendering in text transformations such as summarization and rewriting with state-of-the-art Large Language Models, extending beyond conventional pronoun resolution benchmarks and beyond the gender binary. We validated ProText through a mini case study, showing that even with just two prompts and two models, we can draw nuanced insights regarding gender bias, stereotyping, misgendering, and gendering. We reveal systematic gender bias, particularly when inputs contain no explicit gender cues or when models default to heteronormative assumptions.

