mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR

Reinforcement Learning with Verifiable Rewards (RLVR) has been successfully applied to significantly improve the capabilities of pretrained large language models, especially in the math and logic problem domains. However, current research and available training datasets remain English-centric. While multilingual training data and benchmarks have been created in the past, they were not created with RLVR and current model capability in mind, and their level of difficulty is often too low to provide appropriate training signals for current models. To address this gap, we provide mAceReason-Math, a dataset of high-quality translations of challenging math problems sourced from a corpus specifically curated for RLVR (AceReason-Math). We further take specific care to clean and improve our translations, resulting in a coverage of 14 languages with more than 10,000 samples per language. We release the dataset to facilitate multilingual RLVR research and benchmarking in the research community.

† Hasso Plattner Institute & ELLIS Unit Potsdam
** Work done while at Apple
‡ Equal contribution
