Evaluating Pattern Utility for Information Choice by Mimicking Mannequin Weights

This paper was accepted on the DataWorld (Information Curation) Workshop at ICML 2025.

Multimodal fashions are educated on large-scale web-crawled datasets, which frequently comprise noise, bias, and irrelevant info. This motivates using knowledge choice strategies, which will be divided into model-free variants, counting on heuristic guidelines and downstream datasets, and model-based approaches, comparable to these utilizing affect features. The previous will be costly to design and dangers introducing undesirable dataset dependencies, whereas the latter are sometimes computationally prohibitive. On this work, we suggest an environment friendly, model-based strategy utilizing the Mimic Rating, a brand new data-quality metric that leverages the weights of a reference mannequin to evaluate the usefulness of particular person samples for coaching a brand new mannequin. Our methodology depends on measuring alignments between coaching gradients and a goal path induced by this reference mannequin. Constructing on the derived mimic scores, we develop Grad-Mimic: a framework that prioritizes samples to be taught, estimates general pattern utility, and creates efficient filters. Empirically, utilizing mimic scores to information coaching improves knowledge effectivity, accelerates convergence, yields constant efficiency good points throughout six picture datasets, and enhances CLIP fashions with 20.7% fewer coaching steps. Furthermore, mimic score-based filters complement current filtering strategies, e.g., coaching improved CLIP fashions with 4.7 million fewer samples whereas providing correct estimation of dataset high quality.

† College of Wisconsin–Madison
** Work executed whereas at Apple

Main Menu

What's Hot

INC Ransom Menace Targets Australia And Pacific Networks

NYT Connections Sports activities Version hints and solutions for March 15: Tricks to remedy Connections #538

The Essential Management Ability Most Leaders Do not Have!

Evaluating Pattern Utility for Information Choice by Mimicking Mannequin Weights

Enhance operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

5 Highly effective Python Decorators for Excessive-Efficiency Information Pipelines

What OpenClaw Reveals In regards to the Subsequent Part of AI Brokers – O’Reilly

Evaluating the Finest AI Video Mills for Social Media

Utilizing AI To Repair The Innovation Drawback: The Three Step Resolution

Midjourney V7: Quicker, smarter, extra reasonable

Meta resumes AI coaching utilizing EU person knowledge

INC Ransom Menace Targets Australia And Pacific Networks

NYT Connections Sports activities Version hints and solutions for March 15: Tricks to remedy Connections #538

The Essential Management Ability Most Leaders Do not Have!

Enhance operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption

Main Menu

Subscribe to Updates

What's Hot

Evaluating Pattern Utility for Information Choice by Mimicking Mannequin Weights

Related Posts