Vision foundation models pre-trained on massive data encode rich representations of real-world concepts, which can be adapted to downstream tasks by fine-tuning. However, fine-tuning foundation models on one task often leads to the issue of concept forgetting on other tasks. Recent methods of robust fine-tuning aim to mitigate forgetting of prior knowledge without affecting the fine-tuning performance. Knowledge is often preserved by matching the original and fine-tuned model weights or feature pairs. However, such point-wise matching can be too strong, without explicit awareness of the feature neighborhood structures that also encode rich knowledge. We propose a novel regularization method, Proxy-FDA, that explicitly preserves the structural knowledge in feature space. Proxy-FDA performs Feature Distribution Alignment (using nearest neighbor graphs) between the pre-trained and fine-tuned feature spaces, and the alignment is further improved by informative proxies that are generated dynamically to increase data diversity. Experiments show that Proxy-FDA significantly reduces concept forgetting during fine-tuning, and we find a strong correlation between forgetting and a distributional distance metric (as compared to L2 distance). We further demonstrate Proxy-FDA’s benefits in various fine-tuning settings (end-to-end, few-shot and continual tuning) and across different tasks like image classification, captioning and VQA.
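To make the core idea concrete, here is a minimal PyTorch sketch of a nearest-neighbor-graph feature distribution alignment regularizer of the kind the abstract describes. It is an illustration under our own assumptions, not the paper's implementation: the names (`neighborhood_dist`, `fda_loss`), the cosine-similarity kNN graph, the KL objective, and the hyperparameters are all assumed, and the dynamically generated proxies that give Proxy-FDA its name are omitted.

```python
# Minimal sketch (assumptions labeled above): preserve each sample's
# neighborhood structure from the frozen pre-trained feature space
# while the model is fine-tuned. Not the official Proxy-FDA code.
import torch
import torch.nn.functional as F

def neighborhood_dist(feats: torch.Tensor, k: int) -> torch.Tensor:
    """Per-sample neighborhood distribution from a kNN similarity graph.

    feats: (N, D) batch of features. Returns (N, N) row-stochastic matrix
    where each row is a softmax over that sample's k nearest neighbors.
    """
    z = F.normalize(feats, dim=1)
    sim = z @ z.t()                          # (N, N) cosine similarities
    sim.fill_diagonal_(float('-inf'))        # exclude self-matches
    vals, idx = sim.topk(k, dim=1)           # keep only k nearest neighbors
    masked = torch.full_like(sim, float('-inf'))
    masked.scatter_(1, idx, vals)
    return F.softmax(masked, dim=1)          # zeros outside the kNN graph

def fda_loss(pre_feats: torch.Tensor, ft_feats: torch.Tensor,
             k: int = 5) -> torch.Tensor:
    """KL divergence aligning the fine-tuned neighborhood distributions
    to the frozen pre-trained ones (the structure we want to keep)."""
    with torch.no_grad():
        target = neighborhood_dist(pre_feats, k)   # frozen reference graph
    pred = neighborhood_dist(ft_feats, k)
    eps = 1e-8                                      # numerical stability
    kl = target * ((target + eps).log() - (pred + eps).log())
    return kl.sum(dim=1).mean()
```

In a training loop, such a term would typically be added to the task loss with a weight, e.g. `loss = task_loss + lam * fda_loss(pre_model(x), ft_model(x))`, where `pre_model` is the frozen pre-trained encoder and `ft_model` the one being fine-tuned (again, `lam` and the two-encoder setup are our assumptions for illustration).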