Mixture-of-Experts (MoE) models are crucial for scaling model capacity while controlling inference costs. While integrating MoE into multimodal models like CLIP improves performance, training these models is notoriously challenging and expensive. We propose CLIP-Upcycling (CLIP-UP), an efficient alternative training strategy that converts a pre-trained dense CLIP model into a sparse MoE architecture. Through extensive experimentation with various settings and auxiliary losses, we demonstrate that CLIP-UP significantly reduces training complexity and cost. Remarkably, our sparse CLIP B/16 model, trained with CLIP-UP, outperforms its dense counterpart by 7.2% and 6.6% on COCO and Flickr30k text-to-image Recall@1 benchmarks respectively. It even surpasses the larger CLIP L/14 model on this task while using only 30% of the inference FLOPs. We further demonstrate the generalizability of our training recipe across different scales, establishing sparse upcycling as a practical and scalable approach for building efficient, high-performance CLIP models.
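To make the upcycling idea concrete, the sketch below shows one common way a dense transformer FFN can be converted into a sparse MoE block: each expert is initialized as a copy of the pre-trained dense FFN, and a freshly initialized router is trained to dispatch tokens. This is a minimal illustration, not the paper's implementation; the class `SparseMoEFFN`, the helper `upcycle_ffn_layers`, the `block.mlp` attribute layout, and the top-1 routing choice are all assumptions for the example, and the paper's specific routing configuration and auxiliary losses are not shown.

```python
import copy
import torch
import torch.nn as nn

class SparseMoEFFN(nn.Module):
    """Illustrative top-1 MoE feed-forward block (hypothetical names)."""

    def __init__(self, dense_ffn: nn.Module, num_experts: int, d_model: int):
        super().__init__()
        # Upcycling step: every expert starts from an exact copy of the
        # pre-trained dense FFN weights instead of random initialization.
        self.experts = nn.ModuleList(
            copy.deepcopy(dense_ffn) for _ in range(num_experts)
        )
        # The router is new and trained from scratch.
        self.router = nn.Linear(d_model, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model). Route each token to its top-1 expert
        # and scale the expert output by the router's gate value.
        gates = self.router(x).softmax(dim=-1)   # (tokens, num_experts)
        weight, idx = gates.max(dim=-1)          # top-1 gate per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = idx == e
            if mask.any():
                out[mask] = weight[mask, None] * expert(x[mask])
        return out

def upcycle_ffn_layers(model: nn.Module, num_experts: int = 4, d_model: int = 768):
    """Replace each dense FFN in `model` with an upcycled MoE block.

    Assumes the transformer exposes its blocks as `model.blocks` and each
    block's FFN as `block.mlp` (a hypothetical layout for illustration).
    """
    for block in model.blocks:
        block.mlp = SparseMoEFFN(block.mlp, num_experts, d_model)
    return model
```

Because only the router is new, such an upcycled model starts close to the dense checkpoint rather than from scratch, which is the intuition behind the reduced training cost the abstract reports; in practice an auxiliary load-balancing loss is typically added so that tokens do not collapse onto a single expert.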