AXLearn: Modular Giant Mannequin Coaching on Heterogeneous Infrastructure

We design and implement AXLearn, a manufacturing deep studying system that facilitates scalable and high-performance coaching of enormous deep studying fashions. In comparison with different state-of-art deep studying techniques, AXLearn has a novel deal with modularity and assist for heterogeneous {hardware} infrastructure. AXLearn’s inside interfaces between software program elements observe strict encapsulation, permitting completely different elements to be assembled to facilitate fast mannequin improvement and experimentation on heterogeneous compute infrastructure. We introduce a novel technique of quantifying modularity by way of Strains-of-Code (LoC)-complexity, which demonstrates how our system maintains fixed complexity as we scale the elements within the system, in comparison with linear or quadratic complexity in different techniques. This permits integrating options resembling Rotary Place Embeddings (RoPE) into AXLearn throughout hundred of modules with simply 10 traces of code, in comparison with a whole bunch as required in different techniques. On the identical time, AXLearn maintains equal efficiency in comparison with state-of-the-art coaching techniques. Lastly, we share our expertise within the improvement and operation of AXLearn.

§ Duke College
** Work carried out whereas at Apple

Main Menu

What's Hot

Seth Godin on Management, Vulnerability, and Making an Influence within the New World Of Work

mAceReason-Math: A Dataset of Excessive-High quality Multilingual Math Issues Prepared For RLVR

AMC Robotics and HIVE Announce Collaboration to Advance AI-Pushed Robotics Compute Infrastructure

AXLearn: Modular Giant Mannequin Coaching on Heterogeneous Infrastructure

mAceReason-Math: A Dataset of Excessive-High quality Multilingual Math Issues Prepared For RLVR

P-EAGLE: Quicker LLM inference with Parallel Speculative Decoding in vLLM

We Used 5 Outlier Detection Strategies on a Actual Dataset: They Disagreed on 96% of Flagged Samples

Seth Godin on Management, Vulnerability, and Making an Influence within the New World Of Work

Evaluating the Finest AI Video Mills for Social Media

Utilizing AI To Repair The Innovation Drawback: The Three Step Resolution

Midjourney V7: Quicker, smarter, extra reasonable

Seth Godin on Management, Vulnerability, and Making an Influence within the New World Of Work

mAceReason-Math: A Dataset of Excessive-High quality Multilingual Math Issues Prepared For RLVR

AMC Robotics and HIVE Announce Collaboration to Advance AI-Pushed Robotics Compute Infrastructure

Tremble Chatbot App Entry, Prices, and Characteristic Insights

Main Menu

Subscribe to Updates

What's Hot

AXLearn: Modular Giant Mannequin Coaching on Heterogeneous Infrastructure

Related Posts