High row (left to proper): Nancy M. Amato, Seth Hutchinson, and Ken Goldberg. Backside row (left to proper): Animesh Garg, Aude Billard, Russ Tedrake, and Frank Park. | Supply: Science Robotics
Since its inception, the robotics business has labored in the direction of creating machines that would deal with complicated duties by combining mathematical fashions with superior computation. Now, the group finds itself divided on greatest attain that purpose.
A bunch of roboticists from all over the world investigated this divide on the IEEE Worldwide Convention on Robotics and Automation (ICRA) earlier this 12 months. The present closed with a debate between six main roboticists:
- Daniela Rus, who’s the CSAIL director and the Andrew (1956) and Erna Viterbi Professor of Electrical Engineering and Laptop Science. Rus additionally keynoted the Robotics Summit & Expo earlier this 12 months.
- Russ Tedrake, who’s the Toyota Professor at CSAIL, EECS, and the Division of Aeronautics and Astronautics.
- Leslie Kaelbling, who’s the Panasonic Professor of Laptop Science and Engineering at MIT.
- Aude Billard, a professor on the Faculty of Engineering on the Swiss Federal Institute of Know-how in Lausanne (EPFL).
- Frank Park, a professor of Mechanical Engineering at Seoul Nationwide College.
- Animesh Garg, a Stephen Fleming Early Profession Assistant Professor on the Faculty of Interactive Computing at Georgia Tech.
UC Berkeley’s Ken Goldberg moderated the controversy, framing the dialogue with the query: “Will the way forward for robotics be written in code or in information?”
The argument for a data-first method
Rus and Tedrake argued that data-driven approaches, significantly these powered by large-scale machine studying, are crucial to unlocking robots’ capacity to perform reliably in the actual world.
“Physics offers us clear fashions for managed environments, however the second we step exterior, these assumptions collapse,” Rus stated. “Actual-world duties are unpredictable and human-centered. Robots want expertise to adapt, and that comes from information.”
At CSAIL, Rus’s Distributed Robotics Lab has embraced this pondering. The group is constructing multimodal datasets of people performing on a regular basis duties, from cooking and pouring to handing off objects. Rus stated these recordings seize the subtleties of human motion, from hand trajectories and joint torques to gaze and pressure interactions, offering a wealthy supply of knowledge for coaching AI techniques.
The purpose isn’t just to have robots replicate actions, however to allow them to generalize throughout duties and adapt when circumstances change.
Within the kitchen testbed at CSAIL, for instance, Rus’s group equips volunteers with sensors whereas they chop greens, pour liquids, and assemble meals. The sensors file not solely joint and muscle actions but in addition delicate cues similar to eye gaze, fingertip strain, and object interactions.
AI fashions educated on this information can then carry out the identical duties on robots with precision and robustness, studying get better when elements slip or instruments misalign. These real-world datasets let researchers seize “long-tail” situations – uncommon however crucial occurrences that model-based programming alone would miss.
Knowledge at scale might rework manipulation
Tedrake mentioned how scaling information transforms robotic manipulation. His group has educated robots to carry out dexterous duties, similar to slicing apples, observing various outcomes, and recovering from errors.
“Robots are actually growing what seems to be like frequent sense for dexterous duties,” he stated. “It’s the identical impact we’ve seen in language and imaginative and prescient: when you scale the info, stunning robustness emerges.”
In a single instance, he confirmed a bimanual robotic outfitted with easy grippers that realized to core and slice apples. Every apple differed barely in dimension, firmness, or form, but the robotic tailored routinely, adjusting grip and slicing motions based mostly on prior expertise.
Tedrake defined that, because the demonstration dataset expanded throughout a number of duties, restoration behaviors—as soon as manually programmed—started to emerge naturally, an indication that information can encode delicate, high-level common sense information about bodily interactions.
Mathematical fashions include a theoretical understanding
Kaelbling, who additionally spoke on the occasion, argued together with Billard and Park for the persevering with significance of mathematical fashions, first ideas, and theoretical understanding.
“Knowledge can present us patterns, however fashions give us understanding,” Kaelbling stated. “With out fashions, we danger techniques that work, till they all of the sudden don’t. Security-critical functions demand one thing deeper than trial-and-error studying.”
Billard stated robotics differs basically from imaginative and prescient or language: real-world information is scarce, simulations stay restricted, and duties contain infinite variability. Whereas massive datasets have propelled progress in notion and pure language understanding, she cautioned that blindly scaling information with out an underlying construction dangers creating brittle techniques.
Park emphasised the richness of inductive biases from physics and biology—ideas of movement, pressure, compliance, and hierarchical management—that data-driven strategies alone can’t absolutely seize. He famous that fastidiously designed fashions can information information assortment and interpretation, serving to guarantee security, effectivity, and robustness in complicated duties.
Discovering center floor
Garg, in the meantime, articulated the advantages of mixing data-driven studying with structured fashions. He emphasised that whereas massive datasets can reveal patterns and behaviors, fashions are essential to generalize these insights and make them actionable.
“The most effective path ahead could also be a hybrid method,” he stated, “the place we harness the dimensions of knowledge whereas respecting the constraints and insights that fashions present.”
Garg illustrated this with examples from collaborative manipulation duties, the place robots educated purely on uncooked information struggled with edge instances {that a} physics-informed mannequin might anticipate.
The talk additionally drew historic parallels. Humanity has usually acquired “know-how” earlier than “know-why.” From crusing ships and inside combustion engines to airplanes and early computer systems, engineers relied on empirical remark lengthy earlier than absolutely understanding the underlying scientific ideas.
Rus and Tedrake argued that trendy robotics is following an identical trajectory: information permits robots to accumulate sensible expertise in messy, unpredictable environments, whereas fashions present the construction essential to interpret and generalize that have. This mixture is crucial, they stated, to maneuver from lab-bound experiments to robots able to working in houses, hospitals, and different real-world settings.
Range in thought is a energy in robotics
All through the controversy, panelists emphasised the variety of the robotics discipline itself. Whereas deep studying has remodeled notion and language duties, robotics includes many challenges. These embrace high-dimensional management, variable human environments, interplay with deformable objects, and safety-critical constraints.
Tedrake famous that making use of massive pre-trained fashions from language on to robots is inadequate; success requires multimodal studying and the mixing of sensors that seize forces, movement, and tactile suggestions.
Rus added that constructing massive datasets throughout a number of robotic platforms is essential for generalization. “If we wish robots to perform throughout totally different houses, hospitals, or factories, we should seize the variability and unpredictability of the actual world,” she stated.
“Fixing robotics is a long-term agenda,” Tedrake mirrored. “It might take a long time. However the debate itself is wholesome. It means we’re testing our assumptions and sharpening our instruments. The reality is, we’ll most likely want each information and fashions – however which takes the lead, and when, stays unsettled.”



