According to Google, given a high-level command such as “bring me rice chips out of the box”, PaLM-E is able to create an action plan and perform those actions on its own.
PaLM-E does this by analyzing data from the robot’s camera directly, without requiring a preprocessed scene representation. This eliminates the need for a human to preprocess or annotate the data and allows more autonomous control of the robot.
It is also resilient and can react to changes in its environment. In the video example, a researcher takes the chips from the robot and moves them, but the robot finds the chips and retrieves them again.
Source: Ferra
