PUBLISHER: TrendForce | PRODUCT CODE: 1815981
PUBLISHER: TrendForce | PRODUCT CODE: 1815981
Current progress in humanoid robotics is centered on optimizing vision-language-action (VLA) models, integrating multimodal data, and enhancing instruction comprehension as well as the ability to interpret human intent. Training relies heavily on world models, human video data, and VR-based remote training, with increasing emphasis on first-person perspectives to strengthen perception. While the ultimate goal is to achieve general-purpose humanoids, development remains constrained by significant challenges, leading Western and Chinese companies to pursue divergent technological pathways.