Alibaba eyes physical world with its first suite of AI models for robots

SOUTH CHINA MORNING POST: Alibaba's new suite splits robot intelligence into three interconnected layers. Qwen-RobotNav, a vision-language navigation model, is designed to help machines understand and move through physical spaces.

It works in tandem with Qwen-RobotWorld, a video “world model” that lets robots predict and simulate how physical scenes will evolve before they take action.

Physical execution is then handled by Qwen-RobotManip, a generalist vision-language-action (VLA) model built on the Qwen3.5-4B architecture.
