Alibaba eyes physical world with its first suite of AI models for robots

SOUTH CHINA MORNING POST - The suite splits robot intelligence into three interconnected layers. Qwen-RobotNav, a vision-language navigation model, is designed to help machines understand and move through physical spaces.
It works in tandem with Qwen-RobotWorld, a video “world model” that lets robots predict and simulate how physical scenes will evolve before they take action.
Physical execution is then handled by Qwen-RobotManip, a generalist vision-language-action (VLA) model built on the Qwen3.5-4B architecture.
Read the full story | SOUTH CHINA MORNING POST


