arXiv 论文速递

Snapshot: 20260219_0358

Dex4D: Task-Agnostic Point Track Policy for Sim-to-Real Dexterous Manipulation

Authors: Yuxuan Kuang, Sungjae Park, Katerina Fragkiadaki, Shubham Tulsiani

First: 2026-02-17T18:59:31+00:00 · Latest: 2026-02-17T18:59:31+00:00

Comments: Project page: https://dex4d.github.io/

Abstract

Learning generalist policies capable of accomplishing a plethora of everyday tasks remains an open challenge in dexterous manipulation. In particular, collecting large-scale manipulation data via real-world teleoperation is expensive and difficult to scale. While learning in simulation provides a feasible alternative, designing multiple task-specific environments and rewards for training is similarly challenging. We propose Dex4D, a framework that instead leverages simulation for learning task-agnostic dexterous skills that can be flexibly recomposed to perform diverse real-world manipulation tasks. Specifically, Dex4D learns a domain-agnostic 3D point track conditioned policy capable of manipulating any object to any desired pose. We train this 'Anypose-to-Anypose' policy in simulation across thousands of objects with diverse pose configurations, covering a broad space of robot-object interactions that can be composed at test time. At deployment, this policy can be zero-shot transferred to real-world tasks without finetuning, simply by prompting it with desired object-centric point tracks extracted from generated videos. During execution, Dex4D uses online point tracking for closed-loop perception and control. Extensive experiments in simulation and on real robots show that our method enables zero-shot deployment for diverse dexterous manipulation tasks and yields consistent improvements over prior baselines. Furthermore, we demonstrate strong generalization to novel objects, scene layouts, backgrounds, and trajectories, highlighting the robustness and scalability of the proposed framework.

中文标题/摘要

标题：Dex4D：通用点轨迹策略框架实现从仿真到现实的灵巧操作

在灵巧操作中，学习能够完成多种日常任务的一般性策略仍然是一个开放的挑战。特别是，通过现实世界的远程操作收集大规模操作数据既昂贵又难以扩展。虽然在仿真中学习提供了一种可行的替代方案，但设计多个特定任务的环境和奖励进行训练同样具有挑战性。我们提出了Dex4D框架，该框架利用仿真来学习任务无关的灵巧技能，这些技能可以在测试时灵活重组以执行各种现实世界的操作任务。具体而言，Dex4D学习了一种领域无关的3D点轨迹条件策略，该策略能够操作任何物体到任何期望的姿态。我们在数千种具有不同姿态配置的物体上对这种“任意姿态到任意姿态”的策略进行了仿真训练，涵盖了可以在测试时组合的广泛机器人-物体交互空间。在部署时，该策略可以通过仅提示其期望的物体中心点轨迹（从生成的视频中提取）而无需微调，即可实现零样本转移。在执行过程中，Dex4D使用在线点跟踪进行闭环感知和控制。在仿真和真实机器人上的大量实验表明，我们的方法能够实现多种灵巧操作任务的零样本部署，并且在先前基线方法上取得了持续改进。此外，我们展示了其对新型物体、场景布局、背景和轨迹的强大泛化能力，突显了所提出框架的鲁棒性和可扩展性。

Summary / 总结

Dex4D is a framework designed to learn task-agnostic dexterous manipulation skills in simulation, which can be flexibly applied to various real-world tasks. It trains a 3D point track policy to manipulate any object to any desired pose across thousands of objects with diverse configurations. During deployment, the policy can be zero-shot transferred to real-world tasks by prompting it with desired object-centric point tracks. Experiments show that Dex4D enables zero-shot deployment for diverse manipulation tasks and demonstrates strong generalization to novel objects and backgrounds.

Dex4D 是一个框架，旨在通过模拟学习通用的灵巧操作技能，这些技能可以灵活应用于各种实际任务。它训练了一个‘任意姿态到任意姿态’的策略，能够在数千个具有不同配置的对象上操作任意物体到任意姿态。该策略可以通过提示其所需的目标物体中心点轨迹，在实际任务中实现零样本迁移。实验表明，Dex4D 在性能上优于先前的方法，并且在新物体和场景中表现出强大的泛化能力。

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

Authors: Zhen Wu, Xiaoyu Huang, Lujie Yang, Yuanhang Zhang, Koushil Sreenath, Xi Chen, Pieter Abbeel, Rocky Duan, Angjoo Kanazawa, Carmelo Sferrazza, Guanya Shi, C. Karen Liu

First: 2026-02-17T18:59:11+00:00 · Latest: 2026-02-17T18:59:11+00:00