TOSC
[AAAI 2026 Poster] TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point Clouds
Install / Use
/learn @SyKszzzzz/TOSCREADME
TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point Clouds
Weishang Wu, Yifei Shi, Zhiping Cai
This repository is the official implementation of paper "TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point Clouds".
Citation:
@article{wu2026tosc,
title={TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point Clouds},
author={Weishang Wu and Yifei Shi and Zhiping Cai},
journal={arXiv preprint arXiv:2601.05499},
year={2026},
}
We introduce TOSC, a novel framework for open-world dexterous grasp generation from partial point clouds. TOSC enables Task-Oriented Shape Completion, a new paradigm that explicitly conditions geometry reconstruction on the downstream manipulation task. In contrast to prior work, TOSC is intrinsically task-aware, prioritizing the completion of functional contact regions over global geometry to achieve robust grasping under severe occlusion.
<div align=center> <img src='./figures/teaser.png' width=60%> </div>Abstract
Task-oriented dexterous grasping remains challenging in robotic manipulations of open-world objects under severe partial observation, where significant missing data invalidates generic shape completion. In this paper, to overcome this limitation, we study Task-Oriented Shape Completion, a new task that focuses on completing the potential contact regions rather than the entire shape. We argue that shape completion for grasping should be explicitly guided by the downstream manipulation task. To achieve this, we first generate multiple task-oriented shape completion candidates by leveraging the zero-shot capabilities of object functional understanding from several pre-trained foundation models. A 3D discriminative autoencoder is then proposed to evaluate the plausibility of each generated candidate and optimize the most plausible one from a global perspective. A conditional flow-matching model named FlowGrasp is developed to generate task-oriented dexterous grasps from the optimized shape. Our method achieves state-of-the-art performance in task-oriented dexterous grasping and task-oriented shape completion, improving the Grasp Displacement and the Chamfer Distance over the state-of-the-art by $16.17%$ and $55.26%$, respectively. In particular, it shows good capabilities in grasping objects with severe missing data. It also demonstrates good generality in handling open-set categories and tasks.
News
- [ 2026.01 ] We release the code for grasp generation!
Setup
-
Create a new
condaenvironemnt and activate it.conda create -n 3d python=3.8 conda activate tosc -
Install dependent libraries with
pip.pip install -r pre-requirements.txt pip install -r requirements.txt- We use
pytorch1.11andcuda11.3, modifypre-requirements.txtto install other versions ofpytorch.
- We use
-
Install Isaac Gym and install pointnet2 by executing the following command.
pip install git+https://github.com/daveredrum/Pointnet2.ScanNet.git#subdirectory=pointnet2More dependencies and instructions can be found in the instructions.
-
Get MANO asset Get the MANO hand model mano_v1_2.zip from the MANO website.
- click Download on the top menu, this requires register & login.
- on the Download page, navigate to Models & Code section, and click Models & Code.
Unzip mano_v1_2.zip and copy it into the assets folder.
Data
You can process the data by yourself following the instructions.
Train
bash scripts/TOSC/train.sh ${EXP_NAME}
Test (Quantitative Evaluation)
bash scripts/TOSC/test.sh ${CKPT}
Sample (Qualitative Visualization)
bash scripts/TOSC/sample.sh ${CKPT}
Acknowledgments
Some codes are borrowed from stable-diffusion, PSI-release, Pointnet2.ScanNet, point-transformer, diffuser, Scene-diffuser, and occo.
Related Skills
node-connect
349.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.0kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
