More than A Point: Capturing Uncertainty with Adaptive Affordance Heatmaps for Spatial Grounding in Robotic Tasks

Anonymous Authors
arXiv and Code will be made publicly available upon acceptance.

Motivation

Teaser Image

For ambiguous instructions, our method generates dense heatmaps, while prior works fail with discrete points.

Framework Overview

Framework Overview

The overall architecture of our framework.

Abstract

Many language-guided robotic systems rely on collapsing spatial reasoning into discrete points, making them brittle to perceptual noise and semantic ambiguity. To address this challenge, we propose a framework that represents spatial targets as continuous, adaptive affordance heatmaps. This dense representation captures the uncertainty in spatial grounding and provides richer information for downstream policies, thereby significantly enhancing task success and interpretability. Our method surpasses the previous state-of-the-art on a majority of grounding benchmarks with up to a 50x speed improvement, and achieves an 82% success rate in real-world manipulation. Across extensive simulated and physical experiments, it demonstrates robust performance and shows strong zero-shot generalization to navigation.

Results and Analysis

Our method demonstrates superior qualitative performance on challenging spatial grounding instructions by generating coherent heatmaps for ambiguous regions. It also shows strong zero-shot generalization capabilities across diverse environments, robotic platforms, and even extends to navigation tasks in unseen settings. Quantitatively, our model achieves state-of-the-art performance on a majority of standard benchmarks while achieving a substantial increase in inference speed.

Benchmark Performance

Benchmark Performance

Simulation Environment

simplerenv

Real-World Generalization

real-world

Inference Efficiency

Inference Efficiency

Qualitative Visualization

Qualitative Visualization

Zero-Shot Navigation

Zero-shot Navigation Tasks

Real-World Execution

Place the right banana onto the right plate

Place the right banana onto the right plate.

Put the bitter gourd in the empty area on the left plate

Put the bitter gourd in the empty area on the left plate.

Move the bottle into the dustpan

Move the bottle into the dustpan.

Put the corn into the bottom basket

Put the corn into the bottom basket.

Place the corn in the upper basket

Place the corn in the upper basket.

Transfer the fish to the middle of the board

Move the fish to the middle of the board.

Lay the knife beside the plate

Lay the knife beside the plate.

Put the pepper in the bottom basket

Put the pepper in the bottom basket.

Move the pepper in the empty area on the left plate

Move the pepper in the empty area on the left plate.

Place the spoon to the left of the plate

Place the spoon to the left of the plate.

Put the spoon down near the plate

Put the spoon down near the plate.

Transfer the trash into the dustpan

Move the trash into the dustpan.