reference-resolution

Here is 1 public repository matching this topic...

wearable-assistant-context-bench

A benchmark for measuring whether multimodal assistants update to the current context instead of staying anchored to prior context. It includes 50 scenarios, uses a three-channel design (audio, camera, ground truth), and defaults to a cross-family LLM-as-judge.

  • Updated Apr 28, 2026
  • Python
