reference-resolution

Here is 1 public repository matching this topic...

n-dryer / wearable-assistant-context-bench

A benchmark for measuring whether multimodal assistants update to current context instead of staying anchored to prior context. 50 scenarios, three channel design (audio, camera, ground truth), cross family LLM as judge by default.

benchmark machine-learning evaluation-framework multimodal context-tracking vision-language ai-assistant human-ai-interaction llm-evaluation wearable-ai reference-resolution product-driven

Updated Apr 28, 2026
Python

Improve this page

Add a description, image, and links to the reference-resolution topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reference-resolution topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reference-resolution

Here is 1 public repository matching this topic...

n-dryer / wearable-assistant-context-bench

Improve this page

Add this topic to your repo