Fix generate() when tokenizer is unset and add regression tests #1267

Open

DityaChawla wants to merge 1 commit into TransformerLensOrg:dev from DityaChawla:fix/generate-no-tokenizer-483

Conversation

@DityaChawla

Summary

Fixes HookedTransformer.generate() so that the code paths which do not actually require a tokenizer work when model.tokenizer is unset.

Fixes #483.

Changes

  • removed the unconditional tokenizer assertion in generate()
  • updated get_attention_mask to accept tokenizer=None (see the sketch after this list)
  • added regression tests for tokenizer-free generation
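
A minimal sketch of the tokenizer=None handling in get_attention_mask (simplified; the real utility also deals with prepend_bos and padding sides, so treat this as an illustration rather than the exact diff):

```python
import torch


def get_attention_mask(tokenizer, tokens: torch.Tensor, prepend_bos: bool) -> torch.Tensor:
    # With no tokenizer there is no pad_token_id to compare against,
    # so every position is treated as a real token (all-ones mask).
    if tokenizer is None or tokenizer.pad_token_id is None:
        return torch.ones_like(tokens)
    # Otherwise mask out positions that hold the pad token, as before.
    return tokens.ne(tokenizer.pad_token_id).int()
```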

Validation

  • verified tokenizer-free generation works with stop_at_eos=False
  • verified tokenizer-free generation works with explicit eos_token_id
  • added regression coverage in tests/unit/test_generate_no_tokenizer.py (sketched below)
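
A rough sketch of what that regression coverage exercises (illustrative only, not the exact contents of the test file; "solu-1l" stands in for any small pretrained model, and eos_token_id=0 is an arbitrary choice):

```python
import torch

from transformer_lens import HookedTransformer


def test_generate_without_tokenizer():
    model = HookedTransformer.from_pretrained("solu-1l")
    model.tokenizer = None  # simulate a model loaded without a tokenizer

    tokens = torch.tensor([[1, 2, 3]])

    # Path 1: no EOS handling at all, so no tokenizer is needed.
    out = model.generate(tokens, max_new_tokens=2, stop_at_eos=False)
    assert out.shape[-1] == tokens.shape[-1] + 2

    # Path 2: the caller supplies eos_token_id explicitly instead of
    # relying on tokenizer.eos_token_id, so generation may stop early.
    out = model.generate(tokens, max_new_tokens=2, eos_token_id=0)
    assert tokens.shape[-1] < out.shape[-1] <= tokens.shape[-1] + 2
```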

@jlarson4 jlarson4 changed the base branch from main to dev April 23, 2026 16:53
@jlarson4
Collaborator

Hi @DityaChawla! Thanks for taking on this change. A couple things before we can get this merged:

  • This is based on an older version of TransformerLens; could you please pull down the latest version of dev and apply this change there? (get_attention_mask now lives at transformer_lens/utilities/tokenize_utils.py, around line 229.)
  • Currently the function body calls tokens.ne(tokenizer.pad_token_id), which crashes when tokenizer is None. There is an if tokenizer is None: return attention_mask early return (an all-ones mask) at lines 249-250, but it sits behind a check that is only reached when tokenizer is truthy, so the None guard needs to be hoisted above the pad_token_id access.
  • Add one test with use_past_kv_cache=True set explicitly and another with use_past_kv_cache=False to prove that both decoding paths work (see the sketch after this list).
  • generate() calls several utilities that may assume a tokenizer. It is worth searching for self.tokenizer. in the generate method and verifying that each use is guarded.
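
A sketch of what the use_past_kv_cache test could look like (hypothetical; the model name and assertions are illustrative and reuse the tokenizer-free setup described in the PR):

```python
import pytest
import torch

from transformer_lens import HookedTransformer


@pytest.mark.parametrize("use_past_kv_cache", [True, False])
def test_generate_without_tokenizer_kv_cache(use_past_kv_cache):
    model = HookedTransformer.from_pretrained("solu-1l")
    model.tokenizer = None  # tokenizer-free setup under test

    tokens = torch.tensor([[1, 2, 3]])
    out = model.generate(
        tokens,
        max_new_tokens=2,
        stop_at_eos=False,
        use_past_kv_cache=use_past_kv_cache,
    )
    # Both the cached and uncached decoding loops should append
    # exactly max_new_tokens new positions.
    assert out.shape[-1] == tokens.shape[-1] + 2
```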


Development

Successfully merging this pull request may close these issues.

[Bug Report] HookedTransformer.generate() with model.tokenizer unset gives pad_token_id error
