You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
docs: clarify VNNI dispatch tiers — F32x16 is the floor, no scalar on x86
avx512vnni (64 MACs) and avxvnniint8 (32 MACs) are mutually exclusive
by hardware generation. The scalar i32 path in matvec_dispatch only
exists for non-x86 correctness. On x86, the thinking engine dispatches
to F32x16 FMA (16 MACs) when no VNNI is detected — never reaches
the scalar path.
https://claude.ai/code/session_01ChLvBfpJS8dQhHxRD4pYNp
#83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.