The local_attn_mask was not being tiled across temporal dimensions, causing assertion errors in streaming mode and wrong masks otherwise. Match naxci1 reference: 4D tile/rearrange for Q/K temporal windows, chunk-based score computation, and topk<=0 guard. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>