Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification