Because you might want VLAN. Plus you cant just start blasting reply without a preamble, so its still 14 bytes after receiving just the dest MAC. Then you get potential IFG, FEC, scrambling, it all adds up, no way any switch can do 4ns without heavy lawyer talk in the small print.
IIRC. cut-through only needs the first 6 bytes. Since it only needs the destination address for the port lookup.
Potentially the first bit, on broadcast.