latency optimizations - adds AoT compilation & FA3 for faster inference [WIP - don't merge yet] 9b43057 verified linoyts HF Staff commited on Sep 23, 2025