transformers (2)

Modernizing GPT-2: A 3.1x Throughput Leap with 2025 Optimizations
Dec 21, 2025

Generative Retrieval: Scaling PLUM with Hierarchical Semantic IDs
Nov 24, 2025