Attention is All You Need
transformer • attention • 2024
The seminal paper introducing the Transformer architecture for natural language processing.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
transformer • BERT • 2024
A groundbreaking paper on BERT, a transformer-based model for natural language understanding.