attention mechanism, self-attention, BERT, transformer architecture
Press ? anytime to show this help