INDEX
    Explanations

    references to the concept of attention in various contexts

    New Auto-Interp
    Negative Logits
     précédents
    -0.67
     maestra
    -0.67
     helst
    -0.66
    rabh
    -0.65
     Füße
    -0.64
     perfección
    -0.63
     difesa
    -0.62
     Feinde
    -0.61
     maux
    -0.61
    iegler
    -0.60
    POSITIVE LOGITS
     attention
    2.80
     Attention
    2.56
    attention
    2.35
    Attention
    2.29
     ATTENTION
    2.24
     attentions
    1.96
    ATTENTION
    1.88
     aten
    1.54
     atención
    1.52
     atenção
    1.46
    Act Density 0.039%

    No Known Activations