INDEX
    Explanations

    instances of the phrase "pay attention."

    New Auto-Interp
    Negative Logits
    _CAPACITY
    -0.15
    .dw
    -0.15
    jedn
    -0.15
    xba
    -0.15
    ADVERTISEMENT
    -0.14
    .kode
    -0.14
    âĢŀJ
    -0.14
    olars
    -0.14
    oux
    -0.14
    anchise
    -0.14
    POSITIVE LOGITS
    811
    0.15
    610
    0.15
    395
    0.15
     Listening
    0.15
    nid
    0.14
     trad
    0.13
    assi
    0.13
    aller
    0.13
    nings
    0.13
    otherapy
    0.13
    Act Density 0.013%

    No Known Activations