INDEX
    Explanations

    gerunds and actions involving movement or manipulation

    New Auto-Interp
    Negative Logits
    ilon
    -0.15
    fak
    -0.15
    erap
    -0.14
    ÙĦاÙģ
    -0.14
    alion
    -0.14
    isma
    -0.13
    çħ
    -0.13
    voie
    -0.13
    lesen
    -0.13
    ncia
    -0.13
    POSITIVE LOGITS
     around
    1.16
    around
    1.03
     Around
    1.00
    Around
    0.96
    -around
    0.88
     autour
    0.75
     вокÑĢÑĥг
    0.60
     kolem
    0.59
     ØŃÙĪÙĦ
    0.49
     около
    0.47
    Act Density 0.185%

    No Known Activations