INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gangbang
    -0.07
     شار
    -0.07
     чай
    -0.06
     wavelengths
    -0.06
    ("\"
    -0.06
     šk
    -0.06
    vertiser
    -0.06
    rain
    -0.06
     whistle
    -0.06
     yol
    -0.06
    POSITIVE LOGITS
     surrounding
    0.06
    ContextMenu
    0.06
    -effect
    0.06
     статті
    0.06
     повіт
    0.06
     CONSEQUENTIAL
    0.06
     Arts
    0.06
    blank
    0.06
    امج
    0.06
    ’nin
    0.06
    Act Density 0.007%

    No Known Activations