INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     segurança
    -0.08
     seguridad
    -0.08
     Went
    -0.08
     Shooter
    -0.08
     güven
    -0.07
     Kung
    -0.07
     sikker
    -0.07
     sicurezza
    -0.07
    安全
    -0.07
     Security
    -0.07
    POSITIVE LOGITS
     sadness
    0.15
     sorrow
    0.13
    0.13
     melanch
    0.13
     melancholy
    0.12
     despair
    0.12
     दु�
    0.12
     mourning
    0.12
     traurig
    0.12
    0.12
    Act Density 0.050%

    No Known Activations