INDEX
    Explanations

    terms related to emotional awareness and expression

    New Auto-Interp
    Negative Logits
    innen
    -0.17
    asant
    -0.16
    ض
    -0.16
    inition
    -0.15
     Cumhuriyeti
    -0.15
    ourg
    -0.15
    werk
    -0.14
    æŀ¶
    -0.14
    estyle
    -0.14
    otas
    -0.14
    POSITIVE LOGITS
    /em
    0.20
    ÑĨионалÑĮ
    0.18
    ãģ¾ãģ¾
    0.18
     charged
    0.18
    nel
    0.16
     emotional
    0.16
    ized
    0.16
    eel
    0.16
     roller
    0.16
     attachment
    0.16
    Act Density 0.024%

    No Known Activations