INDEX
    Explanations

    expressions of kindness and compassion towards others

    New Auto-Interp
    Negative Logits
    iron
    -0.16
     hairs
    -0.15
    aze
    -0.14
    etest
    -0.14
    akov
    -0.14
     TRI
    -0.14
    .PropertyType
    -0.14
    길
    -0.14
    à¹Ģà¸ķ
    -0.14
    atron
    -0.14
    POSITIVE LOGITS
    imity
    0.16
     Obr
    0.15
    LinkId
    0.15
    γμα
    0.14
     Cunning
    0.14
    398
    0.14
     tun
    0.14
    raki
    0.14
    ahr
    0.14
    _argument
    0.14
    Act Density 0.319%

    No Known Activations