INDEX
    Explanations

    occurrences of emotional expressions and interpersonal interactions

    New Auto-Interp
    Negative Logits
    infer
    -0.17
    worthy
    -0.16
    iffer
    -0.16
    elles
    -0.15
    avel
    -0.15
    ovel
    -0.15
    andon
    -0.14
    ellers
    -0.14
    uls
    -0.13
    atty
    -0.13
    POSITIVE LOGITS
    ç»ĵæŀľ
    0.19
     result
    0.19
    çµIJæŀľ
    0.19
     resulted
    0.18
     Result
    0.17
    ãĤ«ãĥ¼
    0.17
     results
    0.17
     Ergebn
    0.16
     resulting
    0.16
     Results
    0.16
    Act Density 0.273%

    No Known Activations