INDEX
    Explanations

    expressions of emotional distress and pleas for assistance or understanding

    New Auto-Interp
    Negative Logits
    prit
    -0.18
    ziej
    -0.17
    suz
    -0.16
    AMPLE
    -0.15
    λε
    -0.15
    abay
    -0.15
    criptor
    -0.14
    issor
    -0.14
    awner
    -0.14
    erk
    -0.14
    POSITIVE LOGITS
    ly
    0.70
    ÑģÑı
    0.44
    ian
    0.43
    ic
    0.42
    ity
    0.40
    Ø©
    0.39
    ed
    0.37
    ive
    0.36
    al
    0.34
    theless
    0.34
    Act Density 1.927%

    No Known Activations