INDEX
    Explanations

    expressions of personal opinions and judgments about interpersonal relationships

    New Auto-Interp
    Negative Logits
    eten
    -0.15
    =ax
    -0.15
    šil
    -0.15
    uish
    -0.14
    stad
    -0.14
    гл
    -0.14
    wand
    -0.14
     @$_
    -0.14
    clipse
    -0.14
    ijo
    -0.14
    POSITIVE LOGITS
     her
    0.16
    954
    0.15
    efon
    0.15
    782
    0.14
    Timestamp
    0.14
     jams
    0.13
    opaque
    0.13
     bo
    0.13
    ãĤĿ
    0.13
    amin
    0.13
    Act Density 0.542%

    No Known Activations