INDEX
    Explanations

    phrases indicating personal achievement or identity

    New Auto-Interp
    Negative Logits
    ÑŁ
    -0.15
     Representation
    -0.14
    aeda
    -0.14
    견
    -0.13
    87
    -0.13
    IMS
    -0.13
    -vis
    -0.13
    eln
    -0.13
     representation
    -0.13
    urge
    -0.13
    POSITIVE LOGITS
    anders
    0.15
    ırak
    0.14
    äter
    0.14
    $LANG
    0.14
    apter
    0.14
    estroy
    0.14
    solete
    0.14
    ANGES
    0.13
    orem
    0.13
     Candid
    0.13
    Act Density 0.000%

    No Known Activations