INDEX
    Explanations

    instances of hypocrisy or contradictions in behavior and beliefs

    New Auto-Interp
    Negative Logits
    StructEnd
    -0.67
     препратки
    -0.67
     autorytatywna
    -0.66
    Expedia
    -0.66
    isome
    -0.64
    providedIn
    -0.63
    Referanser
    -0.62
    CppMethod
    -0.62
    Jeografia
    -0.60
    ########.
    -0.60
    POSITIVE LOGITS
     own
    0.90
     myself
    0.81
     sendiri
    0.73
    自分も
    0.70
     Own
    0.70
     próprio
    0.69
     propia
    0.69
     eigenes
    0.68
     himself
    0.64
     personally
    0.64
    Act Density 0.288%

    No Known Activations