INDEX
    Explanations

    references to self-identity and personal reflection

    New Auto-Interp
    Negative Logits
     auroit
    -0.43
     pacchetto
    -0.43
     грн
    -0.40
    océan
    -0.40
     aceptas
    -0.40
     tuyến
    -0.39
     corruption
    -0.39
    بوابة
    -0.39
     anúncio
    -0.39
     ladr
    -0.38
    POSITIVE LOGITS
    herself
    0.98
    selves
    0.93
    himself
    0.89
     Himself
    0.88
     Myself
    0.88
    Myself
    0.88
    Yourself
    0.88
     Yourself
    0.87
    myself
    0.85
     selves
    0.83
    Act Density 0.058%

    No Known Activations