INDEX
    Explanations

    references to identity and self

    Negative Logits
    Token            Logit
    "ValueStyle"     -0.76
    "endphp"         -0.76
    " الحره"         -0.73
    " noqa"          -0.72
    "#+#"            -0.66
    " للاسماء"       -0.64
    "NameInMap"      -0.63
    " дописавши"     -0.63
    "findpost"       -0.62
    " Enix"          -0.61

    Positive Logits
    Token            Logit
    " zosta"         0.55
    " nime"          0.53
    " nuage"         0.51
    " called"        0.51
    " čierna"        0.51
    " Divulgação"    0.50
    " keber"         0.49
    "heça"           0.49
    " nennen"        0.49
    "dini"           0.49

    Activation Density: 0.168%

    No Known Activations