INDEX
    Explanations

    references to self-directed actions or self-awareness

    New Auto-Interp
    Negative Logits
     self
    -0.79
     Self
    -0.69
     SELF
    -0.63
    Self
    -0.61
     itself
    -0.57
    self
    -0.55
     Selbst
    -0.53
     zelf
    -0.51
    Selbst
    -0.50
    riwal
    -0.47
    POSITIVE LOGITS
     doInBackground
    0.88
     }}"></
    0.84
    ,‎
    0.79
     ویکی‌پدیای
    0.75
    ']?>
    0.73
    Collegamenti
    0.72
    )}</
    0.72
    endphp
    0.71
    مزید
    0.71
    })()
    0.70
    Act Density 0.011%

    No Known Activations