INDEX
    Explanations

    pronouns, particularly "it" and "I"

    New Auto-Interp
    Negative Logits
    :✨
    -0.67
     autorytatywna
    -0.60
    }$​
    -0.57
     iconLine
    -0.54
    KEYCODE
    -0.49
     Autorisations
    -0.47
     Италијани
    -0.45
     éter
    -0.45
     desmotivaciones
    -0.45
    ׃
    -0.44
    POSITIVE LOGITS
     theyre
    0.46
    they
    0.45
     he
    0.45
     youre
    0.42
     dunno
    0.40
    Diwedd
    0.39
    repository
    0.39
     اللي
    0.38
     pues
    0.38
    -
    0.38
    Act Density 0.079%

    No Known Activations