INDEX
    Explanations

    personal pronouns, particularly variations of "I."

    New Auto-Interp
    Negative Logits
    uen
    -0.16
    AWN
    -0.15
     champ
    -0.15
    uish
    -0.15
     Champ
    -0.15
    ngth
    -0.14
    robat
    -0.14
    ordes
    -0.14
    vide
    -0.14
    uir
    -0.14
    POSITIVE LOGITS
    ÃŃsto
    0.15
    137
    0.15
    itas
    0.14
    RGBA
    0.14
    rosse
    0.14
     Apost
    0.14
    134
    0.14
    esium
    0.13
    347
    0.13
    117
    0.13
    Act Density 0.050%

    No Known Activations