    Explanations

    the presence of specific personal pronouns and proper nouns in the text

    Negative Logits
    Token          Logit
    "uppe"         -0.16
    "enne"         -0.15
    "ForMember"    -0.14
    "andal"        -0.14
    "uplic"        -0.14
    " Drive"       -0.14
    "211"          -0.13
    " Aj"          -0.13
    " beck"        -0.13
    "ITICAL"       -0.13

    Positive Logits
    Token          Logit
    "#"            0.17
    "igham"        0.17
    "åħĥ"          0.15
    "vido"         0.14
    "ervo"         0.13
    "bsite"        0.13
    "ynom"         0.13
    "tica"         0.13
    "Cog"          0.13
    "amarin"       0.13

    Activation Density: 0.048%

    No Known Activations