INDEX
    Explanations

    expressions of personal experience or inner thoughts

    first-person singular pronouns indicating personal experiences or feelings

    New Auto-Interp
    Negative Logits
    INGTON
    -0.61
     Kelvin
    -0.58
     Philipp
    -0.58
     Shelby
    -0.55
     Vald
    -0.54
     Alternative
    -0.54
     Ep
    -0.53
     Jarrett
    -0.53
     Aberdeen
    -0.53
    minster
    -0.53
    POSITIVE LOGITS
    'm
    1.41
    've
    1.30
     suppose
    1.20
    'll
    1.19
    'd
    1.08
     guess
    1.01
    ggy
    0.96
    rises
    0.92
    RL
    0.89
    ANA
    0.89
    Act Density 0.388%

    No Known Activations