INDEX
    Explanations

    phrases related to political figures and actions

    punctuation marks, specifically commas

    New Auto-Interp
    Negative Logits
    igl
    -0.62
    ãĥ¥
    -0.60
    jad
    -0.59
     guid
    -0.57
    Russ
    -0.56
     hous
    -0.56
    nexpected
    -0.55
     aut
    -0.54
    nai
    -0.53
    ilers
    -0.53
    POSITIVE LOGITS
     respectively
    0.74
    issy
    0.72
    ]).
    0.72
    DragonMagazine
    0.67
    udeb
    0.66
    ]]
    0.66
    ]),
    0.66
    )).
    0.66
    "))
    0.63
    auntlet
    0.63
    Act Density 0.578%

    No Known Activations