INDEX
Explanations
personal pronouns referring to the speaker, specifically 'I'
instances of the pronoun "I"
Negative Logits
Alternative   -0.64
eworthy       -0.63
mutants       -0.61
Uriel         -0.60
uniforms      -0.59
Rolls         -0.58
Consortium    -0.57
Gale          -0.56
Bottom        -0.54
Sidney        -0.54
Positive Logits
'm            1.70
am            1.26
've           1.25
myself        1.04
suppose       1.00
'd            1.00
RL            1.00
verson        0.97
ggy           0.97
zzo           0.94
Activations Density 0.244%