INDEX
Explanations
references to historical or prominent figures named "I"
occurrences of the pronoun "I"
Negative Logits
Token             Logit
tongues           -0.65
electromagnetic   -0.63
Kelvin            -0.62
optics            -0.61
gum               -0.61
noses             -0.61
terday            -0.59
Rebels            -0.59
heads             -0.58
Cutter            -0.57
Positive Logits
Token             Logit
'm                1.34
've               1.08
pec               0.99
ANA               0.98
AAF               0.98
MAX               0.98
UC                0.97
verson            0.95
ago               0.95
'll               0.94
Activations Density: 0.254%