INDEX
Explanations
personal pronouns paired with 'to be' or 'to do'
negations related to the word "it."
New Auto-Interp
Negative Logits
chairs
-0.69
chat
-0.64
forts
-0.63
gra
-0.62
seiz
-0.62
ascript
-0.61
ties
-0.61
hig
-0.60
aeda
-0.59
redes
-0.58
POSITIVE LOGITS
there
0.84
it
0.80
they
0.75
anybody
0.75
thou
0.75
anyone
0.73
everybody
0.69
you
0.68
we
0.65
everyone
0.65
Activations Density 0.098%