INDEX
Explanations
direct quotations in text
quotation marks and direct speech in text
New Auto-Interp
Negative Logits
nod
-0.69
lair
-0.63
nodd
-0.62
rundown
-0.61
fray
-0.61
salute
-0.60
Versus
-0.60
alias
-0.58
playbook
-0.57
lled
-0.57
POSITIVE LOGITS
there
1.27
nob
1.15
they
1.06
someone
1.05
everyone
1.02
these
1.01
many
1.01
when
0.94
every
0.93
despite
0.91
Activations Density 0.217%