INDEX
Explanations
collective pronouns referring to a group or community
Negative Logits
I          -0.60
am         -0.42
do         -0.41
Mayer      -0.41
m          -0.35
sais       -0.35
cal        -0.34
(blank)    -0.33
[          -0.33
<eos>      -0.33
Positive Logits
themſelves        1.05
CloseOperation    1.05
himſelf           1.04
$_"               1.03
itſelf            1.02
Shaksp            1.01
OGND              0.99
Shakspeare        0.99
ſeveral           0.97
NameInMap         0.96
Activations Density 0.121%