INDEX
Explanations
references to the number "two" in various contexts
two of something
Negative Logits
itſelf      -1.04
Jefus       -1.02
himſelf     -0.98
houſe       -0.93
Cæsar       -0.92
fubject     -0.91
ſche        -0.91
ſtate       -0.91
myſelf      -0.91
Shakspeare  -0.89
Positive Logits
two        0.92
Two        0.79
big        0.71
dua        0.68
abetes     0.66
NameInMap  0.64
två        0.64
main       0.64
Zwei       0.63
两         0.62
Activations Density: 0.229%