INDEX
Explanations
the word "and" in various forms and contexts
New Auto-Interp
Negative Logits
itſelf
-0.92
Efq
-0.89
Jefus
-0.86
Majefty
-0.85
purpoſe
-0.85
himſelf
-0.84
ſelf
-0.84
fubject
-0.83
poffe
-0.80
poffible
-0.79
POSITIVE LOGITS
And
1.28
Và
1.23
And
1.13
AND
1.12
AND
1.11
and
0.96
그리고
0.91
và
0.86
+"&
0.85
\&
0.84
Activations Density 0.240%