INDEX
Explanations
references to other individuals or entities in various contexts
Negative Logits
Efq            -0.72
himſelf        -0.63
ündig          -0.59
ulcers         -0.58
Grot           -0.57
colorPrimary   -0.57
Progres        -0.56
andExpect      -0.55
Theſe          -0.54
<_>            -0.54
Positive Logits
than            0.61
himo            0.60
outros          0.59
principalTable  0.58
elsewhere       0.58
Outras          0.58
########.       0.57
Elsewhere       0.57
Others          0.57
other           0.57
Activations Density: 0.507%