INDEX
Explanations
variations of the suffix "ers," indicating the presence of agents or doers in the text
New Auto-Interp
Negative Logits
uddle
-0.16
.nlm
-0.16
Replies
-0.16
sockfd
-0.15
gien
-0.14
ousel
-0.14
odÃŃ
-0.14
ogle
-0.14
ocities
-0.14
owski
-0.14
POSITIVE LOGITS
etro
0.15
ACE
0.15
ACE
0.15
ipher
0.15
нÑĸв
0.14
innie
0.14
Renderer
0.13
waters
0.13
asa
0.13
ató
0.13
Activations Density 0.068%