INDEX
Explanations
first-person singular pronouns
New Auto-Interp
Negative Logits
ady
-0.17
azy
-0.17
%%%%%%%%%%%%%%%%
-0.16
ä¹Ī
-0.15
ysa
-0.15
pane
-0.15
cab
-0.14
adies
-0.14
lier
-0.14
oen
-0.14
POSITIVE LOGITS
andel
0.18
á»įng
0.14
WI
0.14
jsonp
0.14
acman
0.13
etten
0.13
á»ĵn
0.13
regor
0.13
a
0.13
ATRIX
0.13
Activations Density 0.162%