INDEX
Negative Logits
ongoing
-0.08
.*;↵↵/
-0.07
possibly
-0.07
য়ের
-0.07
thematic
-0.07
,
-0.07
pasos
-0.07
submitted
-0.07
color
-0.07
a
-0.07
POSITIVE LOGITS
Worse
0.10
worse
0.10
'autant
0.09
pire
0.08
exacerb
0.08
Prin
0.08
Pond
0.08
downright
0.08
尤
0.08
_triangle
0.08
Activations Density 0.100%