INDEX
Negative Logits
áy
-0.17
ç½
-0.16
íĥķ
-0.14
inges
-0.14
Thu
-0.14
therefore
-0.14
æ¯Ľ
-0.13
irling
-0.12
inson
-0.12
verg
-0.12
POSITIVE LOGITS
Alternatively
0.18
NDER
0.17
upertino
0.17
æĪĸ
0.16
#af
0.16
наÑĢÑĥж
0.16
follow
0.16
also
0.15
Alternatively
0.15
Also
0.15
Activations Density 0.129%