INDEX
Negative Logits
Endless
-0.08
(&
-0.08
Father
-0.07
uč
-0.07
()?
-0.07
alisin
-0.07
FU
-0.07
eemald
-0.07
fu
-0.07
Fu
-0.07
POSITIVE LOGITS
workings
0.09
dignity
0.08
خو
0.08
ια
0.08
گاه
0.07
ગી
0.07
concerns
0.07
τε
0.07
differs
0.07
hơn
0.07
Activations Density 0.008%