INDEX
Negative Logits
prelim
-0.07
centres
-0.07
($("#-0.06
%.
-0.06
movies
-0.06
winner
-0.06
582
-0.06
-Tr
-0.06
res
-0.06
Gym
-0.06
POSITIVE LOGITS
appropriate
0.13
appropriately
0.09
appropriate
0.09
uygun
0.07
inappropriate
0.07
quired
0.07
śli
0.07
apat
0.07
fte
0.07
chap
0.07
Activations Density 0.028%