INDEX
NEGATIVE LOGITS
Barrett     -0.07
domácí      -0.07
ctrl        -0.06
_life       -0.06
.proto      -0.06
yönetim     -0.06
DM          -0.06
decis       -0.06
dostup      -0.06
ayım        -0.06
POSITIVE LOGITS
sugar         0.10
Sugar         0.07
uctose        0.07
intersection  0.06
Cou           0.06
발            0.06
touch         0.06
pleasant      0.06
ose           0.06
Jug           0.06
ACTIVATIONS DENSITY: 0.010%