INDEX
Negative Logits
descriptive
-0.09
senator
-0.08
immune
-0.08
되어
-0.08
transitional
-0.08
agli
-0.08
wine
-0.08
gemstone
-0.08
senators
-0.08
ladı
-0.08
POSITIVE LOGITS
treadmill
0.10
sob
0.09
mph
0.08
坡
0.08
jah
0.08
hous
0.08
SS
0.07
бег
0.07
不停
0.07
dinámica
0.07
Activations Density 0.002%