INDEX
Negative Logits
fondi
-0.86
ağı
-0.85
centrales
-0.85
gräns
-0.82
pitié
-0.81
centrale
-0.79
cardiaque
-0.79
central
-0.76
condiv
-0.75
biór
-0.75
POSITIVE LOGITS
Middle
0.74
Middles
0.73
Sidd
0.72
dle
0.71
Middle
0.69
uppo
0.68
MIDDLE
0.67
MIDDLE
0.66
Toma
0.66
ROW
0.66
Activations Density 0.007%