INDEX
Negative Logits
ніципа           -0.73
*/;              -0.71
$_"              -0.70
]';              -0.69
'][$             -0.68
}}$}             -0.67
disambiguazione  -0.66
")));            -0.66
Autoritní        -0.66
ſeveral          -0.66
POSITIVE LOGITS
BASELINE         0.48
칼                0.47
jelas            0.47
cal              0.47
op               0.45
-                0.45
sure             0.44
張り               0.43
planten          0.43
fond             0.42
Activations Density: 0.024%