INDEX
Negative Logits
önem
-0.07
trait
-0.07
isten
-0.06
TRA
-0.06
а
-0.06
scroll
-0.06
erver
-0.06
uali
-0.06
öden
-0.06
کند
-0.06
POSITIVE LOGITS
Mart
0.06
_INT
0.06
imperialism
0.06
manners
0.06
Saint
0.06
_chart
0.06
abs
0.06
которые
0.06
.Le
0.06
Season
0.06
Activations Density 0.020%