INDEX
Explanations
references to governmental or official positions and titles
New Auto-Interp
Negative Logits
featureID
-0.90
increí
-0.88
estekak
-0.88
desmotivaciones
-0.82
Geſch
-0.78
oa̍t
-0.77
Савезне
-0.76
JpaRepository
-0.76
Wikimedijinoj
-0.76
Autoritní
-0.76
POSITIVE LOGITS
minister
0.59
Ed
0.54
pp
0.54
0.49
Sector
0.49
Â
0.49
minister
0.47
Ed
0.46
H
0.45
-
0.45
Activations Density 0.291%