INDEX
Explanations
references to government officials and their roles
New Auto-Interp
Negative Logits
ствие
-0.51
fikation
-0.49
Karriere
-0.46
historie
-0.45
comp
-0.45
писки
-0.42
скі
-0.42
isticated
-0.42
Voraus
-0.42
dası
-0.42
POSITIVE LOGITS
ministers
0.90
Minister
0.85
minister
0.81
Ministers
0.80
ActionCreators
0.80
Minister
0.80
ministres
0.76
ministries
0.70
itſelf
0.69
department
0.69
Activations Density 0.125%