INDEX
Explanations
references to political parties and their control in government
New Auto-Interp
Negative Logits
obar
-0.16
568
-0.16
enze
-0.15
geb
-0.14
mailbox
-0.14
aller
-0.14
alar
-0.14
irit
-0.14
illi
-0.14
éļ
-0.13
POSITIVE LOGITS
office
0.38
power
0.36
power
0.28
office
0.28
-power
0.26
/power
0.24
-office
0.24
åĬŀåħ¬
0.23
POWER
0.23
Power
0.22
Activations Density 0.075%