Explanations
references to historical events and political changes
Negative Logits
Token         Logit
ihad          -0.17
adal          -0.16
illac         -0.16
olio          -0.14
angi          -0.14
シア          -0.14
processable   -0.14
hangi         -0.14
aro           -0.14
disposing     -0.14
Positive Logits
Token         Logit
East          0.37
East          0.31
EAST          0.28
communist     0.27
Soviet        0.27
Communist     0.27
-East         0.26
DDR           0.25
東            0.24
east          0.24
Activations Density 0.074%