Explanations
mentions of former positions or titles
the term "Former" followed by a title or role
Negative Logits
Token       Logit
otle        -0.94
anguage     -0.84
achus       -0.82
antics      -0.78
acht        -0.72
andise      -0.72
okin        -0.72
matic       -0.72
aceae       -0.71
thora       -0.70
Positive Logits
Token       Logit
Yugoslav    0.99
Yugoslavia  0.96
President   0.84
Soviet      0.83
presidents  0.83
Deputy      0.78
president   0.78
colleague   0.78
bies        0.76
captives    0.76
Activations Density 0.035%