INDEX
Explanations
mentions related to former political figures
references to political figures or their former positions
New Auto-Interp
Negative Logits
otle
-0.80
ramid
-0.79
anguage
-0.78
iquette
-0.73
utterstock
-0.73
aris
-0.71
achus
-0.71
oking
-0.70
uden
-0.70
ourced
-0.70
POSITIVE LOGITS
Yugoslav
1.04
Yugoslavia
1.03
President
0.95
presidents
0.93
president
0.93
Soviet
0.93
Presidents
0.92
colleague
0.91
classmate
0.88
classmates
0.87
Activations Density 0.040%