INDEX
Explanations
the word "former" followed by a noun, potentially indicating a comparison or transition
references to individuals who previously held specific positions or titles
New Auto-Interp
Negative Logits
oked
-0.82
ographed
-0.79
andise
-0.77
ourced
-0.77
lied
-0.74
opped
-0.71
anche
-0.70
antics
-0.70
owitz
-0.70
anguage
-0.70
POSITIVE LOGITS
Yugoslavia
1.23
Yugoslav
1.08
Soviet
0.92
KGB
0.83
USSR
0.83
smoker
0.81
employer
0.78
president
0.75
communist
0.74
convict
0.73
Activations Density 0.032%