INDEX
Explanations
instances of the word "former" preceded by another word
references to entities that were previously significant or relevant
New Auto-Interp
Negative Logits
atown
-0.87
andise
-0.82
otle
-0.81
antics
-0.79
ourced
-0.78
achus
-0.76
anguage
-0.76
aris
-0.75
oked
-0.74
ahon
-0.73
POSITIVE LOGITS
Yugoslavia
1.22
Yugoslav
1.11
Soviet
0.89
occupant
0.78
convict
0.76
smoker
0.73
USSR
0.72
inmate
0.71
soldier
0.70
glory
0.70
Activations Density 0.031%