INDEX
Explanations
the names of various famous people or titles of events
New Auto-Interp
Negative Logits
otle
-0.81
anguage
-0.80
ramid
-0.74
uden
-0.73
aris
-0.73
antics
-0.72
ghai
-0.71
iquette
-0.70
utterstock
-0.70
oked
-0.69
POSITIVE LOGITS
Yugoslav
1.12
Yugoslavia
1.09
Soviet
0.99
classmate
0.94
president
0.93
colleague
0.91
employee
0.89
President
0.89
comrade
0.88
presidents
0.85
Activations Density 0.551%