INDEX
Explanations
the names of political figures and positions
references to former political leaders and officials
New Auto-Interp
Negative Logits
verse
-0.62
Located
-0.61
EngineDebug
-0.61
Christmas
-0.59
Accessed
-0.58
Tuesday
-0.58
Monday
-0.57
endpoint
-0.57
Limited
-0.56
fingert
-0.56
POSITIVE LOGITS
Saddam
0.86
Rudy
0.84
Fidel
0.84
disgr
0.83
Newt
0.83
Boris
0.78
Gore
0.77
Mikhail
0.74
Ferdinand
0.74
Alan
0.73
Activations Density 0.123%