INDEX
Explanations
government officials, political figures, and world leaders
New Auto-Interp
Negative Logits
anwhile
-0.77
oldemort
-0.69
wark
-0.66
pel
-0.65
kson
-0.65
ascade
-0.64
ologies
-0.64
ngth
-0.62
chalk
-0.62
anners
-0.61
POSITIVE LOGITS
ministerial
1.14
Minister
1.13
minister
1.09
ministers
0.97
Ministers
0.92
knit
0.90
val
0.89
etime
0.78
Rib
0.71
ition
0.70
Activations Density 1.736%