INDEX
Explanations
references to political figures and their titles
New Auto-Interp
Negative Logits
Peaks
-0.16
Copp
-0.15
Ïĥκ
-0.15
izoph
-0.14
аÑĤаÑĢ
-0.14
opol
-0.14
lius
-0.14
Ã¥de
-0.14
istles
-0.14
arehouse
-0.14
POSITIVE LOGITS
White
0.31
Oval
0.29
White
0.26
President
0.22
presidential
0.22
WHITE
0.22
Presidential
0.20
Secret
0.20
Residence
0.19
president
0.19
Activations Density 0.104%