INDEX
Explanations
names of political figures and officials
names of key political figures and advisors
Negative Logits
Collective   -0.83
Kop          -0.78
Jagu         -0.72
juries       -0.69
Century      -0.67
Squid        -0.67
vag          -0.65
denomin      -0.64
à¨           -0.63
Viking       -0.63
Positive Logits
aide         1.01
briefings    1.00
enei         1.00
eln          0.97
confid       0.97
briefed      0.93
Lavrov       0.92
memos        0.91
aides        0.91
andowski     0.89
Activations Density: 0.055%