INDEX
Explanations
references to government officials and related political discussions
New Auto-Interp
Negative Logits
rez
-0.18
lef
-0.17
âu
-0.17
sal
-0.17
board
-0.15
pa
-0.15
usal
-0.15
akra
-0.14
body
-0.14
ToLeft
-0.14
POSITIVE LOGITS
Shadow
0.18
portfolios
0.17
Assistant
0.17
Shadow
0.16
Senator
0.16
оÑĪ
0.16
Assistant
0.16
’Ñı
0.15
Portfolio
0.15
greg
0.15
Activations Density 0.016%