Explanations
content related to political promises and actions of leaders
Negative Logits
Token        Logit
ensor        -0.15
express      -0.14
ÃŃr          -0.14
ENSOR        -0.14
gem          -0.13
اراÙĨ        -0.13
LLU          -0.13
marg         -0.13
pedia        -0.13
.selenium    -0.13
Positive Logits
Token        Logit
cabinet      0.20
Cabinet      0.19
White        0.18
White        0.17
abinet       0.16
.nih         0.15
Executive    0.14
Work         0.14
izi          0.13
ioxide       0.13
Activations Density 0.182%