INDEX
Explanations
mentions of "Donald Trump."
mentions of the name "Donald" in various contexts
New Auto-Interp
Negative Logits
-->
-0.78
ulla
-0.72
CI
-0.69
XP
-0.69
"}],"
-0.67
labs
-0.66
iage
-0.64
corp
-0.62
Parameter
-0.62
meters
-0.61
POSITIVE LOGITS
Donald
3.53
Donald
2.79
Trump
2.09
Trump
1.90
Hillary
1.89
donald
1.84
Barack
1.83
TRUMP
1.82
Ivanka
1.77
Melania
1.75
Activations Density 0.014%