INDEX
Explanations
mentions of the name "Donald Trump"
mentions of Donald Trump
Negative Logits
peer        -0.67
otherwise   -0.66
session     -0.65
such        -0.65
gear        -0.64
squad       -0.63
ams         -0.62
similarly   -0.62
labs        -0.61
even        -0.61
Positive Logits
Donald            3.87
donald            2.19
Donald            2.15
Trump             1.94
Hillary           1.88
TRUMP             1.83
DonaldTrump       1.78
Bernie            1.66
trump             1.63
realDonaldTrump   1.59
Activations Density: 0.016%