INDEX
Explanations
mentions of the political figure "Trump"
mentions of the name "Trump."
New Auto-Interp
Negative Logits
NAD
-0.76
iencies
-0.73
ĨĴ
-0.72
Flavoring
-0.71
actionGroup
-0.71
++++
-0.69
SAM
-0.67
itial
-0.65
CentOS
-0.64
ä¸ī
-0.64
POSITIVE LOGITS
supporters
0.95
care
0.95
TRUMP
0.91
Trump
0.90
Care
0.82
Donald
0.81
Supporters
0.80
impeachment
0.79
Trump
0.79
eting
0.78
Activations Density 0.053%