INDEX
Explanations
mentions of the term "Trump" at various levels of activation strength
mentions of the name "Trump."
Negative Logits
iencies      -0.70
NAD          -0.66
ASA          -0.66
CentOS       -0.66
actionGroup  -0.64
SAM          -0.64
STATS        -0.62
Mortal       -0.62
RED          -0.61
Emacs        -0.61
Positive Logits
care         1.06
supporters   0.93
eting        0.92
Tower        0.91
Jr           0.89
eters        0.88
Care         0.86
eter         0.86
ism          0.82
TRUMP        0.82
Activations Density: 0.061%
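The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of token positions on which the feature's activation is nonzero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which
    the feature fires at all. The source page does not define the term.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, 6 of which activate the feature.
acts = [0.0] * 10_000
for i in (5, 120, 999, 4000, 7321, 9998):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.061% would mean the feature fires on roughly 6 of every 10,000 tokens sampled, which fits a feature tied to a single name.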