INDEX
Explanations
mentions of the word "trump" followed by a suffix such as "-ed" or "-ing", possibly associated with actions or descriptions
references to the name "Trump" and related terms
New Auto-Interp
Negative Logits
ASED
-0.81
Interstitial
-0.80
Assembly
-0.77
RW
-0.76
ARP
-0.73
OPE
-0.69
semble
-0.67
URI
-0.67
ECD
-0.67
Reviewed
-0.66
POSITIVE LOGITS
eting
1.35
trump
1.26
eters
1.16
ets
0.99
eter
0.97
trump
0.95
et
0.86
ety
0.85
ed
0.84
stakes
0.83
Activations Density 0.009%