INDEX
Explanations
references to Donald Trump and related terms
Negative Logits
teto          -0.65
Obre          -0.62
\{\\          -0.61
Filmografie   -0.60
漕            -0.59
довлет        -0.59
'{@           -0.59
wieś          -0.57
occasione     -0.57
Leite         -0.56
Positive Logits
Trump         2.46
Trump         2.28
trump         1.69
trump         1.64
Donald        1.30
特朗普        1.26
Donald        1.25
DONALD        1.19
donald        1.09
trumpet       1.03
Activations Density 0.055%
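
One way to read the activation density figure is as the fraction of sampled token positions on which the feature fires at all. The sketch below assumes that definition (the source does not state how the density is computed), and both the `activation_density` helper and the sample activations are hypothetical, chosen only so the arithmetic lands on the same 0.055% figure.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature
    is active. The dashboard itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 11 of 20,000 token positions.
sample = [0.0] * 19_989 + [1.5] * 11
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.055% means the feature is active on roughly 1 in every 1,800 tokens, which fits a feature tied to a single named entity.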