INDEX
Explanations
mentions of Donald Trump and variations of his name
Negative Logits
geber   -0.16
ibil    -0.15
plural  -0.15
žen     -0.14
rics    -0.14
anlı    -0.14
erna    -0.14
Truthy  -0.14
iap     -0.14
ิเศษ    -0.14
Positive Logits
s               0.23
Administration  0.23
administration  0.23
Donald          0.21
ster            0.21
eter            0.20
eted            0.20
sters           0.20
enstein         0.20
eting           0.19
Activations Density 0.020%