INDEX
Explanations
references to President Trump and other political figures
New Auto-Interp
Negative Logits
Wich
-0.17
Kral
-0.15
ired
-0.14
ANGES
-0.14
itech
-0.14
minister
-0.14
sup
-0.14
RTL
-0.13
ite
-0.13
wake
-0.13
POSITIVE LOGITS
Barack
0.26
Obama
0.24
Trump
0.22
Bush
0.20
President
0.19
Donald
0.19
Obama
0.18
Clinton
0.17
Biden
0.17
Trump
0.17
Activations Density 0.068%