INDEX
Explanations
statements related to political actions or decisions
Negative Logits
brit      -0.14
ienia     -0.14
íĥĢ       -0.14
гÑĢÑĥн     -0.14
arence    -0.14
alu       -0.14
Cli       -0.14
Aware     -0.14
oard      -0.13
ieu       -0.13
Positive Logits
Fake        0.24
Ivanka      0.23
tremendous  0.23
Melania     0.21
Fake        0.20
terrific    0.18
toughness   0.17
Radical     0.17
Podesta     0.17
Schumer     0.17
Activations Density: 0.026%