INDEX
Explanations
phrases related to political controversies or conflicts
New Auto-Interp
Negative Logits
DAY
-0.75
BALL
-0.72
ï¸ı
-0.72
Ô
-0.68
OHN
-0.66
hyde
-0.66
NING
-0.66
senal
-0.65
Tur
-0.62
terday
-0.61
POSITIVE LOGITS
ederation
1.34
essional
1.33
luence
1.26
usions
1.21
eder
1.20
erences
1.14
essor
1.14
idences
1.11
ederal
1.04
etti
1.03
Activations Density 0.009%