INDEX
Explanations
phrases related to political events or official statements
New Auto-Interp
Negative Logits
BIP
-0.70
IGHTS
-0.66
beit
-0.65
hips
-0.64
interf
-0.64
Dynamics
-0.64
âĸ¬âĸ¬
-0.63
ADRA
-0.61
Admir
-0.61
withd
-0.60
POSITIVE LOGITS
olitan
1.56
olitics
1.37
oleon
1.33
olit
1.32
ocalypse
1.32
rint
1.30
inion
1.28
rompt
1.28
ublic
1.27
onent
1.27
Activations Density 2.102%