INDEX
Explanations
political statements or arguments
references to significant and impactful events or conditions
Negative Logits
hemor        -0.76
zoning       -0.74
plat         -0.70
Borough      -0.68
Manhattan    -0.66
destro       -0.65
airs         -0.65
charm        -0.65
playbook     -0.64
Marketable   -0.64
Positive Logits
ª            1.26
¹            1.25
ı            1.16
¸            1.11
³            1.09
¦            1.09
«            1.06
taboola      1.05
ł            1.04
£            1.04
Activations Density 0.087%