INDEX
Explanations
political statements with a strong opinion
specific symbols or characters that signify significant content or themes
Negative Logits
zoning      -0.80
territ      -0.79
plat        -0.77
hemor       -0.77
Borough     -0.70
Marketable  -0.70
destro      -0.68
charm       -0.67
habit       -0.67
dust        -0.66
Positive Logits
¹           1.15
ª           1.13
¸           1.06
ı           1.04
«           0.99
taboola     0.97
¾           0.96
º           0.94
³           0.94
¦           0.94
Activations Density 0.063%