Explanations
references to local governance and political entities
Negative Logits
unca     -0.17
inel     -0.15
olo      -0.15
reich    -0.15
essa     -0.14
ipa      -0.14
Zak      -0.14
usp      -0.14
itus     -0.14
onta     -0.14
Positive Logits
Arlington     0.28
Fairfax       0.26
Alexandria    0.25
Alexand       0.22
alex          0.19
Virginia      0.19
703           0.19
Alex          0.17
alex          0.17
Alex          0.16
Activations Density 0.025%