INDEX
Explanations
names of specific locations, possibly related to crime or events
place names and geographic locations
New Auto-Interp
Negative Logits
meanwhile
-0.79
ultras
-0.78
Rohing
-0.76
regimes
-0.75
pse
-0.74
Indones
-0.73
translated
-0.72
worldwide
-0.72
Icelandic
-0.70
Iranians
-0.69
POSITIVE LOGITS
heny
1.35
endale
1.32
esville
1.30
enton
1.29
erville
1.27
inton
1.26
eston
1.14
ilton
1.11
bury
1.10
lington
1.09
Activations Density 0.248%