INDEX
Explanations
specific locations and events mentioned in a text
references to specific locations and events related to crime or violence
New Auto-Interp
Negative Logits
cipled
-0.73
laus
-0.65
cellaneous
-0.59
detail
-0.55
³³³³³³³³³³³³³³³³
-0.54
enture
-0.54
gov
-0.53
ertodd
-0.53
areth
-0.53
yne
-0.51
POSITIVE LOGITS
badge
0.76
moniker
0.67
persona
0.64
onto
0.63
slate
0.63
salute
0.61
barrier
0.61
flag
0.59
masterpiece
0.59
cra
0.59
Activations Density 2.774%