INDEX
Explanations
references to violent actions or events involving law enforcement
articles and demonstratives preceding nouns
New Auto-Interp
Negative Logits
!/
-0.77
Contents
-0.76
ATURES
-0.75
Izan
-0.74
Æ
-0.73
[/
-0.72
/-
-0.70
Releases
-0.69
Languages
-0.69
Allow
-0.69
POSITIVE LOGITS
rouse
1.06
woman
1.05
handful
0.98
couple
0.97
burglary
0.91
colleague
0.89
lot
0.89
multitude
0.88
small
0.87
funeral
0.87
Activations Density 0.695%