INDEX
Explanations
terms related to specific events or incidents being reported by eyewitnesses in news articles
references to legal and emergency services situations
Negative Logits
().       -0.72
…"       -0.69
🙂       -0.67
etc      -0.67
______   -0.64
―        -0.64
("       -0.64
.",      -0.64
♥        -0.63
ω        -0.63
Positive Logits
ogether      1.00
icularly     0.95
inarily      0.94
izens        0.89
goers        0.88
Details      0.83
quartered    0.81
ificantly    0.80
itionally    0.80
fighters     0.80
Activations Density 0.329%