INDEX
Explanations
descriptions of violent encounters and criminal activities, particularly involving physical harm
New Auto-Interp
Negative Logits
relies
-0.62
Rules
-0.61
Edit
-0.60
Own
-0.60
Events
-0.60
evidence
-0.59
Based
-0.58
Sources
-0.58
alties
-0.58
amounts
-0.58
POSITIVE LOGITS
handful
0.88
bunch
0.86
flurry
0.82
bottle
0.80
couple
0.80
knife
0.76
rouse
0.76
small
0.75
few
0.75
dozen
0.74
Activations Density 10.275%