INDEX
Explanations
phrases related to violent incidents or accidents
references to violent incidents or emergencies
Negative Logits
peak            -0.70
ħĭ              -0.69
retiring        -0.67
recommending    -0.64
revise          -0.63
persuasion      -0.63
forgetting      -0.63
Absent          -0.62
recomm          -0.62
optim           -0.62
Positive Logits
occurred        1.84
happened        1.61
lasted          1.37
unfolded        1.35
transpired      1.35
resulted        1.28
coincided       1.24
stemmed         1.23
occurs          1.21
occ             1.20
Activations Density: 0.240%