INDEX
Explanations
verbs related to causing harm or death
instances of the word "to" highlighting actions or intentions
Negative Logits
hes            -0.62
employer       -0.62
Riders         -0.59
indef          -0.59
navig          -0.59
discretionary  -0.59
cancell        -0.59
reactions      -0.59
risks          -0.58
licences       -0.58
Positive Logits
pless          1.11
othy           1.09
ppers          1.05
wered          1.04
ggles          1.02
pping          0.99
satisfy        0.98
promote        0.98
commemorate    0.97
pper           0.95
Activations Density 0.308%