Explanations
words related to harm or danger
terms related to harm and negative impacts
Negative Logits
Token      Logit
ãĤº        -0.80
Pens       -0.78
Tools      -0.68
excav      -0.67
Activity   -0.65
mine       -0.65
Screen     -0.65
Poc        -0.65
Ig         -0.64
Enlarge    -0.63
Positive Logits
Token      Logit
choke      1.82
harm       1.57
dizz       1.35
staggered  1.34
bankrupt   1.33
cant       1.31
choked     1.28
choking    1.27
stagger    1.24
inco       1.19
Activations Density: 0.047%
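
As a rough aid to reading the density figure, the sketch below assumes that density means the fraction of token positions on which the feature's activation is above zero; the source does not state its exact definition. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 3 active positions out of 8.
toy = [0.0, 0.0, 1.2, 0.0, 0.4, 0.0, 0.0, 2.1]
density = activation_density(toy)
print(f"toy density: {density:.3%}")

# Reading the reported value: 0.047% of tokens is about 47 per 100,000.
reported = 0.047 / 100
tokens_per_100k = reported * 100_000
print(f"about {round(tokens_per_100k)} active tokens per 100,000")
```

Under that assumption, a density of 0.047% describes a sparse feature that fires on roughly 47 of every 100,000 tokens.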