INDEX
Explanations
words related to criminal activities, particularly arson
terms associated with arson and related criminal activities
Negative Logits
avez           -0.75
Presidents     -0.74
onse           -0.71
nir            -0.70
enance         -0.69
ux             -0.69
fortune        -0.67
advertisement  -0.63
POST           -0.63
ktop           -0.63
Positive Logits
arson          0.92
witch          0.84
emouth         0.83
©¶æ¥µ          0.75
charcoal       0.75
stove          0.74
burner         0.73
cest           0.72
bats           0.71
rall           0.69
Activations Density 0.014%
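
The density figure above is not defined on this page. A minimal sketch follows, assuming it means the fraction of token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration; they are not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, exactly one of them active.
sample = [0.0] * 9999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.014% would correspond to the feature firing on roughly 14 of every 100,000 token positions.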