INDEX
Explanations
the term "perpetrator" or similar words indicating someone responsible for a harmful action
terms related to criminal actions and the individuals who commit them
New Auto-Interp
Negative Logits
yip
-0.76
andel
-0.71
mel
-0.69
psey
-0.68
ERAL
-0.66
Plat
-0.66
zl
-0.66
binding
-0.62
rients
-0.62
Pyth
-0.61
POSITIVE LOGITS
perpetrated
0.99
committed
0.78
perpetrator
0.77
perpetrators
0.75
perpet
0.73
thereof
0.72
inflicted
0.72
ãĥĩ
0.71
orst
0.71
offender
0.70
Activations Density 0.035%