INDEX
Explanations
phrases related to violence or abuse
negative expressions associated with sexual violence
New Auto-Interp
Negative Logits
Interstitial
-0.63
redes
-0.62
forecasting
-0.60
exting
-0.59
acca
-0.58
-0.58
alore
-0.58
£ı
-0.57
Initial
-0.57
transcript
-0.57
POSITIVE LOGITS
somebody
1.56
anybody
1.45
someone
1.44
him
1.44
me
1.37
someone
1.33
anyone
1.23
him
1.19
us
1.09
Him
1.06
Activations Density 0.403%