INDEX
Explanations
incidents involving injury or harm to individuals, particularly in relation to violent acts
Negative Logits
ロー  -0.17
assi  -0.15
abies  -0.14
us  -0.14
ONEY  -0.14
スポ  -0.13
permalink  -0.13
kå  -0.13
å¨  -0.13
FileNotFoundException  -0.13
Positive Logits
woman  0.39
man  0.38
couple  0.31
mother  0.27
Woman  0.25
girl  0.25
father  0.25
teenager  0.24
teen  0.23
boy  0.23
Activations Density 0.283%