INDEX
Explanations
terms related to killing and murder
New Auto-Interp
Negative Logits
>",
-0.75
Himo
-0.75
Applicant
-0.73
Farah
-0.72
dized
-0.68
Manly
-0.68
Dumas
-0.67
geheel
-0.67
Folks
-0.67
متحده
-0.67
POSITIVE LOGITS
kill
1.55
kills
1.49
KILL
1.47
Kill
1.46
kill
1.38
killing
1.38
killed
1.34
Kills
1.31
killings
1.26
Kill
1.24
Activations Density 0.068%