INDEX
Negative Logits
Killing
-0.82
killing
-0.82
murdered
-0.77
kill
-0.75
kill
-0.73
KILL
-0.73
murders
-0.71
Killing
-0.71
murder
-0.70
killed
-0.70
POSITIVE LOGITS
bahay
0.61
rung
0.55
ształ
0.51
refuge
0.50
scouting
0.49
crowning
0.49
resolutions
0.49
loob
0.49
brim
0.47
Mep
0.47
Activations Density 0.028%