INDEX
NEGATIVE LOGITS
Token         Logit
神経          0.41
cheeses       0.39
砥            0.38
Tested        0.38
बेश           0.37
Recovery      0.37
contaminant   0.37
Fiber         0.36
cheeky        0.36
fections      0.36
POSITIVE LOGITS
Token         Logit
violence      3.64
Violence      3.23
violencia     3.20
Violence      3.20
violent       3.11
violence      3.11
violência     3.08
暴力          2.94
हिंसा         2.64
violent       2.64
Activations Density 0.126%
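
The lists and the density figure above are given without a derivation, so the following is only a minimal sketch of how such numbers are commonly produced, not the method used for this dashboard. It assumes the logit values come from projecting a feature direction through an unembedding matrix, and that activation density is the fraction of tokens on which the feature's activation exceeds a threshold. The function names, the toy matrix, and the zero threshold are all hypothetical.

```python
from typing import List, Sequence, Tuple


def logit_effects(direction: Sequence[float],
                  unembed: Sequence[Sequence[float]]) -> List[float]:
    """Project a feature direction (length d) through a d x vocab matrix.

    Assumption: each token's logit value is the dot product of the
    direction with that token's column of the unembedding matrix.
    """
    vocab_size = len(unembed[0])
    return [
        sum(direction[i] * unembed[i][j] for i in range(len(direction)))
        for j in range(vocab_size)
    ]


def top_and_bottom(effects: Sequence[float], vocab: Sequence[str],
                   k: int) -> Tuple[List[Tuple[str, float]],
                                    List[Tuple[str, float]]]:
    """Return the k most negative and the k most positive (token, value) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]        # most negative first
    positive = ranked[::-1][:k]  # most positive first
    return negative, positive


def activation_density(activations: Sequence[float],
                       threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation is strictly above the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data, invented for illustration only (not taken from the dashboard).
toy_vocab = ["violence", "cheeses", "Fiber"]
toy_direction = [1.0, -0.5]
toy_unembed = [[2.0, 0.0, -1.0],
               [0.0, 1.0, 1.0]]

effects = logit_effects(toy_direction, toy_unembed)
negative, positive = top_and_bottom(effects, toy_vocab, k=1)
print("negative:", negative)
print("positive:", positive)
print("density:", activation_density([0.0, 0.0, 0.0, 1.2]))
```

Under this reading, a density of 0.126% would mean the feature is active on roughly 1 token in 800, and the positive list names the tokens the feature most strongly promotes when it is active.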