INDEX
Explanations
statements about violence and its consequences
New Auto-Interp
Negative Logits
springfox
-0.56
isoto
-0.55
accor
-0.52
porous
-0.51
havior
-0.51
ocardio
-0.50
abor
-0.50
djangoproject
-0.50
Carrick
-0.50
Gae
-0.49
POSITIVE LOGITS
bootstrapcdn
0.72
kasarigan
0.71
diyah
0.71
ண்டும்
0.71
devront
0.68
rsiniz
0.67
متعلقه
0.66
الحياه
0.66
gepubliceerd
0.63
fjspx
0.63
Activations Density 0.001%