INDEX
Explanations
intense emotional or physical violence
New Auto-Interp
Negative Logits
fjspx
-0.80
providedIn
-0.79
kasarigan
-0.78
ModelExpression
-0.77
Walkover
-0.76
AssemblyProduct
-0.76
surla
-0.75
GIH
-0.73
UnusedPrivate
-0.72
Вікі
-0.71
POSITIVE LOGITS
ripping
0.79
rip
0.77
torn
0.77
ripped
0.74
tore
0.70
tearing
0.69
tear
0.68
split
0.67
split
0.67
sever
0.66
Activations Density 0.181%