INDEX
Explanations
specific references to animal welfare and ethical considerations in research
New Auto-Interp
Negative Logits
Urbano
-0.48
PRNewswire
-0.48
alık
-0.48
bní
-0.46
urlpatterns
-0.45
Distribuzione
-0.45
$
-0.45
<bos>
-0.44
glGen
-0.44
homonymie
-0.44
POSITIVE LOGITS
cruelty
1.05
cruel
0.89
Cruelty
0.87
atrocities
0.83
Cruel
0.77
cruel
0.75
SwitchCompat
0.75
inhuman
0.72
humane
0.71
inhum
0.70
Activations Density 0.430%