INDEX
Explanations
references to animals and animal-related research or testing
New Auto-Interp
Negative Logits
Rhapsody
-0.75
Lupin
-0.73
Blame
-0.71
ujednoznacz
-0.71
utenants
-0.70
withstanding
-0.69
كومونز
-0.69
pravi
-0.68
Teb
-0.67
convaincre
-0.67
POSITIVE LOGITS
animal
1.62
animals
1.52
Animal
1.46
Animal
1.38
animal
1.38
animals
1.34
Animals
1.33
ANIMAL
1.31
ANIMAL
1.30
Animals
1.29
Activations Density 0.073%