INDEX
Explanations
keywords associated with conservation efforts and animal protection
New Auto-Interp
Negative Logits
Chick
-0.18
paddle
-0.18
Ducks
-0.17
moll
-0.16
emoc
-0.16
泡
-0.16
bubble
-0.16
turtles
-0.15
Eggs
-0.15
culo
-0.15
POSITIVE LOGITS
tiger
0.48
Tiger
0.47
lion
0.45
Tigers
0.45
lions
0.42
Lion
0.40
lion
0.38
Lions
0.37
Panther
0.35
Tig
0.33
Activations Density 0.070%