Explanations
references to the health and conditions of whales
Negative Logits
poultry       -0.18
xia           -0.18
snake         -0.17
frogs         -0.16
snakes        -0.16
lizard        -0.16
chickens      -0.15
_bug          -0.15
CONTRIBUTORS  -0.15
èļ            -0.15
Positive Logits
cet     0.26
whales  0.21
sperm   0.19
seals   0.18
Bale    0.18
whale   0.18
flip    0.17
Flip    0.17
seal    0.16
nar     0.16
Activations Density: 0.102%