INDEX
Explanations
references to wolves and their ecological significance
New Auto-Interp
Negative Logits
turtles
-0.17
egg
-0.16
frogs
-0.16
Eggs
-0.16
-shell
-0.16
urtle
-0.16
ç¾½
-0.16
Hatch
-0.15
ajar
-0.15
eggs
-0.15
POSITIVE LOGITS
wolf
0.57
Wolf
0.53
wolves
0.51
wolf
0.49
wol
0.48
Wolf
0.46
Wolves
0.44
çĭ¼
0.43
pack
0.42
lup
0.40
Activations Density 0.048%