Explanations
references to wolves and their ecological significance
Negative Logits
dragon      -0.15
enguin      -0.15
vais        -0.14
dragon      -0.14
snake       -0.14
ubits       -0.14
Dragon      -0.14
Dragon      -0.13
tring       -0.13
orum        -0.13
Positive Logits
relatives   0.17
species     0.16
Bates       0.15
cousins     0.15
omat        0.15
Relatives   0.15
pecies      0.15
adapted     0.15
_adapter    0.15
INCT        0.14
Activations Density: 0.107%