INDEX
Explanations
events or situations involving bears
New Auto-Interp
Negative Logits
Butterfly
-0.17
rido
-0.17
æī¶
-0.16
lizard
-0.16
Pony
-0.15
poultry
-0.15
Millet
-0.15
ogra
-0.15
frog
-0.15
serpent
-0.14
POSITIVE LOGITS
bears
0.52
bear
0.52
Bears
0.44
Bear
0.44
bear
0.39
Bear
0.39
çĨĬ
0.37
cub
0.33
Urs
0.30
polar
0.29
Activations Density 0.020%