INDEX
Explanations
entities related to wildlife or endangered species
New Auto-Interp
Negative Logits
Vys
-0.17
vertical
-0.16
jet
-0.16
volum
-0.15
ç¥
-0.14
ew
-0.14
(machine
-0.14
machine
-0.14
Relief
-0.14
jet
-0.14
POSITIVE LOGITS
turtles
0.37
turtle
0.34
shell
0.33
urtle
0.33
shell
0.32
Turtle
0.32
turtle
0.31
Shell
0.30
tort
0.30
-shell
0.30
Activations Density 0.011%