INDEX
Explanations
detailed physical descriptions of animals
Negative Logits
ukan       -0.15
632        -0.15
raid       -0.15
antro      -0.14
ith        -0.14
487        -0.14
nbr        -0.14
çķ¥        -0.13
ادر        -0.13
ampler     -0.13
Positive Logits
bill       0.25
bills      0.25
feathers   0.24
primaries  0.21
wing       0.21
wings      0.21
flight     0.20
Wings      0.20
efe        0.20
bill       0.20
Activations Density: 0.015%