INDEX
Explanations
words related to mammalian biology
references to mammals and related terminology
New Auto-Interp
Negative Logits
iffe
-0.87
uler
-0.80
inning
-0.78
mire
-0.76
room
-0.71
rapnel
-0.71
iner
-0.70
mir
-0.69
outer
-0.69
acher
-0.69
POSITIVE LOGITS
mammals
1.18
mammal
1.06
cules
1.03
reptiles
1.00
carniv
0.99
species
0.98
brates
0.97
primates
0.94
osaurs
0.92
mammalian
0.87
Activations Density 0.027%