INDEX
Explanations
descriptive features of physical characteristics in animals
Negative Logits
ambre   -0.17
avel    -0.15
676     -0.15
_lead   -0.14
lemn    -0.14
lingen  -0.14
monkey  -0.14
ìļ      -0.14
065     -0.14
linger  -0.14
Positive Logits
Mines  0.14
ILA    0.14
ils    0.14
atown  0.14
ORS    0.14
Kodi   0.13
æĸĹ    0.13
integ  0.13
Kong   0.13
erti   0.13
Activations Density: 0.032%