INDEX
Explanations
references to animal behavior and adaptations
New Auto-Interp
Negative Logits
agar
-0.17
cade
-0.15
éı
-0.15
vintage
-0.15
Ùħص
-0.14
teÅŁ
-0.14
dns
-0.14
ksen
-0.14
administr
-0.13
ego
-0.13
POSITIVE LOGITS
adaptations
0.28
evolved
0.26
adaptation
0.24
specialization
0.24
evolution
0.23
adapt
0.23
evolutionary
0.21
ancestral
0.21
adapted
0.21
adaptive
0.21
Activations Density 0.057%