INDEX
Explanations
mentions of scientific concepts or figures, particularly related to evolution
New Auto-Interp
Negative Logits
acs
-0.15
endar
-0.15
precated
-0.14
AXB
-0.13
bjerg
-0.13
assis
-0.13
azu
-0.13
IRR
-0.13
صات
-0.13
jak
-0.12
POSITIVE LOGITS
Star
0.29
Boyle
0.25
Star
0.25
-Star
0.24
Hel
0.24
Newman
0.23
ch
0.23
Hel
0.20
hel
0.20
СÑĤаÑĢ
0.19
Activations Density 0.000%