INDEX
Explanations
data related to scientific research and measurement outcomes
New Auto-Interp
Negative Logits
letal
-0.17
kla
-0.14
pony
-0.14
tog
-0.14
Bonds
-0.14
thro
-0.14
ìľ¨
-0.14
odÄĽ
-0.13
ignet
-0.13
^K
-0.13
POSITIVE LOGITS
fear
0.22
learned
0.20
reward
0.20
Consolid
0.20
learning
0.20
Learned
0.20
Morris
0.20
locom
0.20
licking
0.19
wheel
0.19
Activations Density 0.016%