INDEX
Explanations
medical terminology related to experimental findings and conditions
New Auto-Interp
Negative Logits
Elk
-0.16
Pony
-0.16
ër
-0.16
æī§
-0.15
ihat
-0.15
goose
-0.15
granny
-0.14
pony
-0.14
ambre
-0.14
buah
-0.14
POSITIVE LOGITS
mice
0.41
rats
0.33
mouse
0.31
rodents
0.30
animals
0.27
strain
0.26
rats
0.26
mouse
0.25
litter
0.25
rat
0.24
Activations Density 0.058%