INDEX
Explanations
terms related to medical research and disease pathology
New Auto-Interp
Negative Logits
Pony
-0.16
buah
-0.15
patient
-0.15
Elk
-0.15
央
-0.15
æī§
-0.15
ambre
-0.15
ÑĢÑĥками
-0.14
ër
-0.14
acci
-0.14
POSITIVE LOGITS
mice
0.40
rats
0.34
mouse
0.31
animals
0.30
rodents
0.30
Animals
0.27
animals
0.26
rat
0.26
mouse
0.25
litter
0.25
Activations Density 0.064%