INDEX
Explanations
mentions of human anatomy, specifically brains and heads
references to the brain and mental processes
Negative Logits
Prosecut   -0.65
ression    -0.64
ECH        -0.62
INA        -0.62
ressive    -0.61
ee         -0.61
card       -0.61
rav        -0.61
onomy      -0.60
Delivery   -0.60
Positive Logits
chool      1.54
mith       1.45
paces      1.41
pring      1.38
pace       1.33
creen      1.32
cale       1.30
ystem      1.27
hips       1.25
hare       1.19
Activations Density: 0.088%