INDEX
Explanations
references to the human skull
references to skulls
New Auto-Interp
Negative Logits
Published
-0.75
NCT
-0.71
bidden
-0.70
Supporters
-0.68
YN
-0.68
NF
-0.68
Nap
-0.67
EX
-0.67
agonist
-0.67
agents
-0.66
POSITIVE LOGITS
skull
1.17
bones
1.04
skulls
1.02
bone
1.00
bones
0.94
caps
0.88
Bone
0.84
cap
0.82
fracture
0.81
fractures
0.78
Activations Density 0.015%