Explanations
mentions of skulls or related imagery
Negative Logits
ARY           -0.69
Opportun      -0.66
Avg           -0.66
Published     -0.66
NCT           -0.65
Supporters    -0.65
YN            -0.64
agonist       -0.63
bidden        -0.63
Idle          -0.63
Positive Logits
bones         1.06
bone          1.01
skull         1.01
cap           1.00
skulls        0.92
bones         0.91
caps          0.89
fracture      0.87
fractures     0.86
ornament      0.81
Activations Density: 0.005%