INDEX
Explanations
violent and aggressive actions or dialogues
New Auto-Interp
Negative Logits
ontent
-0.82
specialization
-0.80
unemploy
-0.75
igon
-0.74
Sov
-0.72
Colomb
-0.71
redundancy
-0.69
optimization
-0.68
CLUS
-0.68
specialize
-0.67
POSITIVE LOGITS
stretched
1.29
gently
1.27
clasp
1.14
grasped
1.12
fingers
1.08
clenched
1.06
knees
1.06
lips
1.05
palms
1.04
palm
1.03
Activations Density 0.315%