INDEX
Explanations
possessive and descriptive adjectives related to appearance
Head Attr Weights
0:0.13
1:0.32
2:0.04
3:0.03
4:0.02
5:0.18
6:0.04
7:0.02
8:0.04
9:0.04
10:0.04
11:0.04
Negative Logits
ATS     -2.05
龍      -2.04
BOOK    -2.04
龍�     -2.00
Ult     -1.99
Spec    -1.96
Oracle  -1.95
Vers    -1.88
76561   -1.87
Oregon  -1.87
Positive Logits
lungs   2.57
limb    2.55
scalp   2.46
lung    2.42
foot    2.35
legs    2.35
arms    2.34
limbs   2.32
face    2.29
jaw     2.27
Activations Density 0.016%