INDEX
Explanations
phrases and words related to identity and value in cultural contexts
Head Attr Weights
0:0.03
1:0.04
2:0.25
3:0.09
4:0.01
5:0.02
6:0.08
7:0.13
8:0.11
9:0.06
10:0.07
11:0.06
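The weights above are easier to compare once ranked. The following is a minimal sketch, assuming each line is a `head_index:weight` pair; the variable names (`raw`, `weights`, `ranked`) are illustrative and not part of the source page.

```python
# Head attribution weights copied from the listing above, one "head:weight" per line.
# Assumption: the number before the colon is the head index, the number after is its weight.
raw = """0:0.03
1:0.04
2:0.25
3:0.09
4:0.01
5:0.02
6:0.08
7:0.13
8:0.11
9:0.06
10:0.07
11:0.06"""

# Parse each line into {head_index: weight}.
weights = {}
for line in raw.splitlines():
    head, value = line.split(":")
    weights[int(head)] = float(value)

# Rank heads from largest to smallest weight.
ranked = sorted(weights.items(), key=lambda kv: kv[1], reverse=True)
top_head, top_weight = ranked[0]
total = sum(weights.values())

print(f"top head: {top_head} (weight {top_weight:.2f})")
print(f"sum of listed weights: {total:.2f}")
```

On the listed values, head 2 carries the largest weight (0.25), followed by heads 7 and 8, and the twelve weights sum to 0.95.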
Negative Logits
Annotations  -1.25
earances  -1.22
Cas  -1.02
thood  -1.01
ISTORY  -1.00
vironments  -0.98
DoS  -0.97
う  -0.97
��  -0.96
黒  -0.96
Positive Logits
liest  2.00
iest  1.46
equivalent  1.38
hest  1.32
antidote  1.21
thing  1.19
closest  1.13
perennial  1.09
est  1.09
pinnacle  1.08
Activations Density 0.226%