INDEX
Explanations
words associated with varying states of discomfort and struggle
Head Attr Weights
0:0.09
1:0.03
2:0.33
3:0.05
4:0.09
5:0.06
6:0.03
7:0.03
8:0.08
9:0.07
10:0.05
11:0.03
Negative Logits
Observatory   -1.32
occup         -1.28
aida          -1.22
occupation    -1.19
プ            -1.18
Balk          -1.14
Nost          -1.14
cutoff        -1.05
Maj           -1.02
operative     -1.02
Positive Logits
parts         1.43
puff          1.34
oslav         1.33
ombat         1.27
obin          1.24
nir           1.21
cycles        1.20
FACE          1.17
anon          1.16
pointer       1.13
Activations Density 0.004%