INDEX
Explanations
words related to preferences or choices made by individuals
phrases and structures indicating preferences or choices
New Auto-Interp
Head Attr Weights
0:0.05
1:0.02
2:0.14
3:0.05
4:0.25
5:0.06
6:0.02
7:0.02
8:0.08
9:0.18
10:0.06
11:0.03
Negative Logits
pload
-1.65
querque
-1.59
infeld
-1.46
loo
-1.40
rir
-1.32
kj
-1.29
umbai
-1.28
apeake
-1.27
azeera
-1.25
Rail
-1.25
POSITIVE LOGITS
TEXTURE
1.46
overr
1.38
lihood
1.37
presets
1.33
fonts
1.33
Ratings
1.31
selections
1.30
Preferences
1.29
pleasing
1.28
preferences
1.28
Activations Density 0.005%