INDEX
Explanations
phrases related to dominance or authority in competitive contexts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.06
3:0.06
4:0.20
5:0.02
6:0.10
7:0.31
8:0.03
9:0.03
10:0.04
11:0.05
Negative Logits
Serv
-1.93
rification
-1.93
($)
-1.86
+(
-1.84
(%)
-1.81
ILLE
-1.75
actionGroup
-1.65
imei
-1.63
Harm
-1.60
contained
-1.59
POSITIVE LOGITS
benches
1.95
necks
1.89
bandwagon
1.89
heels
1.87
curls
1.75
corners
1.75
chairs
1.74
tails
1.73
hamm
1.70
shores
1.69
Activations Density 0.001%