INDEX
Explanations
mention of brands or names in a consumer context
New Auto-Interp
Head Attr Weights
0:0.13
1:0.02
2:0.18
3:0.04
4:0.06
5:0.05
6:0.06
7:0.03
8:0.15
9:0.06
10:0.09
11:0.07
Negative Logits
Redditor
-1.62
WARE
-1.60
iring
-1.52
multiplied
-1.52
イト
-1.50
evidenced
-1.47
includ
-1.46
exting
-1.45
arising
-1.43
spawned
-1.41
POSITIVE LOGITS
1.59
Cu
1.59
Kle
1.55
Edge
1.50
Mal
1.48
dor
1.48
Cyr
1.47
etc
1.46
Sierra
1.45
Ivory
1.44
Activations Density 0.011%