INDEX
Explanations
negative sentiment or criticism
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.07
4:0.09
5:0.07
6:0.08
7:0.08
8:0.07
9:0.09
10:0.08
11:0.08
Negative Logits
shit
-2.74
iqu
-2.71
Sci
-2.60
rums
-2.52
pet
-2.52
rum
-2.50
acio
-2.50
Rat
-2.47
SEA
-2.47
rea
-2.45
POSITIVE LOGITS
Clarks
2.73
Vera
2.70
Rabb
2.49
Baldwin
2.48
NX
2.47
Beacon
2.45
Peach
2.45
accessory
2.44
Maiden
2.37
bund
2.35
Activations Density 0.000%