INDEX
Explanations
phrases expressing uncertainty or questioning decisions
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.18
3:0.08
4:0.10
5:0.02
6:0.02
7:0.32
8:0.04
9:0.03
10:0.06
11:0.06
Negative Logits
untled
-1.98
reputable
-1.85
trustworthy
-1.80
authorized
-1.76
uyomi
-1.74
quickShipAvailable
-1.73
igham
-1.70
emark
-1.63
independ
-1.55
soType
-1.54
POSITIVE LOGITS
Caps
1.60
Soccer
1.54
ilda
1.53
PCR
1.52
RAW
1.51
WWE
1.47
Boxing
1.46
Struggle
1.40
Brawl
1.38
Paint
1.36
Activations Density 0.012%