INDEX
Explanations
expressions of strong emotional reactions or sentiments
New Auto-Interp
Head Attr Weights
0:0.03
1:0.05
2:0.11
3:0.15
4:0.06
5:0.20
6:0.17
7:0.04
8:0.03
9:0.04
10:0.04
11:0.03
Negative Logits
McM
-1.48
Jets
-1.47
Norwich
-1.43
cro
-1.43
Bulls
-1.42
barn
-1.41
looms
-1.38
derby
-1.37
Cro
-1.36
Bav
-1.36
POSITIVE LOGITS
annot
2.11
govtrack
2.03
forward
1.91
emort
1.90
HP
1.87
)</
1.81
reci
1.75
cause
1.75
bound
1.75
accept
1.74
Activations Density 0.015%