INDEX
Explanations
mentions of large crowds and instances of social media engagement
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.07
3:0.08
4:0.20
5:0.03
6:0.04
7:0.32
8:0.03
9:0.05
10:0.06
11:0.05
Negative Logits
brunt
-1.53
apprehended
-1.52
lost
-1.50
loss
-1.49
losses
-1.49
grain
-1.49
contracted
-1.48
ittens
-1.47
missing
-1.44
cuts
-1.44
POSITIVE LOGITS
Compare
1.57
Analy
1.53
Ratings
1.47
Spons
1.42
erie
1.42
Reviews
1.38
Mot
1.38
FTWARE
1.36
sponsoring
1.34
env
1.33
Activations Density 0.004%