INDEX
Explanations
references to political events and gatherings
Head Attr Weights
0:0.10
1:0.02
2:0.15
3:0.24
4:0.05
5:0.12
6:0.02
7:0.11
8:0.03
9:0.01
10:0.08
11:0.02
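
The head attribution weights above can be read as a small mapping from attention-head index to weight. A minimal sketch follows, assuming each line has the form `head:weight` exactly as listed. The function and variable names are illustrative and not part of the dashboard. It parses the listing and ranks the heads by weight:

```python
# Head attribution weights copied verbatim from the listing above.
RAW_HEAD_WEIGHTS = """\
0:0.10
1:0.02
2:0.15
3:0.24
4:0.05
5:0.12
6:0.02
7:0.11
8:0.03
9:0.01
10:0.08
11:0.02
"""


def parse_head_weights(text):
    """Parse 'head:weight' lines into a {head_index: weight} dict."""
    weights = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        head, value = line.split(":")
        weights[int(head)] = float(value)
    return weights


head_weights = parse_head_weights(RAW_HEAD_WEIGHTS)

# Sort heads from largest to smallest weight (ties keep listing order).
ranked = sorted(head_weights.items(), key=lambda kv: kv[1], reverse=True)
top_head, top_weight = ranked[0]
total = sum(head_weights.values())

print(f"top head: {top_head} (weight {top_weight:.2f})")
print(f"sum of listed weights: {total:.2f}")
```

With the values listed, head 3 carries the largest weight (0.24), followed by heads 2 and 5, and the twelve weights sum to 0.95.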
Negative Logits
agate        -2.50
conom        -2.46
cancer       -2.32
netflix      -2.26
utical       -2.24
directory    -2.21
profit       -2.14
dependency   -2.11
Downloadha   -2.05
Currently    -2.05
Positive Logits
applause     4.13
spectators   3.88
cheering     3.85
cheers       3.75
attendees    3.58
onlook       3.56
announcer    3.33
cheered      3.27
chanting     3.26
chants       3.23
Activations Density 1.332%
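
Activation density is commonly reported as the fraction of tokens on which a feature's activation is nonzero. The dashboard does not state its exact definition, so the sketch below is only an assumption about how such a figure could be computed. The function name, the threshold parameter, and the activation values are all made up for illustration:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`."""
    if not activations:
        return 0.0  # no tokens observed, so report zero density
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical per-token activations; NOT data from the dashboard.
toy_activations = [0.0, 0.0, 1.7, 0.0, 0.0, 0.0, 0.4, 0.0]

density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 1.332% would mean the feature fires on roughly 13 of every 1,000 tokens sampled.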