INDEX
Explanations
statements from various public figures and their opinions on political matters
New Auto-Interp
Head Attr Weights
0:0.03
1:0.12
2:0.06
3:0.02
4:0.03
5:0.06
6:0.06
7:0.08
8:0.14
9:0.23
10:0.05
11:0.09
Negative Logits
backdrop
-1.39
gram
-1.27
vantage
-1.21
Gram
-1.21
Gry
-1.19
DAQ
-1.15
hinges
-1.14
competition
-1.09
bloodstream
-1.08
Gall
-1.07
POSITIVE LOGITS
Downloadha
1.45
"{1.43
goodbye
1.39
psc
1.37
amera
1.30
omething
1.30
kef
1.29
anne
1.27
:]
1.22
"'
1.21
Activations Density 0.002%