INDEX
Explanations
phrases indicating responsibility or accountability in political contexts
Head Attr Weights
0:0.06
1:0.01
2:0.08
3:0.14
4:0.06
5:0.06
6:0.03
7:0.04
8:0.05
9:0.10
10:0.21
11:0.11
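
The head attribution weights above can be read as a mapping from attention head index to weight. A minimal sketch, assuming that reading of the listing, ranks the heads and sums the displayed values; the names `head_weights`, `ranked`, and `total` are illustrative and not part of the original page.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Interpreting each "i:w" row as "head i has weight w" is an assumption.
head_weights = {
    0: 0.06, 1: 0.01, 2: 0.08, 3: 0.14,
    4: 0.06, 5: 0.06, 6: 0.03, 7: 0.04,
    8: 0.05, 9: 0.10, 10: 0.21, 11: 0.11,
}

# Rank heads by weight, largest first.
ranked = sorted(head_weights.items(), key=lambda kv: kv[1], reverse=True)
top_head, top_weight = ranked[0]

# Sum of the displayed (rounded) weights.
total = sum(head_weights.values())

print(f"top head: {top_head} (weight {top_weight:.2f})")
print(f"top three heads: {[h for h, _ in ranked[:3]]}")
print(f"sum of displayed weights: {total:.2f}")
```

On these values head 10 carries the largest weight (0.21), followed by heads 3 (0.14) and 11 (0.11). The displayed weights sum to 0.95 rather than 1, which may simply reflect rounding to two decimals.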
Negative Logits
inav
-1.46
\">
-1.23
]"
-1.23
)"
-1.19
"]
-1.15
"},{"
-1.15
"},
-1.14
guiActiveUnfocused
-1.14
"}
-1.13
URI
-1.10
Positive Logits
govtrack
1.36
compuls
1.31
persona
1.31
instincts
1.31
playbook
1.29
charisma
1.27
Himself
1.26
willfully
1.25
indul
1.24
scapego
1.24
Activations Density 0.510%