INDEX
Explanations
mentions of bipartisan efforts or agreements
references to bipartisan cooperation or agreements in political contexts
New Auto-Interp
Negative Logits
iph
-0.92
resp
-0.85
phal
-0.83
words
-0.77
ulia
-0.77
ogenesis
-0.77
argon
-0.76
ques
-0.75
handler
-0.73
oras
-0.72
POSITIVE LOGITS
majorities
1.03
bipartisan
1.03
consensus
1.02
congressional
0.92
coalition
0.91
effort
0.87
agreement
0.86
legislative
0.85
Congressional
0.83
efforts
0.81
Activations Density 0.022%