INDEX
Explanations
references to governmental and political actions related to healthcare and social issues
New Auto-Interp
Negative Logits
"!
-0.76
%.
-0.74
.:
-0.72
.<
-0.72
!".
-0.70
!.
-0.69
.(
-0.67
."
-0.66
.",
-0.65
":[
-0.65
POSITIVE LOGITS
*)
1.01
)]
0.99
)}
0.98
})
0.95
)]
0.88
?)
0.88
?)
0.86
+)
0.85
)
0.82
)\
0.82
Activations Density 0.986%