Explanations
mentions of legislation and political actions
Negative Logits
ruary           -0.71
theless         -0.70
bara            -0.69
mberg           -0.63
psychiat        -0.62
vertisements    -0.62
ptin            -0.60
prisingly       -0.59
amorph          -0.58
owship          -0.57
Positive Logits
Kavanaugh       0.80
accuser         0.72
Dems            0.70
']              0.68
]}              0.65
intel           0.64
probe           0.64
').             0.64
%]              0.62
«ĺ              0.61
Activations Density 0.041%