INDEX
Explanations
news headlines and articles related to politics and events
New Auto-Interp
Negative Logits
telecommunications
-0.58
disparate
-0.55
surrog
-0.55
atform
-0.50
conservancy
-0.49
Rodham
-0.48
congressional
-0.48
athered
-0.48
zens
-0.48
bona
-0.48
POSITIVE LOGITS
********************************
0.72
----------------------------------------------------------------
0.71
________________
0.69
=================================================================
0.68
================================================================
0.67
NOTE
0.67
________________________________________________________________
0.67
}}}
0.67
----------------------------------------------------------------
0.64
xx
0.64
Activations Density 3.681%