INDEX
Explanations
references to political topics and actions
Negative Logits
unus      -0.58
)."       -0.56
"/>       -0.54
.")       -0.53
))))      -0.51
cheese    -0.50
").       -0.49
)</       -0.49
chees     -0.48
ief       -0.47
Positive Logits
Specifically    0.74
namely          0.70
notably         0.69
specifically    0.66
Specifically    0.66
particularly    0.60
Cosponsors      0.59
Ô               0.58
spearheaded     0.57
primarily       0.57
Activations Density: 0.828%