INDEX
Explanations
phrases related to political events, polling locations, and governmental actions
New Auto-Interp
Negative Logits
itives
-0.73
ptions
-0.65
earable
-0.65
digs
-0.62
SPONSORED
-0.61
pains
-0.61
Finish
-0.60
MpServer
-0.60
ptive
-0.60
ickets
-0.60
POSITIVE LOGITS
virtue
1.23
contrast
1.15
akuya
1.08
products
1.04
catch
1.03
implication
1.00
product
0.98
stand
0.92
default
0.91
pass
0.90
Activations Density 0.253%