INDEX
Explanations
phrases associated with political or electoral discourse
Negative Logits
ĨĴ        -0.82
ibly      -0.79
prep      -0.72
ipel      -0.70
ãĤ©       -0.69
bots      -0.68
REP       -0.67
Enhanced  -0.66
>[        -0.66
======    -0.66
Positive Logits
limb       0.95
shoulders  0.83
parcel     0.80
mortar     0.76
Neck       0.75
baggage    0.75
trunk      0.73
mouth      0.72
pound      0.72
toe        0.72
Activations Density 0.059%