INDEX
Explanations
references to petitions
New Auto-Interp
Negative Logits
lasses
-0.88
ectar
-0.70
Tune
-0.65
çīĪ
-0.63
orf
-0.61
quickShipAvailable
-0.61
Haram
-0.61
antle
-0.60
bane
-0.60
obyl
-0.59
POSITIVE LOGITS
petitions
0.99
petition
0.98
ers
0.91
naires
0.88
filed
0.85
ing
0.84
Petition
0.82
aires
0.82
ingham
0.81
signatures
0.79
Activations Density 0.010%