Explanations
phrases related to legislation and political discourse
Negative Logits
stone       -0.67
bugs        -0.66
stones      -0.65
âĢİ         -0.64
eteen       -0.63
eteenth     -0.62
oys         -0.62
idge        -0.61
IOR         -0.60
ocrin       -0.60
Positive Logits
rew          0.78
rightly      0.77
preferably   0.75
rightfully   0.70
indeed       0.69
perhaps      0.68
hopefully    0.66
therefore    0.66
vice         0.64
moreover     0.62
Activations Density 0.152%