INDEX
Explanations
references to political actions and statements related to leadership and governance
Negative Logits
produce     -0.15
à¹īà¸ĩ      -0.15
ahun        -0.15
æİĽ         -0.15
DTV         -0.14
ossier      -0.14
ήÏĤ        -0.14
ordo        -0.14
icious      -0.13
inski       -0.13
Positive Logits
usz         0.15
rell        0.14
variants    0.14
Minority    0.14
âĵĺ         0.14
LP          0.14
/blue       0.14
tingham     0.13
tween       0.13
aug         0.13
Activations Density: 0.260%