INDEX
Explanations
statements regarding political activities or elections
New Auto-Interp
Negative Logits
ÑģÑı
-0.16
icari
-0.15
ayload
-0.15
tring
-0.14
cke
-0.14
_MPI
-0.14
ardy
-0.14
ivery
-0.14
orias
-0.14
dda
-0.14
POSITIVE LOGITS
oui
0.16
ioni
0.15
/*!<
0.14
以åıĬ
0.14
ï¼Į以åıĬ
0.14
flame
0.14
949
0.13
upro
0.13
OnCollision
0.13
utters
0.13
Activations Density 0.149%