Negative Logits
lifetime      -0.54
Cosponsors    -0.54
SPONSORED     -0.53
*/(           -0.48
somew         -0.48
esson         -0.47
vernment      -0.47
abase         -0.45
favors        -0.45
DeL           -0.45
Positive Logits
oga           0.60
rim           0.55
ieri          0.55
iba           0.51
ras           0.51
uri           0.50
QUI           0.50
ghan          0.49
omon          0.48
ja            0.48
Activations Density 0.237%