INDEX
NEGATIVE LOGITS
identities    -0.07
إد            -0.06
า             -0.06
もう          -0.06
�             -0.06
â             -0.06
Ub            -0.06
♀             -0.06
حر            -0.06
-centered     -0.06
POSITIVE LOGITS
agree          0.08
political      0.07
gif            0.07
-analytics     0.06
RECEIVE        0.06
Collaboration  0.06
trigger        0.06
favored        0.06
scrollTop      0.06
XXXXXXXX       0.06
Activations Density 0.035%