INDEX
Explanations
instances of scoring systems and social media engagement
New Auto-Interp
Negative Logits
å¥ij
-0.18
eldorf
-0.16
ons
-0.16
ãĥ¥ãĥ¼
-0.15
mart
-0.15
isher
-0.14
rise
-0.14
pton
-0.14
ypo
-0.14
Nelson
-0.13
POSITIVE LOGITS
INY
0.15
باØŃ
0.14
Rolled
0.14
аÑĤÑĥ
0.14
Freund
0.14
.gca
0.13
-append
0.13
-gray
0.13
noxious
0.13
tracking
0.13
Activations Density 0.049%