INDEX
Explanations
references to socio-economic issues and governmental policies
New Auto-Interp
Negative Logits
agma
-0.16
derec
-0.16
Tato
-0.15
omo
-0.15
رÙĬÙĥÙĬ
-0.14
Feedback
-0.14
bubble
-0.14
泡
-0.14
adium
-0.14
æ¦ľ
-0.14
POSITIVE LOGITS
SNAP
0.34
welfare
0.28
Welfare
0.25
_snap
0.24
snap
0.23
Snap
0.23
SN
0.21
snap
0.21
Snap
0.21
food
0.20
Activations Density 0.030%