INDEX
Explanations
sentiments related to social perceptions and identities
New Auto-Interp
Negative Logits
ulet
-0.15
à¹Ģà¸Ĥà¸ķ
-0.14
é£
-0.14
pie
-0.14
laden
-0.14
lık
-0.14
stup
-0.14
indh
-0.14
iani
-0.14
emark
-0.13
POSITIVE LOGITS
oyo
0.17
471
0.17
467
0.17
840
0.16
Station
0.14
tinder
0.14
zw
0.14
NavController
0.14
cken
0.14
arsers
0.14
Activations Density 0.335%