INDEX
Explanations
phrases indicating criticism or concern about social issues and injustices
Negative Logits
Verdana         -0.08
ILA             -0.07
.appspot        -0.07
ouns            -0.07
μÏĨ             -0.07
acter           -0.07
ISCO            -0.07
ila             -0.07
prung           -0.06
_initializer    -0.06
Positive Logits
å¦ĤæŃ¤          0.09
竣              0.07
such            0.07
regress         0.07
so              0.07
akit            0.07
while           0.06
modern          0.06
basic           0.06
grown           0.06
Activations Density 0.024%