Explanations
phrases related to societal issues and activism
Negative Logits
Token       Logit
ebin        -0.64
icut        -0.63
ouple       -0.62
Leilan      -0.58
ificant     -0.58
=================================================================  -0.57
ajor        -0.57
wallet      -0.57
é¾įåĸļ士    -0.55
DN          -0.55
Positive Logits
Token       Logit
envisioned  0.89
that        0.83
required    0.82
depicted    0.80
envis       0.76
that        0.76
plag        0.75
which       0.75
portrayed   0.75
which       0.74
Activations Density 0.868%
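
The density figure above is presumably the share of sampled tokens on which the feature fires. The page itself does not state how it is computed, so the following is only a minimal sketch under that assumption: the function name `activation_density`, the zero threshold, and the sample values are all hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the fraction of sampled tokens on which
    the feature is active. The dashboard does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical per-token activations: the feature fires on 2 of 8 tokens.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.4, 0.0]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.868% would mean the feature is active on fewer than 1 in 100 sampled tokens.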