Explanations
prepositions indicating location or direction
phrases indicating advice or suggestions for action
Negative Logits
ĸļ          -0.63
coni        -0.63
Ó           -0.56
ylan        -0.56
Gleaming    -0.56
³³³³        -0.55
urable      -0.55
conced      -0.55
andowski    -0.54
agog        -0.54
Positive Logits
yourself    1.12
oneself     1.04
yourselves  0.96
your        0.96
your        0.86
Yourself    0.84
somew       0.82
YOUR        0.81
ourselves   0.81
somewhere   0.80
Activations Density 0.328%