INDEX
Explanations
phrases expressing a strong preference or insistence on a particular way of doing things
instances of the phrase "this way."
New Auto-Interp
Negative Logits
livest
-0.73
sugg
-0.73
ynski
-0.68
urated
-0.64
usters
-0.63
erville
-0.63
oute
-0.63
ongo
-0.63
unts
-0.62
uster
-0.62
POSITIVE LOGITS
forward
0.79
fare
0.78
ward
0.76
finding
0.75
Sabha
0.75
footed
0.73
zzo
0.72
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
0.67
soever
0.66
forever
0.64
Activations Density 0.035%