INDEX
Explanations
phrases indicating satisfaction or pleasure in various contexts
expressions of satisfaction or approval
New Auto-Interp
Negative Logits
sites
-0.74
chin
-0.73
mouth
-0.69
hops
-0.69
teasp
-0.69
<@
-0.67
GOODMAN
-0.67
fighter
-0.66
mite
-0.64
fair
-0.63
POSITIVE LOGITS
regards
1.38
regard
1.19
stood
1.05
standing
1.03
respect
0.97
impunity
0.93
draw
0.81
drawn
0.79
dignity
0.78
utmost
0.72
Activations Density 0.096%