Explanations
phrases indicating a strong need or obligation
colloquial expressions emphasizing necessity or desire
Negative Logits
thumbnails   -0.82
eering       -0.69
Reviewed     -0.69
worldly      -0.67
hips         -0.66
iated        -0.66
athing       -0.65
edly         -0.65
rors         -0.65
geist        -0.65
Positive Logits
gotta        1.12
nab          0.82
leave        0.80
give         0.80
listen       0.79
wanna        0.78
wait         0.78
terday       0.77
get          0.74
take         0.74
Activations Density: 0.008%