INDEX
Explanations
phrases expressing intention, action, and purpose
phrases related to expressing intentions or purposes
New Auto-Interp
Negative Logits
similar
-0.65
uffs
-0.63
disagree
-0.61
artisan
-0.60
Mart
-0.58
impair
-0.58
uthor
-0.58
azer
-0.58
ventions
-0.57
odox
-0.57
POSITIVE LOGITS
boils
0.81
anyways
0.71
Wanted
0.70
stri
0.70
ga
0.69
nutshell
0.66
anyway
0.65
atech
0.63
endings
0.62
meant
0.61
Activations Density 0.165%