INDEX
Explanations
phrases indicating maximizing or optimizing actions
expressions of the ability or potential to do something
New Auto-Interp
Negative Logits
furt
-0.82
BuyableInstoreAndOnline
-0.75
Lauder
-0.72
Debor
-0.65
EUR
-0.58
gement
-0.58
groundbreaking
-0.56
ERY
-0.56
agram
-0.56
ELD
-0.55
POSITIVE LOGITS
afford
1.25
muster
1.22
manage
1.06
possibly
0.98
tolerate
0.97
feas
0.93
handle
0.93
conceive
0.93
imagine
0.89
cram
0.88
Activations Density 0.045%