INDEX
Explanations
phrases related to cost or value propositions
evaluative phrases indicating positive or favorable characteristics
New Auto-Interp
Negative Logits
mares
-0.77
aline
-0.68
ansk
-0.65
Article
-0.62
alks
-0.62
etz
-0.61
images
-0.59
dule
-0.59
rates
-0.59
individual
-0.59
POSITIVE LOGITS
worthwhile
1.18
boon
1.12
viable
1.10
good
1.07
fruitful
1.05
useful
1.00
mistake
0.99
better
0.97
worthy
0.95
wiser
0.95
Activations Density 0.191%