INDEX
Explanations
phrases indicative of personal needs and desires
New Auto-Interp
Negative Logits
åĿĬ
-0.20
onda
-0.18
onder
-0.17
ervas
-0.17
eldon
-0.16
dana
-0.16
çĽĹ
-0.16
imple
-0.15
Intent
-0.15
oba
-0.15
POSITIVE LOGITS
barg
0.26
deserve
0.23
deserves
0.20
seek
0.19
desire
0.18
Barg
0.18
æīĢ
0.18
bargain
0.17
require
0.16
cov
0.16
Activations Density 0.052%