INDEX
Explanations
personal desires and preferences expressed in text
expressions of desire or wants
New Auto-Interp
Negative Logits
Reviewer
-0.64
Dash
-0.64
Results
-0.59
Occ
-0.58
thal
-0.58
decipher
-0.58
bystand
-0.57
oddy
-0.57
ãĥĺ
-0.57
rendered
-0.56
POSITIVE LOGITS
want
1.74
want
1.56
WANT
1.53
wants
1.46
Want
1.44
wanted
1.37
crave
1.31
wanna
1.30
desire
1.28
wanting
1.23
Activations Density 0.326%