INDEX
Explanations
instances of expressing desires or hopes through the word "wish"
expressions of desire or longing
New Auto-Interp
Negative Logits
Mub
-0.73
viol
-0.71
ales
-0.67
pub
-0.65
aldehyde
-0.65
DOI
-0.62
leading
-0.61
imil
-0.60
Basics
-0.60
Levels
-0.60
POSITIVE LOGITS
wish
3.82
wishes
2.41
wished
2.39
Wish
2.06
wishing
1.98
desire
1.55
want
1.33
regret
1.32
desires
1.29
hope
1.23
Activations Density 0.014%