INDEX
    Explanations

    instances of expressing desires or hopes through the word "wish"

    expressions of desire or longing

    Negative Logits
        " Mub"       -0.73
        "viol"       -0.71
        "ales"       -0.67
        "pub"        -0.65
        "aldehyde"   -0.65
        " DOI"       -0.62
        "leading"    -0.61
        "imil"       -0.60
        " Basics"    -0.60
        " Levels"    -0.60
    Positive Logits
        " wish"      3.82
        " wishes"    2.41
        " wished"    2.39
        " Wish"      2.06
        " wishing"   1.98
        " desire"    1.55
        " want"      1.33
        " regret"    1.32
        " desires"   1.29
        " hope"      1.23
    Act Density 0.014%
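
The density figure can be read as roughly 14 active token positions per 100,000. A minimal sketch of that reading, assuming density means the fraction of token positions where the feature's activation is above zero (the page itself does not define it); the function name `activation_density` and the sample data are hypothetical:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires
    at all; the source page does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 14 of which activate the feature.
sample = [0.0] * 99_986 + [1.5] * 14
density = activation_density(sample)
print(f"{density:.3%}")
```

A threshold of zero treats any positive activation as "active"; a tool that applies a different cutoff would report a different density for the same data.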

    No Known Activations