    Explanations

    emotional and positive expressions related to experiences

    Negative Logits
    Token          Logit
    _by            -0.15
    bes            -0.15
    inae           -0.14
    Which          -0.14
    inis           -0.14
    ByName         -0.14
    orsi           -0.13
    uby            -0.13
    WithContext    -0.13
    ãģ«ãĤĪãĤĭ      -0.13
    Positive Logits
    Token          Logit
     how           0.31
     hearing       0.31
     to            0.30
     knowing       0.27
     having        0.26
     being         0.25
     watching      0.25
     that          0.24
     seeing        0.23
     when          0.21
    Activation Density: 0.100%

    No Known Activations