INDEX
    Explanations

    goodbye, happiness, enjoy

    New Auto-Interp
    Negative Logits
    Hearing
    0.64
     chiede
    0.62
    Demand
    0.62
    owness
    0.61
     decis
    0.60
    ostrum
    0.60
    مكن
    0.59
    ligence
    0.59
     スタッドレスタイヤ
    0.58
     Demand
    0.57
    POSITIVE LOGITS
     Happy
    2.27
    Happy
    2.26
     happy
    2.17
     Enjoy
    2.17
     enjoy
    2.09
    Enjoy
    2.03
     HAPPY
    1.86
    enjoy
    1.85
    happy
    1.84
     feliz
    1.76
    Act Density 0.177%

    No Known Activations