INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    enthal
    -0.07
     numb
    -0.07
    aturday
    -0.06
     contempl
    -0.06
     aff
    -0.06
     sang
    -0.06
    langle
    -0.06
    เพ
    -0.06
     number
    -0.06
     fourn
    -0.06
    POSITIVE LOGITS
     Crisis
    0.10
     crisis
    0.10
     кри
    0.08
     Trouble
    0.07
     transition
    0.07
    0.07
    raising
    0.07
    -question
    0.07
     wxT
    0.06
    心里
    0.06
    Act Density 0.003%

    No Known Activations