INDEX
    Explanations

    phrases that express varying degrees of excitement or interest

    New Auto-Interp
    Negative Logits
    pector
    -0.15
     seedu
    -0.14
    ãģĦãĤĭ
    -0.14
    udd
    -0.14
    Ã
    -0.14
     عد
    -0.14
     leak
    -0.13
    yect
    -0.13
     frail
    -0.13
    indow
    -0.13
    POSITIVE LOGITS
     cic
    0.16
    uner
    0.14
    lez
    0.14
    ROKE
    0.14
     rout
    0.14
    ’ta
    0.14
    uraa
    0.14
    ارÙĩ
    0.13
    reten
    0.13
     manufact
    0.13
    Act Density 0.026%

    No Known Activations