INDEX
    Explanations

    phrases expressing emotions or feelings

    New Auto-Interp
    Negative Logits
    ekl
    -0.15
    ahl
    -0.15
    iaz
    -0.15
    CARD
    -0.15
    دث
    -0.15
    agger
    -0.15
    ëĵĿ
    -0.15
    avit
    -0.14
    odo
    -0.14
    à¸Ńà¸Ń
    -0.14
    POSITIVE LOGITS
    365
    0.16
    lessly
    0.16
    burg
    0.15
    omite
    0.15
    uctor
    0.14
    ãĥ¼ãĥŃ
    0.14
    arton
    0.14
    flo
    0.14
    reserve
    0.14
    opcion
    0.14
    Act Density 0.040%

    No Known Activations