INDEX
    Explanations

    intensity or emphasis in statements

    New Auto-Interp
    Negative Logits
    oot
    -0.16
    ÏĢοιη
    -0.15
    cape
    -0.15
     dat
    -0.14
    antine
    -0.14
    Tac
    -0.14
    -Ta
    -0.14
    emm
    -0.14
     Vig
    -0.14
     Ying
    -0.14
    POSITIVE LOGITS
    alte
    0.20
    ampa
    0.17
    alu
    0.16
     roller
    0.15
    xima
    0.14
    auge
    0.14
    sher
    0.14
    ãģıãĤĮ
    0.14
    bie
    0.14
    .DropTable
    0.14
    Act Density 0.014%

    No Known Activations